PenghuiCheng (Contributor) commented Dec 9, 2022

Signed-off-by: Cheng, Penghui [email protected]

Type of Change

API changed

Description

Fixed a pruning and distillation bug in the new API.
Removed the neural_compressor.quantization class.

Expected Behavior & Potential Risk

Pruning and distillation work with the new API.

How has this PR been tested?

Tested locally.

chensuyue (Contributor) commented Dec 9, 2022

This needs to be split into two PRs: one for the new API update, which should be merged today, and one for the new API examples, which require extension tests.

PenghuiCheng force-pushed the penghuic/pruning_distillation_bug branch from 846c45f to b161319 on December 9, 2022 09:37
chensuyue added the "enhancement" (New feature or request) and "review" labels Dec 9, 2022
chensuyue added this to the v2.0 milestone Dec 9, 2022
PenghuiCheng changed the title from "Fixed pruning and distillation bug and update examples" to "Fixed pruning and distillation bug and remove invlid code" Dec 9, 2022
PenghuiCheng changed the title from "Fixed pruning and distillation bug and remove invlid code" to "Fixed pruning and distillation bug and remove invalid code" Dec 9, 2022
Signed-off-by: Cheng, Penghui <[email protected]>
Signed-off-by: Cheng, Penghui <[email protected]>
Signed-off-by: Cheng, Penghui <[email protected]>
chensuyue merged commit a230726 into master Dec 9, 2022
chensuyue deleted the penghuic/pruning_distillation_bug branch December 9, 2022 16:40
intel-zhangyi pushed a commit that referenced this pull request Dec 13, 2022
zehao-intel pushed a commit that referenced this pull request Dec 20, 2022
xin3he pushed a commit that referenced this pull request Jul 15, 2025
* Added post-forward hook to dump measurements according to the samples counter
* Added support for the samples counter in the config
* Removed a function in RowParallelLinear, as it was removed from the upstream vLLM code
* Currently only the blocking method is operational; the async methods will be completed in a future commit

* Fix CR comments

* Remove unused files

* Add resolve_input method

It can't be defined in vLLM due to upstream considerations,
so it is copied here

* Fixed logging according to CR

* Fixed resolve_input and moved the hook function
XuehaoSun pushed a commit that referenced this pull request Jul 19, 2025
yiliu30 pushed a commit that referenced this pull request Jul 22, 2025
Labels

enhancement New feature or request review

4 participants