-
Notifications
You must be signed in to change notification settings - Fork 281
add examples for GPTJ #162
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Signed-off-by: Wang, Chang1 <[email protected]>
Signed-off-by: changwa1 <[email protected]>
5a5a600 to
8c57ebe
Compare
Contributor
|
what's the perf? |
Signed-off-by: changwa1 <[email protected]>
Signed-off-by: changwa1 <[email protected]>
Contributor
Author
Signed-off-by: changwa1 <[email protected]>
Contributor
Author
Signed-off-by: changwa1 <[email protected]>
Contributor
Author
PenghuiCheng
approved these changes
Dec 2, 2022
Contributor
|
Pls update in readme model list: https://github.com/intel/neural-compressor/blob/wangchang/gptj/examples/README.md |
Signed-off-by: changwa1 <[email protected]>
Signed-off-by: changwa1 <[email protected]>
chensuyue
reviewed
Dec 2, 2022
...pytorch/nlp/huggingface_models/language-modeling/quantization/ptq_static/fx/requirements.txt
Show resolved
Hide resolved
chensuyue
approved these changes
Dec 2, 2022
lvliang-intel
pushed a commit
that referenced
this pull request
Dec 5, 2022
Signed-off-by: Wang, Chang1 <[email protected]> Signed-off-by: Lv, Liang1 <[email protected]>
xin3he
pushed a commit
that referenced
this pull request
Dec 5, 2022
Signed-off-by: Wang, Chang1 <[email protected]>
yiliu30
pushed a commit
that referenced
this pull request
Dec 7, 2022
Signed-off-by: Wang, Chang1 <[email protected]> Signed-off-by: yiliu30 <[email protected]>
zehao-intel
pushed a commit
that referenced
this pull request
Dec 9, 2022
Signed-off-by: Wang, Chang1 <[email protected]> Signed-off-by: zehao-intel <[email protected]>
zehao-intel
pushed a commit
that referenced
this pull request
Dec 20, 2022
Signed-off-by: Wang, Chang1 <[email protected]> Signed-off-by: zehao-intel <[email protected]>
VincyZhang
pushed a commit
that referenced
this pull request
Feb 12, 2023
yiliu30
pushed a commit
that referenced
this pull request
Apr 5, 2025
…ons (#162) * [SW-219831] - Set scale attributes in INC to reduce grpah recompilation * add scaling methods ids * fix scaling method ids check and set * enable feature also for Load QuantMode * move scale tensors to cpu when feature is enabled * fix scaling methods ids to start at 1 * fix cr comments * remove unnecessary imports * fix cr comments * fix more cr comments * fix cr comments * move scale to float on cpu in scale handler for dynamic scaling * fix cr comments * Add unit test * fix sending scale tensor to bridge and unit-test bug
mengniwang95
pushed a commit
that referenced
this pull request
Apr 15, 2025
…ons (#162) * [SW-219831] - Set scale attributes in INC to reduce grpah recompilation * add scaling methods ids * fix scaling method ids check and set * enable feature also for Load QuantMode * move scale tensors to cpu when feature is enabled * fix scaling methods ids to start at 1 * fix cr comments * remove unnecessary imports * fix cr comments * fix more cr comments * fix cr comments * move scale to float on cpu in scale handler for dynamic scaling * fix cr comments * Add unit test * fix sending scale tensor to bridge and unit-test bug
xin3he
pushed a commit
that referenced
this pull request
Apr 22, 2025
…ons (#162) * [SW-219831] - Set scale attributes in INC to reduce grpah recompilation * add scaling methods ids * fix scaling method ids check and set * enable feature also for Load QuantMode * move scale tensors to cpu when feature is enabled * fix scaling methods ids to start at 1 * fix cr comments * remove unnecessary imports * fix cr comments * fix more cr comments * fix cr comments * move scale to float on cpu in scale handler for dynamic scaling * fix cr comments * Add unit test * fix sending scale tensor to bridge and unit-test bug
XuehaoSun
pushed a commit
that referenced
this pull request
May 13, 2025
…ons (#162) * [SW-219831] - Set scale attributes in INC to reduce grpah recompilation * add scaling methods ids * fix scaling method ids check and set * enable feature also for Load QuantMode * move scale tensors to cpu when feature is enabled * fix scaling methods ids to start at 1 * fix cr comments * remove unnecessary imports * fix cr comments * fix more cr comments * fix cr comments * move scale to float on cpu in scale handler for dynamic scaling * fix cr comments * Add unit test * fix sending scale tensor to bridge and unit-test bug
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Signed-off-by: Wang, Chang1 [email protected]
Type of Change
feature or bug fix or documentation or validation or others
API changed or not
Description
detail description
JIRA ticket: xxx
Expected Behavior & Potential Risk
the expected behavior that triggered by this PR
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed