Add support for copy_ for plain layout and tensor core tiled layout (#1791) #1804
37ee439 to 514f0cc
…1791) * Add support for copy_ for plain layout and tensor core tiled layout Summary: att, only support copy_ from AQT to another AQT with same metadata (shapes etc.) Tested int4wo, int8wo, int8dq Test Plan: python test/dtypes/test_affine_quantized.py -k test_copy_ Reviewers: Subscribers: Tasks: Tags: * remove print * add metadata mismatch test * rebase and add float8 * cutlass int4 support stack-info: PR: #1804, branch: drisspg/stack/40
cc @metascroy: is someone looking at the failing AO experimental tests?
thanks!
I think we need to skip the case "apply_quant = Int8DynamicActivationInt4WeightConfig(group_size=None, layout=CutlassInt4PackedLayout(), mapping_type=<MappingType.SYMMETRIC: 1>, act_mapping_type=<MappingType.SYMMETRIC: 1>)".
The experimental test failure might be a flaky test; I don't think it is related to this PR.
514f0cc to 18b7e16
dab1ead to d790d7d
d790d7d to 3a0fd37
3a0fd37 to 45067de
…1791) (#1804) * Add support for copy_ for plain layout and tensor core tiled layout Summary: att, only support copy_ from AQT to another AQT with same metadata (shapes etc.) Tested int4wo, int8wo, int8dq Test Plan: python test/dtypes/test_affine_quantized.py -k test_copy_ Reviewers: Subscribers: Tasks: Tags: * remove print * add metadata mismatch test * rebase and add float8 * cutlass int4 support stack-info: PR: #1804, branch: drisspg/stack/40 Co-authored-by: Jerry Zhang <[email protected]>
Stacked PRs:
Take two / running on ROCM to surface errors
Add support for copy_ for plain layout and tensor core tiled layout (#1791)
Summary:
As the title says. copy_ is only supported from one AQT (AffineQuantizedTensor) to another AQT with the same metadata (shapes etc.).
Tested with int4wo, int8wo, and int8dq.
Test Plan:
python test/dtypes/test_affine_quantized.py -k test_copy_
* remove print
* add metadata mismatch test
* rebase and add float8
* cutlass int4 support
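
The restriction described in the summary (copy_ only between two quantized tensors whose metadata match) can be illustrated with a small pure-Python sketch. This is not the torchao implementation: `ToyQuantizedTensor`, its fields, and the `_metadata` helper are hypothetical names invented for illustration, and plain lists stand in for the real tensor buffers.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ToyQuantizedTensor:
    """Hypothetical stand-in for a quantized tensor subclass (not torchao's AQT)."""

    data: List[int]         # quantized values
    scale: List[float]      # per-group scales
    shape: Tuple[int, ...]  # logical shape
    layout: str             # e.g. "plain" or "tensor_core_tiled"

    def _metadata(self):
        # Everything that must agree before an in-place copy is allowed.
        return (self.shape, self.layout, len(self.data), len(self.scale))

    def copy_(self, src: "ToyQuantizedTensor") -> "ToyQuantizedTensor":
        if not isinstance(src, ToyQuantizedTensor) or self._metadata() != src._metadata():
            raise ValueError("copy_ requires a source with matching metadata")
        # Overwrite the existing buffers in place so outside references stay valid.
        self.data[:] = src.data
        self.scale[:] = src.scale
        return self


dst = ToyQuantizedTensor([0, 0, 0, 0], [1.0], (2, 2), "plain")
src = ToyQuantizedTensor([1, 2, 3, 4], [0.5], (2, 2), "plain")
original_buffer = dst.data
dst.copy_(src)
same_buffer = original_buffer is dst.data  # the buffer object was reused

# A source with a different layout is rejected instead of being silently copied.
other = ToyQuantizedTensor([1, 2, 3, 4], [0.5], (2, 2), "tensor_core_tiled")
try:
    dst.copy_(other)
    mismatch_rejected = False
except ValueError:
    mismatch_rejected = True
```

Checking metadata before touching any buffer means a rejected copy leaves the destination unchanged, which is the behavior a metadata mismatch test would presumably want to assert.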