drisspg (Contributor) commented Feb 28, 2025

Stacked PRs:


Take two: running on ROCm to surface errors.

Add support for copy_ for plain layout and tensor core tiled layout (#1791)

  • Add support for copy_ for plain layout and tensor core tiled layout

Summary:
As the title says. copy_ is only supported from one AQT (AffineQuantizedTensor) to another AQT with the same metadata (shapes etc.).

Tested int4wo, int8wo, int8dq

Test Plan:
python test/dtypes/test_affine_quantized.py -k test_copy_

  • remove print

  • add metadata mismatch test

  • rebase and add float8

  • cutlass int4 support
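
The restriction described in the summary (copy_ only between tensors with the same metadata) can be illustrated with a small pure-Python sketch. This is not the torchao implementation: `ToyQuantizedTensor`, its fields, and the choice of which fields count as "metadata" are all hypothetical, picked only to show the shape of the check.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ToyQuantizedTensor:
    """Hypothetical stand-in for a quantized tensor; not the torchao class."""

    int_data: List[int]            # quantized payload
    scale: float                   # dequantization scale
    shape: Tuple[int, ...]         # logical shape
    block_size: Tuple[int, ...]    # quantization granularity
    quant_range: Tuple[int, int]   # (quant_min, quant_max)

    def _metadata(self):
        # Assumption: these fields must agree before an in-place copy is allowed.
        return (self.shape, self.block_size, self.quant_range)

    def copy_(self, src: "ToyQuantizedTensor") -> "ToyQuantizedTensor":
        # Only quantized-to-quantized copies are supported in this sketch.
        if not isinstance(src, ToyQuantizedTensor):
            raise TypeError("copy_ expects another ToyQuantizedTensor")
        # Reject the copy if the metadata differs.
        if self._metadata() != src._metadata():
            raise ValueError("copy_ requires matching metadata")
        # Overwrite the existing buffer so other references see the update.
        self.int_data[:] = src.int_data
        self.scale = src.scale
        return self


# Usage: a copy between matching tensors updates dst in place.
dst = ToyQuantizedTensor([0, 0, 0, 0], 1.0, (2, 2), (1, 2), (-8, 7))
src = ToyQuantizedTensor([1, -2, 3, -4], 0.5, (2, 2), (1, 2), (-8, 7))
dst.copy_(src)
```

Writing into the existing buffer rather than rebinding it keeps any other reference to the destination's data valid, which is the usual expectation for an in-place op.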

pytorch-bot bot commented Feb 28, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1804

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

drisspg pushed a commit that referenced this pull request Feb 28, 2025
stack-info: PR: #1804, branch: drisspg/stack/40
facebook-github-bot added the CLA Signed label Feb 28, 2025
drisspg requested a review from jainapurva February 28, 2025 23:43
drisspg added the topic: not user facing label Feb 28, 2025
drisspg (Contributor, Author) commented Mar 1, 2025

cc @metascroy is someone looking at the failing AO experimental tests?

jerryzh168 (Contributor) left a comment

Thanks!

I think we need to skip "apply_quant = Int8DynamicActivationInt4WeightConfig(group_size=None, layout=CutlassInt4PackedLayout(), mapping_type=<MappingType.SYMMETRIC: 1>, act_mapping_type=<MappingType.SYMMETRIC: 1>)".

The experimental test failure might be a flaky test; I don't think it's related to this PR.

drisspg force-pushed the drisspg/stack/40 branch from 514f0cc to 18b7e16, March 4, 2025 23:15
drisspg pushed a commit that referenced this pull request Mar 4, 2025
drisspg force-pushed the drisspg/stack/40 branch 2 times, most recently from dab1ead to d790d7d, March 5, 2025 01:29
drisspg pushed a commit that referenced this pull request Mar 5, 2025
drisspg force-pushed the drisspg/stack/40 branch from d790d7d to 3a0fd37, March 5, 2025 06:04
drisspg force-pushed the drisspg/stack/40 branch from 3a0fd37 to 45067de, March 5, 2025 06:07
drisspg merged commit 661a43d into main Mar 5, 2025
6 of 18 checks passed
liangel-02 pushed a commit that referenced this pull request Aug 25, 2025

Co-authored-by: Jerry Zhang <[email protected]>
Labels: ciflow/rocm, CLA Signed, topic: not user facing
