Rocm mx gemm #5
base: main
Conversation
…dation logic. Added MXFP8_HIPBLASLT recipe and adjusted mx_mm function to accommodate new kernel options.
…ASLT kernel choice for mxfp8 gemm. Enhance documentation on end-to-end performance optimization efforts for AMD GPUs.
…py to include HIPBLASLT as a valid kernel choice for MX FP8 operations.
bugbot run
Bug: Outdated Assertion Message for MX FP8 Operations
The assertion error message "CUBLAS is the only supported kernel choice for MX FP8 operations" is incorrect. The code's assertion logic now accepts both CUBLAS and HIPBLASLT kernel choices for MX FP8 operations. This outdated message misleads users about supported kernel choices and should be updated.
torchao/prototype/mx_formats/mx_ops.py, lines 109 to 113 (commit 129a6d6):
    assert b._elem_dtype == torch.float8_e4m3fn
    assert gemm_choice in (
        MXGemmKernelChoice.CUBLAS,
        MXGemmKernelChoice.HIPBLASLT,
    ), "CUBLAS is the only supported kernel choice for MX FP8 operations"
… HIPBLASLT are supported kernel choices for MX FP8 operations.
…l choices for MX FP8 operations.
Co-authored-by: Copilot <[email protected]>
- Introduced `is_ROCm_mx_supported` function to verify ROCm environment compatibility for MX operations.
- Added `test_hipblaslt_fp8` to validate FP8 operations using the HIPBLASLT backend, including SQNR verification for output accuracy.
- Updated imports in `test_mx_mm.py` to include necessary utilities for the new test.
- Replaced `compute_sqnr` with `compute_error` for improved accuracy in error measurement.
- Updated assertion to ensure output accuracy meets the specified threshold.
- Updated the function to ensure `torch.version.hip` is not None before checking the version, improving robustness against potential NoneType errors.
- Reformatted the return statement to enhance clarity and maintainability of the code.
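Taken together, these commits suggest a ROCm gate roughly like the sketch below. It is a minimal sketch only: it guards `torch.version.hip` against `None` (per the robustness fix above) and checks for ROCm 6.5+ as stated in the PR description; any additional GPU-architecture checks in the real `is_ROCm_mx_supported` are not shown.

```python
import torch

def is_ROCm_mx_supported() -> bool:
    # Not a ROCm build of PyTorch (torch.version.hip is None on CUDA/CPU builds),
    # so bail out before touching the version string.
    if torch.version.hip is None:
        return False
    if not torch.cuda.is_available():
        return False
    # torch.version.hip looks like "6.5.<patch>"; compare major.minor against 6.5.
    major, minor = (int(part) for part in torch.version.hip.split(".")[:2])
    return (major, minor) >= (6, 5)
```

`test_hipblaslt_fp8` would then be skipped when this returns False; its accuracy check presumably compares the HIPBLASLT output against a reference result via `compute_error` (SQNR) and asserts the result exceeds the chosen threshold.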
This pull request introduces support for AMD MI355x GPUs with ROCm 6.5+ and HIPBLASLT for MX gemm operations, alongside updates to the documentation and validation logic. The changes expand the framework's compatibility and functionality for AMD hardware while maintaining support for existing NVIDIA configurations.
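For users, opting into the new path would look roughly like the snippet below. This is a hypothetical usage sketch: it assumes `from_recipe_name` is exposed on the existing `MXLinearConfig` class in `config.py` and accepts the new recipe enum member; the surrounding model-conversion code is omitted.

```python
from torchao.prototype.mx_formats.config import MXLinearConfig, MXLinearRecipeName

# Select the new HIPBLASLT-backed MX FP8 recipe added in this PR
# (assumes from_recipe_name lives on MXLinearConfig and takes the enum member).
config = MXLinearConfig.from_recipe_name(MXLinearRecipeName.MXFP8_HIPBLASLT)
```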
AMD Support Enhancements:
- `torchao/prototype/mx_formats/config.py`: Added `HIPBLASLT` as a new kernel choice in `MXGemmKernelChoice` and `MXFP8_HIPBLASLT` in `MXLinearRecipeName`. Extended `_validate_gemm_kernel_choice` to include validation for `HIPBLASLT`, checking the required block size, data types, and ROCm availability, and updated `from_recipe_name` to handle `MXFP8_HIPBLASLT` (sketched below).
- `torchao/prototype/mx_formats/mx_ops.py`: Updated `_addmm_mx_dispatch` to support `HIPBLASLT` for matrix-matrix operations in addition to the existing kernel choices. [1] [2]
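A rough sketch of the shape of those `config.py` additions; the enum values, the specific block-size requirement, and the error messages are assumptions rather than the PR's exact code:

```python
from enum import Enum

import torch

class MXGemmKernelChoice(Enum):
    EMULATED = "emulated"
    CUBLAS = "cublas"
    HIPBLASLT = "hipblaslt"  # new ROCm-backed kernel choice

class MXLinearRecipeName(Enum):
    MXFP8_CUBLAS = "mxfp8_cublas"
    MXFP8_HIPBLASLT = "mxfp8_hipblaslt"  # new recipe name

def _validate_gemm_kernel_choice(gemm_kernel_choice, block_size, elem_dtype):
    if gemm_kernel_choice == MXGemmKernelChoice.HIPBLASLT:
        # HIPBLASLT is only usable on a ROCm build of PyTorch.
        assert torch.version.hip is not None, (
            "MXGemmKernelChoice.HIPBLASLT requires ROCm"
        )
        # Assumed constraints: the same block size and FP8 dtype the CUBLAS path uses.
        assert block_size == 32, (
            f"block_size must be 32 to use HIPBLASLT, got {block_size}"
        )
        assert elem_dtype == torch.float8_e4m3fn, (
            f"elem_dtype must be torch.float8_e4m3fn to use HIPBLASLT, got {elem_dtype}"
        )
```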
Documentation Updates:
- `torchao/prototype/mx_formats/README.md`: