Conversation

@vivekkhandelwal1
Collaborator

No description provided.

@pashu123
Member

LGTM! Are there any reasons why we aren't decomposing this torch op into another set of torch ops?

@vivekkhandelwal1
Collaborator Author

> LGTM! Are there any reasons why we aren't decomposing this torch op into another set of torch ops?

AFAIK, the reason for not doing that is that we want the attention op to be a single kernel. Hence, we lower it directly to tm_tensor.attention, and the rest is taken care of during codegen.
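
For readers outside the thread, here is a minimal PyTorch sketch of the trade-off being discussed. The shapes are illustrative, and the `enable_gqa` flag assumes a recent PyTorch (2.5+). The fused call is the aten op this patch lowers to tm_tensor.attention; the matmul/softmax/matmul sequence is the decomposition being avoided:

```python
import math
import torch
import torch.nn.functional as F

# Illustrative GQA shapes: 8 query heads sharing 2 KV heads.
q = torch.randn(1, 8, 16, 64)  # (batch, num_q_heads, seq_len, head_dim)
k = torch.randn(1, 2, 16, 64)  # (batch, num_kv_heads, seq_len, head_dim)
v = torch.randn(1, 2, 16, 64)

# Fused path: the single aten op that gets lowered to tm_tensor.attention.
# enable_gqa requires PyTorch >= 2.5.
fused = F.scaled_dot_product_attention(q, k, v, enable_gqa=True)

# The decomposition being avoided: repeat KV heads across each query
# group, then run matmul -> softmax -> matmul as separate torch ops,
# leaving codegen with several kernels instead of one attention kernel.
groups = q.shape[1] // k.shape[1]
k_rep = k.repeat_interleave(groups, dim=1)
v_rep = v.repeat_interleave(groups, dim=1)
scores = (q @ k_rep.transpose(-2, -1)) / math.sqrt(q.shape[-1])
decomposed = torch.softmax(scores, dim=-1) @ v_rep

# Both paths agree up to numerical tolerance.
torch.testing.assert_close(fused, decomposed, atol=1e-4, rtol=1e-4)
```

Keeping the op fused means the backend sees the whole attention computation at once and can emit a single optimized kernel, which is why the lowering targets tm_tensor.attention rather than a set of smaller torch ops.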

@vivekkhandelwal1 merged commit 25aa0c6 into llvm:main on Feb 5, 2025
3 checks passed
@vivekkhandelwal1 deleted the aten-sdpa-gqa branch on February 5, 2025 05:28