
Conversation

sunjiweiswift commented Nov 3, 2025

NHD: the last 3 dimensions are organized as (seq_len, num_heads, head_dim).
HND: the last 3 dimensions are organized as (num_heads, seq_len, head_dim).

In VLLM/sglang, NHD is the more commonly used format. Support for NHD has been added in this PR.
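
For illustration, a minimal sketch (not code from this PR) of how the same logical element maps to different linear offsets under the two layouts, assuming a contiguous 4-D tensor with a leading batch dimension:

```cpp
#include <cstddef>
#include <cstdio>

struct Shape { std::size_t batch, seq_len, num_heads, head_dim; };

// NHD: memory order (batch, seq_len, num_heads, head_dim)
std::size_t offset_nhd(const Shape& s, std::size_t b, std::size_t h,
                       std::size_t t, std::size_t d) {
  return ((b * s.seq_len + t) * s.num_heads + h) * s.head_dim + d;
}

// HND: memory order (batch, num_heads, seq_len, head_dim)
std::size_t offset_hnd(const Shape& s, std::size_t b, std::size_t h,
                       std::size_t t, std::size_t d) {
  return ((b * s.num_heads + h) * s.seq_len + t) * s.head_dim + d;
}

int main() {
  Shape s{1, 4, 2, 8};
  // Same logical element (batch 0, head 1, token 2, dim 3),
  // different linear offsets depending on the layout:
  std::printf("NHD: %zu\n", offset_nhd(s, 0, 1, 2, 3)); // 43
  std::printf("HND: %zu\n", offset_hnd(s, 0, 1, 2, 3)); // 51
}
```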

sunjiweiswift marked this pull request as ready for review November 4, 2025 04:29
sunjiweiswift (Author):

@petercad please review

sunjiweiswift changed the title from NHD v1.0 to NHD layout on Nov 4, 2025
sunjiweiswift (Author):

@jiyang1011 @taozha2 @tdeng5 please review

tdeng5 requested review from jiyang1011 and tdeng5 November 5, 2025 03:39
Antonyvance requested a review from Copilot November 5, 2025 07:29

Copilot AI left a comment

Pull Request Overview

This PR adds support for NHD (seq_len, num_heads, head_dim) layout in addition to the existing HND (num_heads, seq_len, head_dim) layout for the BMG flash attention example. The NHD layout is commonly used in VLLM/sglang frameworks and is set as the new default.

Key changes:

  • Added --layout command-line option with validation for "NHD" and "HND" values
  • Updated stride calculations to support both layout formats (see the sketch after this list)
  • Modified the verification function to handle layout-specific tensor indexing and data reordering
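
A minimal sketch of what the layout-dependent stride selection could look like (the Strides struct and make_strides function are hypothetical, not the PR's actual code):

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>

// Strides in elements for one (seq_len, num_heads, head_dim) tensor;
// the head_dim (innermost) stride is 1 in both layouts.
struct Strides { std::size_t batch, head, seq; };

Strides make_strides(const std::string& layout, std::size_t seq_len,
                     std::size_t num_heads, std::size_t head_dim) {
  if (layout == "NHD")  // (seq_len, num_heads, head_dim): heads adjacent per token
    return {seq_len * num_heads * head_dim, head_dim, num_heads * head_dim};
  if (layout == "HND")  // (num_heads, seq_len, head_dim): tokens adjacent per head
    return {num_heads * seq_len * head_dim, seq_len * head_dim, head_dim};
  throw std::invalid_argument("--layout must be \"NHD\" or \"HND\"");
}
```

Under NHD, consecutive tokens of one head sit num_heads * head_dim elements apart, which is why the kernel's stride setup has to vary with the layout rather than being fixed.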

sunjiweiswift (Author):

@copilot open a new pull request to apply changes based on the comments in this thread

sunjiweiswift (Author):

@rolandschulz please review and merge

hshen14 commented Nov 6, 2025

@sunjiweiswift did you observe any perf gains using HND over NHD when low precision, e.g. FP8, is enabled? FlashInfer says it's more friendly for GPU implementation.

sunjiweiswift (Author):

@hshen14 There is not much difference in the BF16 case; FP8 and other low-precision types are not currently supported.
