
Why is FMHA not supported on V100 and T4? #320

Description

@jiangsongHW

I'm running TensorRT-LLM on a V100. When I enable FMHA with --enable_context_fmha,
I get this error message:
[TensorRT-LLM][ERROR] Assertion failed: Unsupported architecture (/home/build/TensorRT_LLM/TensorRT-LLM-master/cpp/tensorrt_llm/kernels/contextFusedMultiHeadAttention/fmhaRunner.cpp:87)

I checked the code of FusedMHARunnerV2, and it seems that sm70 and sm75 are not supported.
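
For anyone hitting the same assertion, here is a minimal sketch of checking the GPU's compute capability up front before deciding whether to pass --enable_context_fmha. It uses only the standard CUDA runtime API, not TensorRT-LLM internals, and the SM >= 80 threshold is an assumption inferred from sm70 (V100) and sm75 (T4) being rejected, not taken from the TensorRT-LLM source.

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    // Query the properties of device 0 via the CUDA runtime.
    cudaDeviceProp prop{};
    if (cudaGetDeviceProperties(&prop, /*device=*/0) != cudaSuccess) {
        std::fprintf(stderr, "Failed to query device 0\n");
        return 1;
    }

    // Combine major/minor into the usual SM number, e.g. 70 for V100, 75 for T4.
    int const sm = prop.major * 10 + prop.minor;
    std::printf("Detected SM %d\n", sm);

    // Assumed threshold: context FMHA appears to require SM 80 or newer.
    if (sm < 80) {
        std::printf("Context FMHA is likely unsupported on this GPU; "
                    "build without --enable_context_fmha\n");
    } else {
        std::printf("Context FMHA should be available on this GPU\n");
    }
    return 0;
}
```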

May I know why V100 is not supported for FMHA? Or is support planned?

Thanks!

Labels

feature request (New feature or request. This includes new model, dtype, functionality support), triaged (Issue has been triaged by maintainers)
