-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Closed
Labels
feature requestNew feature or request. This includes new model, dtype, functionality supportNew feature or request. This includes new model, dtype, functionality supporttriagedIssue has been triaged by maintainersIssue has been triaged by maintainers
Description
I'm running TensorRT-LLM on V100, when I enabled fmha with --enable_context_fmha,
I got this error message:
[TensorRT-LLM][ERROR] Assertion failed: Unsupported architecture (/home/build/TensorRT_LLM/TensorRT-LLM-master/cpp/tensorrt_llm/kernels/contextFusedMultiHeadAttention/fmhaRunner.cpp:87)
I checked the code of FusedMHARunnerV2, it seems sm70 and sm75 are not supported.
may I know why V100 is not supported for fmha? or is there any plan on the way?
Thanks!
cody-moveworks, hubble-bubble and xinqingguo
Metadata
Metadata
Labels
feature requestNew feature or request. This includes new model, dtype, functionality supportNew feature or request. This includes new model, dtype, functionality supporttriagedIssue has been triaged by maintainersIssue has been triaged by maintainers