@peiying779
Summary:
Added the argument "use_persistent" (default: False), which explicitly selects the persistent kernel instead of the non-persistent kernel.
An error is raised when both "use_persistent" and "no_use_persistent" are specified in the arguments.
Example usage:
Persistent kernel:
buck2 run mode/{dev-nosan,amd-gpu} -c xlog.level=WARNING -m ovr_config//triton:trunk -m rocm7 -c fbcode.nvcc_arch=mi350 -c fbcode.enable_gpu_sections=true pytorch/tritonbench:run -- --op fp8_gemm_rowwise --no_use_tma --use_persistent
Non-persistent kernel:
buck2 run mode/{dev-nosan,amd-gpu} -c xlog.level=WARNING -m ovr_config//triton:trunk -m rocm7 -c fbcode.nvcc_arch=mi350 -c fbcode.enable_gpu_sections=true pytorch/tritonbench:run -- --op fp8_gemm_rowwise --no_use_tma --no_use_persistent

When both are specified in the arguments:
buck2 run mode/{dev-nosan,amd-gpu} -c xlog.level=WARNING -m ovr_config//triton:trunk -m rocm7 -c fbcode.nvcc_arch=mi350 -c fbcode.enable_gpu_sections=true pytorch/tritonbench:run -- --op fp8_gemm_rowwise --no_use_tma --use_persistent --no_use_persistent
This raises the error: Cannot specify both '--use_persistent' and '--no_use_persistent' at the same time. These options are mutually exclusive. Please use only one.
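
For readers who want to see the flag behavior in isolation, here is a minimal sketch of how such a check could look with Python's standard argparse module. It is not the code from this diff: the helper name parse_persistent_flags and the choice of ValueError are assumptions made for illustration, and only the two flag names, the default of False, and the error text come from the summary above.

```python
import argparse

# Error text copied from the summary above.
ERROR_MESSAGE = (
    "Cannot specify both '--use_persistent' and '--no_use_persistent' "
    "at the same time. These options are mutually exclusive. "
    "Please use only one."
)


def parse_persistent_flags(argv):
    """Hypothetical helper: return True if the persistent kernel is requested.

    Assumption: both flags are plain boolean switches that default to False,
    so passing neither flag selects the non-persistent kernel.
    """
    # allow_abbrev=False keeps unrelated flags from matching by prefix.
    parser = argparse.ArgumentParser(add_help=False, allow_abbrev=False)
    parser.add_argument("--use_persistent", action="store_true", default=False)
    parser.add_argument("--no_use_persistent", action="store_true", default=False)

    # Other operator flags (e.g. --op, --no_use_tma) are ignored here.
    args, _unknown = parser.parse_known_args(argv)

    if args.use_persistent and args.no_use_persistent:
        # Assumption: the real code may raise a different exception type.
        raise ValueError(ERROR_MESSAGE)
    return args.use_persistent
```

Calling parse_persistent_flags(["--op", "fp8_gemm_rowwise", "--no_use_tma", "--use_persistent"]) selects the persistent kernel, while passing both flags raises the error quoted above. An explicit check is used rather than argparse's built-in mutually exclusive group so that the custom message can be reported.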

Reviewed By: jwfromm

Differential Revision: D86579911

@meta-cla meta-cla bot added the cla signed label Nov 14, 2025
meta-codesync bot commented Nov 14, 2025

@peiying779 has exported this pull request. If you are a Meta employee, you can view the originating Diff in D86579911.

peiying779 pushed a commit to peiying779/FBGEMM-1 that referenced this pull request Nov 14, 2025
X-link: facebookresearch/FBGEMM#2130

facebook-github-bot pushed a commit to meta-pytorch/tritonbench that referenced this pull request Nov 14, 2025
X-link: pytorch/FBGEMM#5129

X-link: facebookresearch/FBGEMM#2130

facebook-github-bot pushed a commit to meta-pytorch/tritonbench that referenced this pull request Nov 19, 2025
X-link: pytorch/FBGEMM#5129

X-link: facebookresearch/FBGEMM#2130

peiying779 pushed a commit to peiying779/FBGEMM-1 that referenced this pull request Nov 19, 2025
X-link: facebookresearch/FBGEMM#2130
