Closed
Labels
cuda (CUDA back-end), enhancement (New feature or request), performance (Performance related issues)
Description
After 4fdbfae, preparations are in place to switch the floating-point atomic `fetch_add`/`fetch_sub` implementations to the new SPIR-V operand. Providing a "native" implementation in the CUDA back-end would let us use the same function for NVPTX targets as well (the `#if !defined(__NVPTX__)` guards would have to be removed to achieve this).
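
For context, here is a minimal host-side sketch of the kind of emulated path that a native implementation would make unnecessary: a floating-point `fetch_add` built from a compare-and-swap loop on the value's bit pattern. This is an illustration only, not the code from the repository; the names `emulated_fetch_add`, `to_bits` and `from_bits` are hypothetical, and it assumes a 32-bit `float`.

```cpp
#include <atomic>
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == sizeof(std::uint32_t), "sketch assumes 32-bit float");

// Hypothetical helpers: reinterpret a float as its 32-bit pattern and back.
inline std::uint32_t to_bits(float value) {
    std::uint32_t bits;
    std::memcpy(&bits, &value, sizeof(bits));
    return bits;
}

inline float from_bits(std::uint32_t bits) {
    float value;
    std::memcpy(&value, &bits, sizeof(value));
    return value;
}

// Hypothetical emulation of a floating-point fetch_add: retry a CAS on the
// integer bit pattern until no other thread has modified the cell in between.
// Returns the value held before the addition, like fetch_add does.
inline float emulated_fetch_add(std::atomic<std::uint32_t>& cell, float operand) {
    std::uint32_t expected = cell.load(std::memory_order_relaxed);
    for (;;) {
        const float old_value = from_bits(expected);
        const std::uint32_t desired = to_bits(old_value + operand);
        // On failure, compare_exchange_weak reloads `expected` with the current bits.
        if (cell.compare_exchange_weak(expected, desired, std::memory_order_relaxed)) {
            return old_value;
        }
    }
}
```

A native path replaces this retry loop with a single atomic instruction. In CUDA C++, for example, `atomicAdd(float*, float)` performs the addition directly on the device.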