Closed
Description
After 4fdbfae, there are preparations to switch atomic fetch_add
/fetch_sub
FP implementations to using the new SPIR-V operand. Providing a "native" implementation in the CUDA BE would enable us to use the leveraged function for NVPTX targets as well (#if !defined(__NVPTX__)
macros would have to be removed to achieve this).