Implement AtomicFAddEXT for the CUDA BE #2853

Closed
@AGindinson

Description

As of 4fdbfae, preparations are in place to switch the floating-point atomic fetch_add/fetch_sub implementations to the new SPIR-V instruction (AtomicFAddEXT). Providing a "native" implementation in the CUDA BE would let us use the same function for NVPTX targets as well (the #if !defined(__NVPTX__) guards would have to be removed to achieve this).
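For context, here is a minimal host-side C++ sketch of what a non-native floating-point fetch_add typically looks like: a compare-exchange loop. This is an illustration only. It assumes the emulated path is CAS-based, which this issue does not state, and the names `fetch_add_cas` and `fetch_sub_cas` are hypothetical, not functions from the code base. A native implementation would replace the loop with a single atomic add.

```cpp
#include <atomic>

// Hypothetical helper (not from the code base): emulates fetch_add on a
// float with a compare-exchange loop. Returns the value held before the add.
inline float fetch_add_cas(std::atomic<float> &obj, float operand) {
  float expected = obj.load(std::memory_order_relaxed);
  // On failure, compare_exchange_weak writes the current value into
  // `expected`, so each retry recomputes the sum from fresh data.
  while (!obj.compare_exchange_weak(expected, expected + operand,
                                    std::memory_order_relaxed)) {
  }
  return expected;
}

// Hypothetical helper: fetch_sub expressed as fetch_add of the negated
// operand, so one add primitive can back both operations.
inline float fetch_sub_cas(std::atomic<float> &obj, float operand) {
  return fetch_add_cas(obj, -operand);
}
```

Calling `fetch_add_cas(v, 2.5f)` on an atomic holding `1.0f` returns `1.0f` and leaves `3.5f` stored. Under contention the loop may retry several times, which is the cost a native atomic add would avoid.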

Labels

cuda (CUDA back-end), enhancement (New feature or request), performance (Performance related issues)
