[SYCL] Specialize atomic fetch_add for floating point types #2765

AGindinson · 2020-11-12T07:07:19Z

The new EXT/SPV_EXT_shader_atomic_float_add SPIR-V extension
allows us to further specialize atomic::fetch_add() for
floating point types. In device mode, we'll now be creating
an external call to a built-in-like __spirv_AtomicFAddEXT().
This is similar to what is done for other atomic binary
instructions, e.g. the integer specialization of fetch_add()
being mapped onto __spirv_AtomicIAdd().

Furthermore, atomic::fetch_sub() is also re-implemented
to use __spirv_AtomicFAddEXT(), the added operand being
a negation of the original one.

The new implementation can be exposed if a dedicated macro is
defined: SYCL_USE_NATIVE_FP_ATOMICS. Otherwise, a fallback
is used, where the atomic operation is done via spinlock emulation.
At the moment of committing this, only Intel GPUs support the
"native" implementation, which relies on a SPIR-V extension.

Tests for the feature have been finalized in
intel/llvm-test-suite#104.

Signed-off-by: Artem Gindinson artem.gindinson@intel.com

sycl/include/CL/sycl/ONEAPI/atomic_ref.hpp

Pennycook

The changes look good to me.

One quick question to make sure I understand: am I right in thinking that we now default to the native FAdd implementation for all our devices? So the old compare-exchange implementation will now only be tested for CUDA devices?

@bader We should add a TODO somewhere to implement AtomicFAddEXT in the CUDA backend.

bader · 2020-11-30T18:14:35Z

We should add a TODO somewhere to implement AtomicFAddEXT in the CUDA backend.

I suggest adding a new issue with "cuda" tag - https://github.com/intel/llvm/issues.

AGindinson · 2020-11-30T18:23:30Z

One quick question to make sure I understand: am I right in thinking that we now default to the native FAdd implementation for all our devices? So the old compare-exchange implementation will now only be tested for CUDA devices?

Yes, this is aimed for all device BEs. I'll create an issue for CUDA once the extension support is pulled down into this repository.

AGindinson · 2020-12-08T20:59:02Z

@Pennycook, could you please check if __SYCL_EMULATE_NATIVE_ATOMICS__ is a fine name for the macro (as introduced by d8712b5)?

Also, should the IR checks in test/atomic_ref/ be updated to include the emulated checks under the macro defined, or would it be sufficient to update github.com/intel/llvm-test-suite to use the macro for non-supporting targets and make sure all tests pass?

AGindinson · 2020-12-08T20:59:18Z

/summary:run

sycl/test/atomic_ref/sub.cpp

Pennycook · 2020-12-09T14:46:02Z

@Pennycook, could you please check if __SYCL_EMULATE_NATIVE_ATOMICS__ is a fine name for the macro (as introduced by d8712b5)?

That commit says __SYCL_EMULATE_FLOAT_ATOMICS__ rather than __SYCL_EMULATE_NATIVE_ATOMICS__. But I think __SYCL_EMULATE_FLOAT_ATOMICS__ is a good name.

Also, should the IR checks in test/atomic_ref/ be updated to include the emulated checks under the macro defined, or would it be sufficient to update github.com/intel/llvm-test-suite to use the macro for non-supporting targets and make sure all tests pass?

Good question. If we're documenting __SYCL_EMULATE_FLOAT_ATOMICS__ then I think we need to update the tests -- a device that implements AtomicFAdd would need to run correctly with and without the macro enabled, and so we should test that to ensure we don't accidentally break things. If we're not documenting __SYCL_EMULATE_FLOAT_ATOMICS__, and it's only a temporary implementation detail, I think it's sufficient to ensure that the existing tests pass when the macro is set to the expected value.

AGindinson · 2020-12-09T15:07:58Z

That commit says SYCL_EMULATE_FLOAT_ATOMICS rather than SYCL_EMULATE_NATIVE_ATOMICS. But I think SYCL_EMULATE_FLOAT_ATOMICS is a good name.

That was an unfortunate typo on my side, sorry for that.

A device that implements AtomicFAdd would need to run correctly with and without the macro enabled.

Fair enough. And yet I tend to agree that:

it's only a temporary implementation detail

My personal approach would be to rely on the E2E tests passing with correct macro presence/absence. Should there be a lack of consensus among the reviewers, though, I'm prepared to lay this back and make all testing stages as detailed as possible. How would you recommend treating this? @AlexeySotkin, @AlexeySachkov, could you comment as well, please?

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

New macro name is __SYCL_USE_NATIVE_FP_ATOMICS__ Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

…here yet Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Make the user-exposed macro consistent with naming standards for C++ libraries, ours in particular. https://eel.is/c++draft/lex.name Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

AGindinson · 2021-01-21T12:58:58Z

Updates:

Switched to using the emulated implementation as default.
Renamed the macro correspondingly - it's SYCL_USE_NATIVE_FP_ATOMICS from now on. This is reflected in the test-suite PR: Address the atomics' macro rework: emulation runs by default llvm-test-suite#104
Removed the changes from atomic.hpp, as this would've regressed the common atomic class for other targets. atomic_ref remains as the go-to class for native atomics.

romanovvlad

LGTM

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Minor implementation details aside, this is a follow-up to intel#2765. The end-to-end tests are already done, the latest update being intel/llvm-test-suite#118. Signed-off-by: Artem Gindinson <amgindinson@gmail.com>

Minor implementation details aside, this is a follow-up to #2765. The end-to-end tests are already done, the latest update being intel/llvm-test-suite#118. Signed-off-by: Artem Gindinson <amgindinson@gmail.com>

Original commit: KhronosGroup/SPIRV-LLVM-Translator@e8fce056867bb1b

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch 3 times, most recently from 9668b1b to d141d43 Compare November 12, 2020 11:11

AlexeySotkin self-requested a review November 12, 2020 12:08

AlexeySotkin reviewed Nov 26, 2020

View reviewed changes

sycl/include/CL/sycl/ONEAPI/atomic_ref.hpp Show resolved Hide resolved

AlexeySachkov requested a review from Pennycook November 26, 2020 18:49

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch 3 times, most recently from c7961b1 to 7579487 Compare November 30, 2020 12:57

AGindinson mentioned this pull request Nov 30, 2020

[SYCL][Test] Add IR checks into atomic tests #2834

Merged

Pennycook previously approved these changes Nov 30, 2020

View reviewed changes

AGindinson dismissed Pennycook’s stale review via 8fd82f0 December 2, 2020 12:41

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch from 7579487 to 8fd82f0 Compare December 2, 2020 12:41

AGindinson changed the title ~~[DRAFT][SYCL] Support atomic float add in headers & SPIR-V~~ [SYCL] Specialize atomic fetch_add for floating point types Dec 2, 2020

AGindinson mentioned this pull request Dec 2, 2020

Implement AtomicFAddEXT for the CUDA BE #2853

Closed

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch from 02e8042 to d8712b5 Compare December 8, 2020 20:49

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch from 56da357 to fc3ffef Compare December 9, 2020 09:35

AlexeySotkin reviewed Dec 9, 2020

View reviewed changes

sycl/test/atomic_ref/sub.cpp Show resolved Hide resolved

AGindinson mentioned this pull request Dec 30, 2020

Configure usage of native atomic float add/sub functions intel/llvm-test-suite#86

Merged

AGindinson marked this pull request as ready for review January 13, 2021 13:50

AGindinson requested a review from a team as a code owner January 13, 2021 13:50

AGindinson requested a review from v-klochkov January 13, 2021 13:50

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch from fc3ffef to f7a585a Compare January 13, 2021 14:32

AGindinson requested a review from Pennycook January 13, 2021 14:40

Artem Gindinson added 11 commits January 21, 2021 15:54

Provide a general macro for float atomics' emulation

28a3a1b

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

[SYCL][Test] Configure atomic float tests for CUDA/CPU

d6a5208

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Address review comments: fetch_sub in atomic.hpp, RUN lines' cleanup

3beb65a

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Fix the negation subject in the fetch_sub implementation

a9d8d88

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Remove the "emulated" atomics usage for the HOST tests

1e2ff47

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Mark the tests unsupported on CUDA

73b5b5d

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Edit the UNSUPPORTED placement & content

da6edc5

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Update tests to check the IR for spinlock emulation behavior

212aa4d

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Switch to using the emulation by default

0b178bc

New macro name is __SYCL_USE_NATIVE_FP_ATOMICS__ Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Revert changes in atomic.hpp - the new implementation can't be used t…

342a179

…here yet Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

Remove double underscore from the macro name

be71125

Make the user-exposed macro consistent with naming standards for C++ libraries, ours in particular. https://eel.is/c++draft/lex.name Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

AGindinson force-pushed the private/agindins/atomic-fadd-draft branch from e484016 to be71125 Compare January 21, 2021 12:55

romanovvlad previously approved these changes Jan 21, 2021

View reviewed changes

Switch to XFAIL for CUDA

906c413

Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>

AGindinson dismissed romanovvlad’s stale review via 906c413 January 21, 2021 13:18

AGindinson requested a review from romanovvlad January 21, 2021 13:19

AlexeySachkov approved these changes Jan 21, 2021

View reviewed changes

AlexeySotkin approved these changes Jan 21, 2021

View reviewed changes

romanovvlad approved these changes Jan 21, 2021

View reviewed changes

Pennycook approved these changes Jan 21, 2021

View reviewed changes

bader merged commit 37a9a2a into intel:sycl Jan 21, 2021

This was referenced Jan 22, 2021

[SYCL] Update kernel name diagnostics logic #3069

Closed

[SYCL] Remove XFAIL for passing tests #3076

Merged

diptorupd mentioned this pull request Feb 3, 2021

Generating native atomic instructions in Numba IntelPython/numba-dpex#236

Closed

AGindinson mentioned this pull request Mar 3, 2021

[SYCL] Specialize atomic fetch_min/fetch_max for FP types #3297

Merged

AGindinson deleted the private/agindins/atomic-fadd-draft branch September 22, 2021 08:52

jsji pushed a commit that referenced this pull request Nov 7, 2024

Clarify optionality of spirv-val in test suite (#2765)

38c9811

Original commit: KhronosGroup/SPIRV-LLVM-Translator@e8fce056867bb1b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL] Specialize atomic fetch_add for floating point types #2765

[SYCL] Specialize atomic fetch_add for floating point types #2765

Uh oh!

AGindinson commented Nov 12, 2020 •

edited

Loading

Uh oh!

Uh oh!

Pennycook left a comment

Uh oh!

bader commented Nov 30, 2020

Uh oh!

AGindinson commented Nov 30, 2020

Uh oh!

AGindinson commented Dec 8, 2020

Uh oh!

AGindinson commented Dec 8, 2020

Uh oh!

Uh oh!

Pennycook commented Dec 9, 2020

Uh oh!

AGindinson commented Dec 9, 2020

Uh oh!

AGindinson commented Jan 21, 2021

Uh oh!

romanovvlad left a comment

Uh oh!

Uh oh!

[SYCL] Specialize atomic fetch_add for floating point types #2765

[SYCL] Specialize atomic fetch_add for floating point types #2765

Uh oh!

Conversation

AGindinson commented Nov 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Pennycook left a comment

Choose a reason for hiding this comment

Uh oh!

bader commented Nov 30, 2020

Uh oh!

AGindinson commented Nov 30, 2020

Uh oh!

AGindinson commented Dec 8, 2020

Uh oh!

AGindinson commented Dec 8, 2020

Uh oh!

Uh oh!

Pennycook commented Dec 9, 2020

Uh oh!

AGindinson commented Dec 9, 2020

Uh oh!

AGindinson commented Jan 21, 2021

Uh oh!

romanovvlad left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AGindinson commented Nov 12, 2020 •

edited

Loading