-
Notifications
You must be signed in to change notification settings - Fork 769
[SYCL] Enable native FP atomics by default #3869
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
5bdbadf
to
5cf5477
Compare
Since Intel GPU and CPU support the target, pass the required macro by default during device FE compilation for SPIR non-FPGA targets. This alleviates the users of the native FP atomic functionality from passing the macro explicitly. Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>
5cf5477
to
08a21dc
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sycl/test
LGTM. Can you please fix "Jenkins/Precommit"?
@dm-vodopyanov Done; an uplift of CPU RT version had been the only thing required. @mdtoguchi, could you please review now that CI is fine? |
Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>
I have no idea if the Jenkins failures are related to this patch or not. There's basically no useful information in this failure that I can find:
|
Thanks for highlighting - these probably shouldn't be related, but I'll make sure to check by restarting the CI via a stylistic fix. |
Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
/summary:run |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sycl/test
LGTM
@bader, could you please merge the PR once convenient? Thanks! |
Also includes a relevant stylistic fix suggested in intel#3869 (comment). Signed-off-by: Artem Gindinson <artem.gindinson@intel.com>
* upstream/sycl: (489 commits) [SYCL][NFC] Lower overhead of making plugin calls (intel#3982) [SYCL][NFC] Use default macro initialization where applicable (intel#3979) [SYCL] Enable SPV_INTEL_fpga_invocation_pipelining_attributes extension (intel#3864) [SYCL] Disable reassociate pass to reduce register pressure (intel#3615) [Driver][SYCL][FPGA] Restrict -O0 for FPGA with hardware (intel#3966) [SYCL][NFC] Fix warnings coming out of SYCL headers. (intel#3978) [SYCL] Fix bugs with recursion in SYCL kernel (intel#3958) [SYCL][LevelZero] Add support to detect host->device and device->host transfers for USM (intel#3975) [SYCL] Enable native FP atomics by default (intel#3869) [sycl-post-link] Avoid copying from nullptr (intel#3963) [SYCL-PTX] Add warp-reduce path in sub-group reduce (intel#3949) [BuildBot] Uplift CPU/FPGAEMU RT version for CI Process (intel#3946) Fix handling of complex constant expressions Handle OpSpecConstantOp with CompositeExtract and CompositeInsert Handle OpSpecConstantOp with VectorShuffle [FuncSpec] Don't specialise functions with NoDuplicate instructions. [mlir][linalg] Support low padding in subtensor(pad_tensor) lowering [gn build] Port 208332d [AMDGPU] Add Optimize VGPR LiveRange Pass. [mlir][Linalg] NFC - Drop unused variable definition. ...
Since Intel GPU and CPU support the target, pass the required
macro by default during device FE compilation for SPIR non-FPGA
targets. This alleviates the users of the native FP atomic
functionality from passing the macro explicitly.
Tested in intel/llvm-test-suite#308.