[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa #4223

npmiller · 2021-07-30T14:52:09Z

No description provided.

bader

Does it make sense to add sqrt for fp16 as well?

npmiller · 2021-07-30T15:35:05Z

Does it make sense to add sqrt for fp16 as well?

Good point, I'll add it as well, it's just that I ran into an application using the double variant.

bader · 2021-07-30T15:43:54Z

Does it make sense to add sqrt for fp16 as well?

Good point, I'll add it as well, it's just that I ran into an application using the double variant.

Considering that typical built-in implementation for amdgcn-amdhsa target is a simple wrapper around a compiler built-in, adding implementation for all the types seems like a good rule to follow.

npmiller · 2021-07-30T16:28:50Z

Does it make sense to add sqrt for fp16 as well?

Good point, I'll add it as well, it's just that I ran into an application using the double variant.

Considering that typical built-in implementation for amdgcn-amdhsa target is a simple wrapper around a compiler built-in, adding implementation for all the types seems like a good rule to follow.

So I was testing this a bit further and the current change actually breaks the build, adding fp16 support seems a bit more involved than I thought.

This is because the default build, builds for the tahiti architecture, I reckon to have the lowest common denominator so libclc works on as many GPUs as possible, but that version of the ISA doesn't support fp16, so we'd need to update this version or add a way to change it at build time. In addition it seems that cl_khr_fp16 is defined anyway so we can't really use that right now in the code to skip the half variant for tahiti.

So we should probably leave out the half variant for now until we can setup half support for AMD in libclc properly.

npmiller · 2021-07-30T16:33:23Z

I just forced pushed to remove the commit adding the fp16 variant as it doesn't work, other commit is untouched, see previous comment for reasoning.

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa

672fb14

npmiller requested a review from bader as a code owner July 30, 2021 14:52

bader previously approved these changes Jul 30, 2021

View reviewed changes

bader added the libclc libclc project related issues label Jul 30, 2021

npmiller dismissed bader’s stale review via 85e412a July 30, 2021 15:41

bader previously approved these changes Jul 30, 2021

View reviewed changes

npmiller dismissed bader’s stale review via 672fb14 July 30, 2021 16:32

npmiller force-pushed the rocm-sqrt-double branch from 85e412a to 672fb14 Compare July 30, 2021 16:32

bader approved these changes Jul 30, 2021

View reviewed changes

bader merged commit 2af5e6c into intel:sycl Aug 1, 2021

zahiraam pushed a commit to zahiraam/llvm-1 that referenced this pull request Aug 2, 2021

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa (intel#4223)

c1b1a74

npmiller mentioned this pull request Sep 29, 2021

[SYCL] Enable shuffle tests on HIP AMD. intel/llvm-test-suite#487

Merged

npmiller mentioned this pull request Feb 15, 2023

[SYCL][HIP] Required aspect fp16 is not supported on the device #8330

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa #4223

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa #4223

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

bader left a comment

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

bader commented Jul 30, 2021

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

Uh oh!

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa #4223

[SYCL][LIBCLC] Add sqrt for doubles for amdgcn-amdhsa #4223

Uh oh!

Conversation

npmiller commented Jul 30, 2021

Uh oh!

bader left a comment

Choose a reason for hiding this comment

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

bader commented Jul 30, 2021

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

npmiller commented Jul 30, 2021

Uh oh!

Uh oh!