different macros are defined during AOT compilation for CUDA targets #15545

AuroraPerego · 2024-09-27T23:32:27Z

Describe the bug

When compiling AOT for a specific target the corresponding macro is set to 1, while the macros for all the other targets are set to 0. However, for the CUDA backend, the macro that are set to 0 by the compiler end with *_SM**__, while those that correspond to the target we are compiling for end with *_SM_**__.
As an example, when compiling for NVIDIA Pascal architecture the macro defined are:

...
#define __SYCL_TARGET_NVIDIA_GPU_SM50__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM52__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM53__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM60__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM61__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM62__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM70__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM72__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM75__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM80__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM86__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM87__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM89__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM90__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM_60__ 1
...

To reproduce

Include code snippet as short as possible

`test.cpp`

#include <sycl/sycl.hpp>
int main()
{
    return 0;
}

Specify the command which should be used to compile the program

icpx -fsycl -fsycl-targets=nvidia_gpu_sm_60 -dM -E test.cpp | grep "60__"

Indicate what is wrong and what was expected
The output is:

#define __SYCL_TARGET_NVIDIA_GPU_SM60__ 0
#define __SYCL_TARGET_NVIDIA_GPU_SM_60__ 1

while only one of the two should exist.

Environment

OS: RHEL 8.10
Target device and vendor: Nvidia GPUs

icpx version:

Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711)
Target: x86_64-unknown-linux-gnu
Thread model: posix
InstalledDir: /opt/intel/oneapi/compiler/2024.2/bin/compiler
Configuration file: /opt/intel/oneapi/compiler/2024.2/bin/compiler/../icpx.cfg

Dependencies version:

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2024 NVIDIA Corporation
Built on Thu_Mar_28_02:18:24_PDT_2024
Cuda compilation tools, release 12.4, V12.4.131
Build cuda_12.4.r12.4/compiler.34097967_0

Additional context

The problem may be related to the definitions in the file /opt/intel/oneapi/compiler/latest/include/sycl/ext/oneapi/experimental/device_architecture.hpp.

The text was updated successfully, but these errors were encountered:

AuroraPerego · 2024-09-27T23:33:31Z

FYI @fwyzard @ivorobts

GeorgeWeb · 2024-10-04T16:25:30Z

Thanks for the issue. Nice catch!

AuroraPerego added bug Something isn't working cuda CUDA back-end labels Sep 27, 2024

GeorgeWeb self-assigned this Oct 4, 2024

GeorgeWeb mentioned this issue Oct 6, 2024

[SYCL] Fix TARGET_NVIDIA_GPU macro defines for sycl_ext_oneapi_device_architecture #15610

Closed

jchlanda mentioned this issue Oct 7, 2024

[SYCL] Correctly spell out SM version macro when AOT compiling #15615

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

different macros are defined during AOT compilation for CUDA targets #15545

different macros are defined during AOT compilation for CUDA targets #15545

AuroraPerego commented Sep 27, 2024

AuroraPerego commented Sep 27, 2024

GeorgeWeb commented Oct 4, 2024

different macros are defined during AOT compilation for CUDA targets #15545

different macros are defined during AOT compilation for CUDA targets #15545

Comments

AuroraPerego commented Sep 27, 2024

Describe the bug

To reproduce

test.cpp

Environment

Additional context

AuroraPerego commented Sep 27, 2024

GeorgeWeb commented Oct 4, 2024

`test.cpp`