Closed
Description
Summary
The build fails with the cuBLAS backend and DPC++ with the errors below. The same build succeeds when only the cuRAND backend is enabled. Any pointers on how to resolve this?
cuda_fp16.hpp:690:1: error: unknown type name '__CUDA_FP16_DECL__'
Debug info from CMake output: (cmake/FindcuBLAS.cmake)
-- Found CUDA: /soft/compilers/cudatoolkit/cuda-12.0.0 (found suitable version "12.0", minimum required is "10.0")
-- Found cuBLAS: /soft/compilers/cudatoolkit/cuda-12.0.0/include
-- 2. CUDA_TOOLKIT_INCLUDE : /soft/compilers/cudatoolkit/cuda-12.0.0/include
-- 2. CUDA_cublas_LIBRARY : /soft/compilers/cudatoolkit/cuda-12.0.0/lib64/libcublas.so
-- 2. CUDA_LIBRARIES : /soft/compilers/cudatoolkit/cuda-12.0.0/lib64/libcudart_static.a;Threads::Threads;dl;/usr/lib64/librt.so
-- 2. CUDA_CUDART_LIBRARY : /soft/compilers/cudatoolkit/cuda-12.0.0/lib64/libcudart.so
-- 2. CUDA_CUDA_LIBRARY : /soft/compilers/cudatoolkit/cuda-12.0.0/lib64/stubs/libcuda.so
Version
oneMKL: d217915
llvm/dpcpp: b2d6fdfd63fe8dd7be98e113cde9bc7c3d9a21d8
CUDA: 12.0
Build instructions
CUDA_LIB_PATH=/soft/compilers/cudatoolkit/cuda-12.0.0/lib64/stubs cmake .. -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DENABLE_CUBLAS_BACKEND=True -DENABLE_CURAND_BACKEND=True -DENABLE_ROCBLAS_BACKEND=False -DENABLE_MKLCPU_BACKEND=False -DENABLE_MKLGPU_BACKEND=False -DBUILD_FUNCTIONAL_TESTS=False -DBUILD_EXAMPLES=False
Error
[ 26%] Building CXX object bin/blas/backends/cublas/CMakeFiles/onemkl_blas_cublas_obj.dir/cublas_level1.cpp.o
cd /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/build/bin/blas/backends/cublas && /lus/eagle/projects/UINTAH_aesp/abagusetty/llvm_sycl/build_PrngEnvGnu_cuda1200_12212022/install/bin/clang++ -DCUDA_NO_HALF -I/lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/include -I/lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/include -I/lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src -isystem /lus/eagle/projects/UINTAH_aesp/abagusetty/llvm_sycl/build_PrngEnvGnu_cuda1200_12212022/install/include/sycl -isystem /soft/compilers/cudatoolkit/cuda-12.0.0/include -DSYCL2020_DISABLE_DEPRECATION_WARNINGS -O3 -DNDEBUG -fPIC -fsycl -fsycl-targets=nvptx64-nvidia-cuda -fsycl-unnamed-lambda -MD -MT bin/blas/backends/cublas/CMakeFiles/onemkl_blas_cublas_obj.dir/cublas_level1.cpp.o -MF CMakeFiles/onemkl_blas_cublas_obj.dir/cublas_level1.cpp.o.d -o CMakeFiles/onemkl_blas_cublas_obj.dir/cublas_level1.cpp.o -c /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_level1.cpp
clang-16: warning: CUDA version is newer than the latest partially supported version 11.8 [-Wunknown-cuda-version]
In file included from /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_level1.cpp:20:
In file included from /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_task.hpp:13:
In file included from /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_scope_handle.hpp:27:
/lus/eagle/projects/UINTAH_aesp/abagusetty/llvm_sycl/build_PrngEnvGnu_cuda1200_12212022/install/bin/../include/sycl/backend/cuda.hpp:17:2: warning: sycl/backend/cuda.hpp is deprecated and not required anymore [-W#warnings]
#warning sycl/backend/cuda.hpp is deprecated and not required anymore
^
1 warning generated.
In file included from /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_level1.cpp:19:
In file included from /lus/eagle/projects/UINTAH_aesp/abagusetty/oneMKL/src/blas/backends/cublas/cublas_helper.hpp:31:
In file included from /soft/compilers/cudatoolkit/cuda-12.0.0/include/cublas_v2.h:69:
In file included from /soft/compilers/cudatoolkit/cuda-12.0.0/include/cublas_api.h:77:
In file included from /soft/compilers/cudatoolkit/cuda-12.0.0/include/cuda_fp16.h:4006:
/soft/compilers/cudatoolkit/cuda-12.0.0/include/cuda_fp16.hpp:690:1: error: unknown type name '__CUDA_FP16_DECL__'
__CUDA_FP16_DECL__ __half2 __internal_device_float2_to_half2_rn(const float a, const float b) {
^
/soft/compilers/cudatoolkit/cuda-12.0.0/include/cuda_fp16.hpp:690:27: error: expected ';' after top level declarator
__CUDA_FP16_DECL__ __half2 __internal_device_float2_to_half2_rn(const float a, const float b) {
^
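One possible first diagnostic step, sketched here under assumptions, is to find where the CUDA headers define `__CUDA_FP16_DECL__` and then inspect the preprocessor guard around that definition. The helper name `show_macro` and the `CUDA_INC` variable are hypothetical; the include path is the one reported in the CMake output above. This only locates the definition and does not by itself confirm the cause of the error.

```shell
#!/bin/sh
# Hypothetical helper (not part of oneMKL or the CUDA toolkit):
# print every line in the given files that contains the given text,
# prefixed with "file:line:".
# Usage: show_macro TEXT FILE...
show_macro() {
    text=$1
    shift
    # -n adds line numbers; the extra /dev/null operand makes grep
    # print the file name even when only one real file is given.
    grep -n -- "$text" "$@" /dev/null
}

# Assumed include directory, taken from the CMake output in this report.
CUDA_INC=${CUDA_INC:-/soft/compilers/cudatoolkit/cuda-12.0.0/include}

if [ -f "$CUDA_INC/cuda_fp16.h" ]; then
    show_macro '#define __CUDA_FP16_DECL__' \
        "$CUDA_INC/cuda_fp16.h" "$CUDA_INC/cuda_fp16.hpp" \
        || echo "no definition of __CUDA_FP16_DECL__ found"
fi
```

If a definition is found, opening the header at the reported line shows which `#if` block it sits in, which can then be compared against the macros the failing `clang++` invocation actually defines.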