Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compilation errors due to incorrect GPU architecture #2610

Open
profitgrowinginnovator opened this issue Nov 11, 2024 · 0 comments
Open

Compilation errors due to incorrect GPU architecture #2610

profitgrowinginnovator opened this issue Nov 11, 2024 · 0 comments

Comments

@profitgrowinginnovator
Copy link

I got these errors:
src/quantized.cu(1962): error: identifier "__dp4a" is undefined
sumi = __dp4a(v[i], u[i], sumi);
^

18 errors detected in the compilation of "src/quantized.cu".
thread 'main' panicked at /home/remote/.cargo/registry/src/index.crates.io-6f17d22bba15001f/bindgen_cuda-0.1.5/src/lib.rs:391:13:
nvcc error while compiling "src/quantized.cu":

CLI "nvcc" "--gpu-architecture=sm_60" "--ptx" "--default-stream" "per-thread" "--output-directory" "/projects/ai/candle/candle/candle-kernels/target/debug/build/candle-kernels-2b82da83d6e51215/out" "-Isrc" "-I/usr/include" "src/quantized.cu"

They were solved after some ChatGPT and putting this:

export CUDA_COMPUTE_CAP=61

However it would be better to solve this in a more generic way.

Just as an FYI. This happens with Ubuntu 24.04, CUDA 12.6 and 3 Tesla P100.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant