Skip to content

[SYCL][CUDA][DOC] GettingStartedGuide.md to recommend cuda 11.6 #5917

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Mar 30, 2022
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 16 additions & 10 deletions sycl/doc/GetStartedGuide.md
Original file line number Diff line number Diff line change
Expand Up @@ -168,25 +168,31 @@ There is experimental support for DPC++ for CUDA devices.

To enable support for CUDA devices, follow the instructions for the Linux or
Windows DPC++ toolchain, but add the `--cuda` flag to `configure.py`. Note,
the CUDA backend has experimental Windows support, windows subsystem for
the CUDA backend has Windows support; windows subsystem for
linux (WSL) is not needed to build and run the CUDA backend.

Enabling this flag requires an installation of
Enabling this flag requires an installation of at least
[CUDA 10.2](https://developer.nvidia.com/cuda-10.2-download-archive) on
the system, refer to
[NVIDIA CUDA Installation Guide for Linux](https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html)
or
[NVIDIA CUDA Installation Guide for Windows](https://docs.nvidia.com/cuda/cuda-installation-guide-microsoft-windows/index.html)

**_NOTE:_** An installation of at least
[CUDA 11.6](https://developer.nvidia.com/cuda-downloads) is recommended because
there is a known issue with some math builtins when using -O1/O2/O3
Optimization options for CUDA toolkits prior to 11.6 (This is due to a bug in
earlier versions of the CUDA toolkit: see
[this issue](https://forums.developer.nvidia.com/t/libdevice-functions-causing-ptxas-segfault/193352)).

An installation of at least
[CUDA 11.0](https://developer.nvidia.com/cuda-11.0-download-archive)
is required for fully utilize Turing (SM 75) devices.

Currently, the only combination tested is Ubuntu 18.04 with CUDA 10.2 using
a Titan RTX GPU (SM 71). The CUDA backend should work on Windows or Linux
operating systems with any GPU compatible with SM 50 or above. The default
SM for the NVIDIA CUDA backend is 5.0. Users can specify lower values,
but some features may not be supported. Windows CUDA support is experimental
as it is not currently tested on the CI.
is required to fully utilize Turing (SM 75) devices and to enable Ampere (SM 80)
core features.

The CUDA backend should work on Windows or Linux operating systems with any
GPU compatible with SM 50 or above. The default SM for the NVIDIA CUDA backend
is 5.0. Users can specify lower values, but some features may not be supported.

**Non-standard CUDA location**

Expand Down