Add support for hipstdpar #1

gsitaram · 2024-04-29T21:10:32Z

This PR adds support for offload to AMD GPUs using the par_unseq execution policy in C++ standard parallelism algorithms. To trigger the GPU offload of all parallel algorithms, the --hipstdpar compilation flag must be provided. For GPU targets other than the current default of gfx906, the --offload-arch=<arch_string> option must also be provided at compile time.

When using ROCm 6.1.0, the compilation commands may look like the following if compiling for an AMD Instinct MI200 series GPU, for instance:

cmake -Bbuild -H. -DMODEL=std-data -DCMAKE_CXX_COMPILER=hipcc -DCLANG_OFFLOAD=gfx90a
cmake --build build

Please let me know if you have any questions.

src/std-data/model.cmake

src/std-indices/model.cmake

Add support for hipstdpar

9d4cc72

gsitaram requested a review from afanfa April 29, 2024 21:12

afanfa reviewed Apr 30, 2024

View reviewed changes

src/std-data/model.cmake Outdated Show resolved Hide resolved

src/std-indices/model.cmake Outdated Show resolved Hide resolved

Remove --hipstdpar-path as it is not needed with ROCm 6.1 onwards

9f67c5f

afanfa merged commit 6bd658c into main Apr 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for hipstdpar #1

Add support for hipstdpar #1

Uh oh!

gsitaram commented Apr 29, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add support for hipstdpar #1

Add support for hipstdpar #1

Uh oh!

Conversation

gsitaram commented Apr 29, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants