[SYCL][CUDA] Handle large Y/Z range dimensions. #7968

mmoadeli · 2023-01-10T01:16:55Z

The dimensions passed to sycl::range determine the blocks per grid and the threads per block. Currently, the threads-per-block calculation is performed only for the x dimension. As a result, the blocks-per-grid values for the y and z dimensions passed to cuLaunchKernel come directly from the sycl::range arguments. cuLaunchKernel can then return an error when those y or z values are larger than 65535.
This PR adds a simple tuning of threads per block for large (over 65535) values of the Y and Z dimensions, so that the associated blocks per grid stay within the allowed range.
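
As a rough illustration of the kind of tuning described above (not the code in pi_cuda.cpp), one way to keep the Y or Z grid size in range is to pick a threads-per-block value that divides the range evenly and brings the block count down to at most 65535. The helper name `pickThreadsPerBlock` and its parameters are hypothetical, and the sketch ignores the device's combined threads-per-block limit across all three dimensions.

```cpp
#include <cstddef>

// Hypothetical helper for illustration only; not taken from pi_cuda.cpp.
// Returns the smallest threads-per-block value that divides globalSize evenly,
// does not exceed maxThreads, and yields at most maxBlocks blocks per grid.
// Returns 0 when no such value exists.
std::size_t pickThreadsPerBlock(std::size_t globalSize, std::size_t maxThreads,
                                std::size_t maxBlocks) {
  for (std::size_t threads = 1;
       threads <= maxThreads && threads <= globalSize; ++threads) {
    if (globalSize % threads == 0 && globalSize / threads <= maxBlocks)
      return threads;
  }
  return 0;
}
```

For example, with a Y range of 131072, an assumed per-dimension block limit of 1024, and the 65535 grid limit, the helper returns 4, which gives 32768 blocks in Y. A prime range such as 65537 has no suitable divisor, so the helper returns 0 and the caller would still have to report an error.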

bader · 2023-01-10T01:29:37Z

Please add the [SYCL] title tag, plus a component tag:

To a reasonable extent, title tags can be used to signify the component changed, e.g.: [PI], [CUDA], [Doc].

https://github.com/intel/llvm/blob/sycl/CONTRIBUTING.md#pull-request

I also recommend linking this PR to issue #7854, so that the issue is closed automatically when the PR is merged. https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue

mmoadeli · 2023-01-10T09:19:02Z

/verify with intel/llvm-test-suite#1500

sycl/plugins/cuda/pi_cuda.cpp

steffenlarsen

Looks good! 🚀

steffenlarsen · 2023-01-11T17:38:54Z

> /verify with #7854

@mmoadeli - Was that the intended link?

mmoadeli · 2023-01-12T09:03:48Z

> > /verify with #7854
>
> @mmoadeli - Was that the intended link?

@steffenlarsen, thanks, it's updated.

Handle the case where Y and/or Z range dimensions are larger than blocks per grid limits.

a0a6f03

mmoadeli requested a review from a team as a code owner January 10, 2023 01:16

mmoadeli requested a review from steffenlarsen January 10, 2023 01:16

This was referenced Jan 10, 2023

[SYCL][CUDA] parallel_for with sycl::range fails with limitations to 65535 #7854

Closed

Test the case where range is over allowed limit. intel/llvm-test-suite#1500

Merged

mmoadeli changed the title ~~Handle large Y/Z range dimensions.~~ [SYCL][CUDA] Handle large Y/Z range dimensions. Jan 10, 2023

mmoadeli linked an issue Jan 10, 2023 that may be closed by this pull request

[SYCL][CUDA] parallel_for with sycl::range fails with limitations to 65535 #7854

Closed

steffenlarsen reviewed Jan 10, 2023

Three review comments on sycl/plugins/cuda/pi_cuda.cpp (outdated)

- Fix the case when range dimension is equal to dimension's max block size. - Improve variable naming

423c633

steffenlarsen approved these changes Jan 10, 2023

bader merged commit afeb8a6 into intel:sycl Jan 11, 2023

mmoadeli deleted the large-grid-yz-dim branch January 12, 2023 08:40

bader pushed a commit to intel/llvm-test-suite that referenced this pull request Jan 25, 2023

Test the case where range is over allowed limit. (#1500)

767d532

Provides a test for intel/llvm#7968

aelovikov-intel pushed a commit to aelovikov-intel/llvm that referenced this pull request Mar 27, 2023

Test the case where range is over allowed limit. (intel/llvm-test-suite#1500)

05c6d19

Provides a test for intel#7968
