[AsyncAlloc][SYCL][CUDA][Exp] Initial device side implementation for the sycl_ext_oneapi_async_memory_alloc extension #16900

Seanst98 · 2025-02-06T10:27:54Z

Implement the sycl_ext_oneapi_async_memory_alloc extension for asynchronous memory allocation and freeing in CUDA, for device-allocated pools only.

SYCL entrypoints which specify host or shared side pools, or pools created by pre-existing allocations, will throw.

co-authored-by: Sean Stirling sean.stirling@codeplay.com
co-authored-by: Hugh Delaney hugh.delaney@codeplay.com

AerialMantis

Thanks for the changes, this LGTM.

npmiller · 2025-03-27T09:05:53Z

@intel/llvm-gatekeepers I believe this is ready to merge

uditagarwal97 · 2025-03-28T22:05:31Z

Hi @Seanst98,
This PR broke nightly: https://github.com/intel/llvm/actions/runs/14121213165/job/39561695855

/__w/llvm/llvm/src/unified-runtime/source/adapters/cuda/usm.cpp: In constructor 'ur_usm_pool_handle_t_::ur_usm_pool_handle_t_(ur_context_handle_t, ur_device_handle_t, ur_usm_pool_desc_t*)':
/__w/llvm/llvm/src/unified-runtime/source/adapters/cuda/usm.cpp:434:20: error: 'CUmemPoolProps' {aka 'struct CUmemPoolProps_st'} has no member named 'maxSize'
  434 |       MemPoolProps.maxSize =
      |                    ^~~~~~~

This could be due to a difference in CUDA versions between pre-commit and Nightly. The Ubuntu 22 build job in Nightly uses CUDA 12.1, while the Ubuntu 24 build job uses CUDA 12.6.3.
Could you please look into this?
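
For illustration, one common way to tolerate a struct field that only exists in newer toolkits is to guard the assignment with the `CUDA_VERSION` macro from `cuda.h`. The sketch below is not the fix that was merged; the helper name `setPoolMaxSize` is hypothetical, and the `12020` threshold is an assumption based only on the report above that 12.1 lacks `maxSize` while 12.6.3 has it. A stand-in struct is defined so the sketch compiles without the CUDA toolkit installed.

```cpp
#include <cstddef>

// Pull in the real driver API header when it is available.
#if defined(__has_include)
#if __has_include(<cuda.h>)
#include <cuda.h>
#endif
#endif

#ifndef CUDA_VERSION
// Stand-in so the sketch builds without CUDA (assumption: mimics CUDA 12.1,
// where CUmemPoolProps has no maxSize member).
#define CUDA_VERSION 12010
struct CUmemPoolProps {
  unsigned char reserved[64];
};
#endif

// Hypothetical helper: applies a maximum pool size only when the toolkit
// headers expose the field. Returns true if the limit was applied.
inline bool setPoolMaxSize(CUmemPoolProps &Props, std::size_t MaxSize) {
#if CUDA_VERSION >= 12020 // assumed first version that provides maxSize
  Props.maxSize = MaxSize;
  return true;
#else
  (void)Props;
  (void)MaxSize;
  return false; // caller can report the limit as unsupported
#endif
}
```

With a guard like this, the same source builds against both toolkit versions used by the Nightly jobs, and the caller decides how to handle the case where the limit cannot be set.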

Seanst98 · 2025-03-31T07:48:00Z

> Hi @Seanst98, This PR broke nightly: https://github.com/intel/llvm/actions/runs/14121213165/job/39561695855

Thanks for bringing this to my attention, I've pushed a PR which addresses this: #17733

This patch fixes a couple of static analysis issues with the recent [async alloc patch](#16900):

* Use `std::move` for the shared pointer in the `CGAsyncAlloc` constructor. It is already passed by value to the constructor, so it can simply be moved when assigning it to the member.
* Assert that the queue is available in `AsyncFree`.
* Catch any exceptions from the memory pool destructor.
* Initialize the AsyncAlloc fields in the handler.
* Add `[[maybe_unused]]` for a parameter only used in an assert.
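
Two of the items above (moving a by-value shared pointer into a member, and catching exceptions in a destructor) can be sketched roughly as follows. This is a minimal illustration, not the code from the patch: `PoolImpl`, `PoolOwner`, `release`, and `FailOnRelease` are hypothetical names standing in for the real classes.

```cpp
#include <exception>
#include <memory>
#include <stdexcept>
#include <utility>

// Hypothetical resource whose cleanup may throw.
struct PoolImpl {
  bool FailOnRelease = false;
  void release() {
    if (FailOnRelease)
      throw std::runtime_error("release failed");
  }
};

// Hypothetical owner, standing in for a class that stores a shared pointer.
class PoolOwner {
  std::shared_ptr<PoolImpl> MImpl;

public:
  // The argument is already a copy (passed by value), so moving it into the
  // member avoids a second reference-count increment and decrement.
  explicit PoolOwner(std::shared_ptr<PoolImpl> Impl)
      : MImpl(std::move(Impl)) {}

  // Destructors are implicitly noexcept; an escaping exception would call
  // std::terminate, so any failure during cleanup is caught here.
  ~PoolOwner() {
    try {
      if (MImpl)
        MImpl->release();
    } catch (const std::exception &) {
      // Swallowed in this sketch; real code might log the error instead.
    }
  }

  long useCount() const { return MImpl.use_count(); }
};
```

Passing by value and moving keeps a single constructor that works for both lvalue and rvalue arguments: callers that still need their pointer pay for one copy, and callers that hand over ownership pay for none.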

…the sycl_ext_oneapi_async_memory_alloc extension (#16900)

Implement the [sycl_ext_oneapi_async_memory_alloc](#14800) extension for asynchronous memory allocation and freeing in CUDA, for device allocated pools only.

SYCL entrypoints which specify host or shared side pools, or pools created by pre-existing allocations will throw.

co-authored-by: Sean Stirling <sean.stirling@codeplay.com>
co-authored-by: Hugh Delaney <hugh.delaney@codeplay.com>

---------

Co-authored-by: Hugh Delaney <hugh.delaney@codeplay.com>
Co-authored-by: Nicolas Miller <nicolas.miller@codeplay.com>

Seanst98 mentioned this pull request Feb 6, 2025

DRAFT: [AsyncAlloc][CUDA] Initial UR spec and implementation for the async oneapi-src/unified-runtime#2668

Closed

Seanst98 force-pushed the sean/async-alloc branch from f9534ee to c0bb96d Compare February 21, 2025 10:18

Seanst98 force-pushed the sean/async-alloc branch from c0bb96d to 7fa5ae1 Compare February 21, 2025 11:01

EwanC mentioned this pull request Feb 21, 2025

[UR][L0] Add initial USM alloc enqueue API #17112

Merged

Seanst98 force-pushed the sean/async-alloc branch 2 times, most recently from 30ed8bb to 82a106a Compare February 21, 2025 13:18

Seanst98 force-pushed the sean/async-alloc branch from 82a106a to 5f7de35 Compare February 21, 2025 13:20

Seanst98 force-pushed the sean/async-alloc branch from 5f7de35 to 970fcd1 Compare February 21, 2025 14:54

Seanst98 force-pushed the sean/async-alloc branch from 970fcd1 to def5019 Compare February 21, 2025 16:46

Seanst98 force-pushed the sean/async-alloc branch from def5019 to 80829c1 Compare February 24, 2025 10:16

Seanst98 force-pushed the sean/async-alloc branch from 80829c1 to e82546a Compare February 25, 2025 15:46

Seanst98 force-pushed the sean/async-alloc branch from e82546a to ef9cb95 Compare February 26, 2025 10:22

Seanst98 mentioned this pull request Feb 26, 2025

run_prebuilt_e2e_tests CI jobs fail in cases with UR API changes #16982

Closed

Seanst98 added 2 commits March 26, 2025 11:11

Merge branch 'sycl' into sean/async-alloc

2ba3fd9

Update symbols

9bc5dc9

Merge branch 'sycl' into sean/async-alloc

e48d88a

AerialMantis approved these changes Mar 26, 2025

Merge branch 'sycl' into sean/async-alloc

2b81311

Merge branch 'sycl' into sean/async-alloc

22e6311

Merge branch 'sycl' into sean/async-alloc

285f593

martygrant merged commit faa2365 into intel:sycl Mar 27, 2025
31 checks passed

npmiller mentioned this pull request Mar 27, 2025

[SYCL][AsyncAlloc] Fix minor async alloc issues #17689

Merged

npmiller mentioned this pull request Apr 1, 2025

[L0] Update async alloc support to lateset spec changes #17772

Open

Seanst98 deleted the sean/async-alloc branch April 4, 2025 16:39
