Adapt NPP calls for CUDA >= 12.9 #757
base: main
Conversation
Thanks a lot for sending this PR @Kh4L, this is very helpful.
I applied our linter and also enabled testing for CUDA 12.9 so that we can correctly check the PR.
Do I understand correctly that in 12.9 we have to use the context-based API, while at the same time the context creation helper was removed?! This sounds error prone, is there any way we could avoid manually building and setting the context attributes?
at::cuda::CUDAStream nppStreamWrapper =
    c10::cuda::getStreamFromExternal(nppGetStream(), device_.index());
nppDoneEvent.record(nppStreamWrapper);
nppDoneEvent.block(at::cuda::getCurrentCUDAStream());
@Kh4L Can you confirm my understanding that the nppiNV12ToRGB_8u_P2C3R_Ctx call will properly wait on the stream before returning?
Thank you!
Do I understand correctly that in 12.9 we have to use the context-based API, while at the same time the context creation helper was removed?! This sounds error prone, is there any way we could avoid manually building and setting the context attributes?
Unfortunately, there is currently no alternative way to build this.
Since I'm not part of the NPP team, I can't comment on their design choices.
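For reference, building the context manually boils down to filling the NppStreamContext fields from the device properties yourself. Here is a minimal sketch of what that looks like (illustrative only, not the exact code in this PR; the makeNppStreamContext helper name is made up and error handling is omitted):

#include <cuda_runtime.h>
#include <nppdefs.h>

// Build an NppStreamContext by hand once nppGetStreamContext() is no longer
// available. Error handling is omitted for brevity.
NppStreamContext makeNppStreamContext(int deviceIndex, cudaStream_t stream) {
  NppStreamContext ctx{};
  ctx.hStream = stream;
  ctx.nCudaDeviceId = deviceIndex;

  cudaDeviceProp prop{};
  cudaGetDeviceProperties(&prop, deviceIndex);
  ctx.nMultiProcessorCount = prop.multiProcessorCount;
  ctx.nMaxThreadsPerMultiProcessor = prop.maxThreadsPerMultiProcessor;
  ctx.nMaxThreadsPerBlock = prop.maxThreadsPerBlock;
  ctx.nSharedMemPerBlock = prop.sharedMemPerBlock;
  ctx.nCudaDevAttrComputeCapabilityMajor = prop.major;
  ctx.nCudaDevAttrComputeCapabilityMinor = prop.minor;

  cudaStreamGetFlags(stream, &ctx.nStreamFlags);
  return ctx;
}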
@Kh4L Can you confirm my understanding that the nppiNV12ToRGB_8u_P2C3R_Ctx call will properly wait on the stream before returning?
That's correct. We bind the NPP context to the active CUDA stream so we can leverage CUDA stream management rather than performing a blocking sync.
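Roughly, the flow looks like this (an illustrative sketch, not the literal PR code; pSrc, srcStep, pDst, dstStep and roi are placeholder names, and makeNppStreamContext is the hypothetical helper sketched above):

// Bind the NPP context to PyTorch's current CUDA stream for this device.
NppStreamContext nppCtx = makeNppStreamContext(
    device_.index(), at::cuda::getCurrentCUDAStream(device_.index()).stream());

// The _Ctx call enqueues its kernels on nppCtx.hStream and returns; any work
// later submitted to that same stream is ordered after the conversion, so no
// blocking device synchronization is needed here.
NppStatus status =
    nppiNV12ToRGB_8u_P2C3R_Ctx(pSrc, srcStep, pDst, dstStep, roi, nppCtx);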
@NicolasHug LMK if you need anything else! The assertion error doesn't seem related to my change.
Nothing else to do on your side @Kh4L, thank you. I'll merge this soon, I'll just try to extract all the …
int dev = 0;
cudaError_t err = cudaGetDevice(&dev);
Hi @Kh4L, I actually have some questions before moving forward:
- Should we just rely on the existing `device_` attribute instead of calling `cudaGetDevice(&dev)`, or are they actually equivalent?
- Would it make sense to cache the `nppCtx` across calls? In this PR it looks like we're creating the context over and over for every single frame that needs to be decoded. I wonder if it might be beneficial to cache it in the class and re-use it (see the sketch after this comment)?
Thanks for your help so far, I'm still trying to build familiarity with that part of the code base.
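To illustrate the caching idea from the second bullet, one possible shape (a sketch only; the class and member names are hypothetical, and it reuses the makeNppStreamContext helper sketched earlier) would be to keep the context as a member and rebuild it only when the active stream changes:

// Hypothetical caching sketch: keep the context in the class instead of
// re-creating it once per decoded frame.
class CudaDeviceInterface {
  NppStreamContext& getNppStreamContext() {
    cudaStream_t current =
        at::cuda::getCurrentCUDAStream(device_.index()).stream();
    // Rebuild only on first use or when the active stream changed.
    if (!nppCtxInitialized_ || nppCtx_.hStream != current) {
      nppCtx_ = makeNppStreamContext(device_.index(), current);
      nppCtxInitialized_ = true;
    }
    return nppCtx_;
  }

  at::Device device_{at::kCUDA};
  NppStreamContext nppCtx_{};
  bool nppCtxInitialized_ = false;
};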
Recent CUDA versions don't support non-context NPP calls, so use the ctx-based API calls.
Also, CUDA 12.9+ deprecates `nppGetStreamContext`, so we need to build the NPP context manually.
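For context, a version-guarded call site could look roughly like this (a sketch only; the exact macro and the 12.9 cutoff value are assumptions rather than taken from the PR, and the argument names are placeholders):

#include <cuda_runtime_api.h>  // defines CUDART_VERSION, e.g. 12090 for CUDA 12.9

#if CUDART_VERSION >= 12090  // assumed cutoff for CUDA 12.9
  // Context-based API: the stream comes from the manually built NppStreamContext.
  status = nppiNV12ToRGB_8u_P2C3R_Ctx(pSrc, srcStep, pDst, dstStep, roi, nppCtx);
#else
  // Legacy API: uses the global NPP stream configured via nppSetStream().
  status = nppiNV12ToRGB_8u_P2C3R(pSrc, srcStep, pDst, dstStep, roi);
#endif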