Open
Description
torch_cluster.radius
is not compatible with the CUDA graph (https://pytorch.org/docs/stable/notes/cuda.html#cuda-graphs)
from torch_cluster import radius
device = pt.device('cuda')
x = pt.tensor([0.0], device=device)
y = pt.tensor([1.0], device=device)
graph = pt.cuda.CUDAGraph()
with pt.cuda.graph(graph):
radius(x, y, r=2)
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
Input In [26], in <module>
8 with pt.cuda.graph(graph):
----> 9 radius(x, y, r=2)
RuntimeError: The following operation failed in the TorchScript interpreter.
Traceback of TorchScript (most recent call last):
File "/shared2/raimis/opt/miniconda/envs/torchmd-net/lib/python3.9/site-packages/torch_cluster/radius.py", line 72, in radius
torch.cumsum(deg, 0, out=ptr_y[1:])
return torch.ops.torch_cluster.radius(x, y, ptr_x, ptr_y, r,
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
max_num_neighbors, num_workers)
RuntimeError: CUDA error: operation not permitted when stream is capturing
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.