Closed
Description
Study the Writing a CUDA Kernel section and document with examples equivalent features in numba-dppy.
Feature | CUDA | DPPY |
---|---|---|
Kernel declaration | @cuda.jit | @numba.jit, @dppy.kernel |
Kernel invocation | threadsperblock = 32 blockspergrid = (an_array.size + (threadsperblock - 1)) // threadsperblock increment_by_one [blockspergrid, threadsperblock](an_array) |
idx=dppy.get_global_id(0) |
Choosing the block size | ||
Thread positioning | ||
Thread index | numba.cuda.threadIdx | dppy.get_local_id |
Block dimension | numba.cuda.blockDim | dppy.get_local_size |
Block index | numba.cuda.blockIdx | dppy.get_group_id |
Grid dimension | numba.cuda.gridDim | dppy.get_num_groups |
Grid position | numba.cuda.grid(ndim) | ??? |
Grid size | numba.cuda.gridsize(ndim) | ??? |
Missing features:
Metadata
Metadata
Assignees
Labels
No labels