Skip to content

Gaps in writing kernel features #157

Closed
@akharche

Description

@akharche

Study the Writing a CUDA Kernel section and document with examples equivalent features in numba-dppy.

Feature CUDA DPPY
Kernel declaration @cuda.jit @numba.jit, @dppy.kernel
Kernel invocation threadsperblock = 32 blockspergrid = (an_array.size + (threadsperblock - 1)) // threadsperblock increment_by_one [blockspergrid, threadsperblock](an_array) idx=dppy.get_global_id(0)
Choosing the block size
Thread positioning
Thread index numba.cuda.threadIdx dppy.get_local_id
Block dimension numba.cuda.blockDim dppy.get_local_size
Block index numba.cuda.blockIdx dppy.get_group_id
Grid dimension numba.cuda.gridDim dppy.get_num_groups
Grid position numba.cuda.grid(ndim) ???
Grid size numba.cuda.gridsize(ndim) ???

Missing features:

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions