Skip to content

Commit 92bca39

Browse files
committed
op/cuda: Add flexible vector type
CUDA provides only limited vector widths and only for variable width integer types. We use our own vector type and some C++ templates to get more flexible vectors. We aim to get 128bit loads by adjusting the width based on the type size. Signed-off-by: Joseph Schuchart <joseph.schuchart@stonybrook.edu>
1 parent 25c24c9 commit 92bca39

File tree

1 file changed

+480
-610
lines changed

1 file changed

+480
-610
lines changed

0 commit comments

Comments
 (0)