oneMKL only has `dgmm_batch` variant: doesn't map to cublas #562

pen-and-papers · 2024-08-29T16:17:21Z

cublas does not have a specific batch variant of dgmm, it only has cublas<t>dgmm(), see https://docs.nvidia.com/cuda/cublas/#id10
However oneMKL only supports the "batch-style" dgmm_batch interface

This is probably one reason why the cublas backend to oneMKL has no implementation for any dgmm functions. I'm not sure how widely used dgmm is, but I'm guessing since it is a core blas algorithm it should probably be supported.

Is there a reason why there is only the batch style variant of dgmm in oneMKL?

Thanks

The text was updated successfully, but these errors were encountered:

pen-and-papers added the question A request for more information or clarification label Aug 29, 2024

JackAKirk mentioned this issue Oct 10, 2024

implement oneMKL row-major -> cublas mapping #588

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

oneMKL only has `dgmm_batch` variant: doesn't map to cublas #562

oneMKL only has `dgmm_batch` variant: doesn't map to cublas #562

pen-and-papers commented Aug 29, 2024 •

edited

Loading

oneMKL only has dgmm_batch variant: doesn't map to cublas #562

oneMKL only has dgmm_batch variant: doesn't map to cublas #562

Comments

pen-and-papers commented Aug 29, 2024 • edited Loading

oneMKL only has `dgmm_batch` variant: doesn't map to cublas #562

oneMKL only has `dgmm_batch` variant: doesn't map to cublas #562

pen-and-papers commented Aug 29, 2024 •

edited

Loading