gpu: nvidia: Add support for cublaslt matmul #2124

ShanoToni · 2024-09-26T12:45:14Z

Description

Adds support for using the cublaslt API for IMMA kernels and when the bias and/or relu post-op can be merged into the cublaslt epilogue.

mgouicem · 2024-09-26T13:25:03Z

make test
disable device_cpu
enable device_gpu
enable thr_cuda
enable arch_rtx

mgouicem

It seems some copyright issues escaped (and the same are present in main). I wonder why they were not caught in the original PR Lightweight scans...

src/gpu/nvidia/cudnn_reorder_lt.hpp

cmake/FindcublasLt.cmake

src/common/memory_desc.hpp

ShanoToni requested review from a team as code owners September 26, 2024 12:45

mgouicem reviewed Sep 26, 2024

View reviewed changes

src/gpu/nvidia/cudnn_reorder_lt.hpp Outdated Show resolved Hide resolved

cmake/FindcublasLt.cmake Outdated Show resolved Hide resolved

src/common/memory_desc.hpp Show resolved Hide resolved

github-actions bot added platform:gpu-nvidia Codeowner: @oneapi-src/onednn-gpu-nvidia platform:gpu-generic Codeowner: @oneapi-src/onednn-gpu-generic backport labels Sep 26, 2024

gpu: nvidia: Add support for cublaslt matmul

9ec9455

ShanoToni force-pushed the cublas_lt_impl branch from c16e939 to 9ec9455 Compare September 26, 2024 13:47

mgouicem merged commit eb146c4 into oneapi-src:rls-v3.6 Sep 30, 2024
12 checks passed

vpirogov modified the milestones: v3.7, v3.6 Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpu: nvidia: Add support for cublaslt matmul #2124

gpu: nvidia: Add support for cublaslt matmul #2124

ShanoToni commented Sep 26, 2024

mgouicem commented Sep 26, 2024

mgouicem left a comment

gpu: nvidia: Add support for cublaslt matmul #2124

gpu: nvidia: Add support for cublaslt matmul #2124

Conversation

ShanoToni commented Sep 26, 2024

Description

mgouicem commented Sep 26, 2024

mgouicem left a comment

Choose a reason for hiding this comment