Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gpu: nvidia: Add support for cublaslt matmul #2124

Merged
merged 1 commit into from
Sep 30, 2024

Conversation

ShanoToni
Copy link
Contributor

Description

Adds support for using the cublaslt API for IMMA kernels and when the bias and/or relu post-op can be merged into the cublaslt epilogue.

@ShanoToni ShanoToni requested review from a team as code owners September 26, 2024 12:45
@mgouicem
Copy link
Contributor

make test
disable device_cpu
enable device_gpu
enable thr_cuda
enable arch_rtx

Copy link
Contributor

@mgouicem mgouicem left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems some copyright issues escaped (and the same are present in main). I wonder why they were not caught in the original PR Lightweight scans...

src/gpu/nvidia/cudnn_reorder_lt.hpp Outdated Show resolved Hide resolved
cmake/FindcublasLt.cmake Outdated Show resolved Hide resolved
src/common/memory_desc.hpp Show resolved Hide resolved
@github-actions github-actions bot added platform:gpu-nvidia Codeowner: @oneapi-src/onednn-gpu-nvidia platform:gpu-generic Codeowner: @oneapi-src/onednn-gpu-generic backport labels Sep 26, 2024
@mgouicem mgouicem merged commit eb146c4 into oneapi-src:rls-v3.6 Sep 30, 2024
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport platform:gpu-generic Codeowner: @oneapi-src/onednn-gpu-generic platform:gpu-nvidia Codeowner: @oneapi-src/onednn-gpu-nvidia
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants