Skip to content

Add specialized CUDA kernels for multi-head attention with various head dimensions#30

Merged
LoserCheems merged 51 commits intomainfrom
Support-sparse_gemm
Jun 26, 2025
Merged

Add specialized CUDA kernels for multi-head attention with various head dimensions#30
LoserCheems merged 51 commits intomainfrom
Support-sparse_gemm

Commits

Commits on Jun 26, 2025