Skip to content

[kernel] Support new KCache Layout - Context Attention Triton Kernel#5658

Merged
yuanheng-zhao merged 4 commits intohpcaitech:feature/colossal-inferfrom
yuanheng-zhao:kernel/inference/triton/attn
Apr 26, 2024
Merged

[kernel] Support new KCache Layout - Context Attention Triton Kernel#5658
yuanheng-zhao merged 4 commits intohpcaitech:feature/colossal-inferfrom
yuanheng-zhao:kernel/inference/triton/attn

Commits

Commits on Apr 26, 2024