[Inference] Support FP16/BF16 Flash Attention 2 And Add high_precision Flag To Rotary Embedding #5461

Merged
isky-cd merged 13 commits into hpcaitech:feature/colossal-infer from isky-cd:context_flash_attn_branch
Mar 25, 2024

Commits

Commits on Mar 14, 2024

Commits on Mar 15, 2024

Commits on Mar 19, 2024

Commits on Mar 20, 2024

Commits on Mar 21, 2024