Pinned

  1. flash-linear-attention Public

    🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

    Python · 2.7k stars · 193 forks

  2. flame Public

    🔥 A minimal training framework for scaling FLA models

    Python · 166 stars · 22 forks

  3. native-sparse-attention Public

    🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

    Python · 695 stars · 30 forks

Repositories

Showing 10 of 11 repositories
