Awesome-Triton-Kernels

Collection of kernels written in Triton language (didn't seem to be a lot till now). Welcoming contribution!

Main Repo by OpenAI

Official Tutorials

Awesome resources from cuda-mode, their guide to Triton

Triton Kernel collection by cuda-mode

Puzzles by Sasha Rush

General Operators

attorch subset of PyTorch's nn module

Kernels by PyTorch Labs

scattermoe: Sparse Mixture-of-Experts

Transformer

Liger Kernel: Efficient Triton Kernels for LLM Training

Flash Linear Attention

FLASHNN for LLM Serving

Kernels by Kernl

Kernels by Unsloth

GPTQ by fpgaminer

GPTQ on PyTorch blog

FlagAttention, memory-efficient attention kernels

Activations

Activation functions by dogukantai

Matrix Operations

Sparse Toolkit: Block-sparse matrix multiplication (paper)

GemLite: Fused low-bit matrix multiplication

Special operations

EquiTriton for equivariant NN by IntelLabs

Benchmark

TritonBench by PyTorch

Integrations