Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned Loading

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 7.1k 394

  2. llm-awq llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 3.4k 283

  3. efficientvit efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python 3.2k 232

  4. bevfusion bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 2.9k 528

  5. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2.2k 422

  6. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.9k 345

Repositories

Showing 10 of 62 repositories
  • fastrl Public

    Efficient Reinforcement Learning for Language Models

    mit-han-lab/fastrl’s past year of commit activity
    Python 50 Apache-2.0 2 0 0 Updated Nov 21, 2025
  • flash-moba Public
    mit-han-lab/flash-moba’s past year of commit activity
    C++ 174 BSD-3-Clause 6 1 0 Updated Nov 20, 2025
  • radial-attention Public

    [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

    mit-han-lab/radial-attention’s past year of commit activity
    Python 558 Apache-2.0 29 14 1 Updated Nov 11, 2025
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    mit-han-lab/torchquantum’s past year of commit activity
    Jupyter Notebook 1,569 MIT 242 69 (4 issues need help) 9 Updated Oct 28, 2025
  • streaming-vlm Public

    StreamingVLM: Real-Time Understanding for Infinite Video Streams

    mit-han-lab/streaming-vlm’s past year of commit activity
    Python 721 MIT 45 9 0 Updated Oct 15, 2025
  • efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    mit-han-lab/efficientvit’s past year of commit activity
    Python 3,151 Apache-2.0 232 108 0 Updated Sep 5, 2025
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    mit-han-lab/llm-awq’s past year of commit activity
    Python 3,355 MIT 283 169 10 Updated Jul 18, 2025
  • lpd Public

    Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

    mit-han-lab/lpd’s past year of commit activity
    Python 80 MIT 6 1 0 Updated Jul 14, 2025
  • Quest Public

    [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

    mit-han-lab/Quest’s past year of commit activity
    Cuda 353 MIT 38 4 0 Updated Jul 11, 2025
  • x-attention Public

    [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

    mit-han-lab/x-attention’s past year of commit activity
    Python 255 15 9 0 Updated Jul 7, 2025