MIT HAN Lab
Repositories
- nunchaku
  [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
- llm-awq
  [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
- torchquantum
  A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.
- efficientvit
  Efficient vision foundation models for high-resolution generation and perception.
- omniserve
  [MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention
- torchsparse
  [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.