Ph.D. student at MIT | NVIDIA | Waymo | OmniML
- Cambridge, Massachusetts, United States
- http://kentang.net
Pinned
- mit-han-lab/hart (Public): HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
- mit-han-lab/llm-awq (Public): [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
- mit-han-lab/qserve (Public): QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
- mit-han-lab/spvnas (Public archive): [ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
- mit-han-lab/bevfusion (Public archive): [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
- mit-han-lab/torchsparse (Public): [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs