Commits
Commit History
Commits on Jun 1, 2024
[Minor] Fix the path typo in loader.py: save_sharded_states.py -> save_sharded_state.py (vllm-project#5151)
Commits on May 31, 2024
Revert "[Kernel] Marlin_24: Ensure the mma.sp instruction is using the ::ordered_metadata modifier (introduced with PTX 8.5)" (vllm-project#5149)
[Kernel] Marlin_24: Ensure the mma.sp instruction is using the ::ordered_metadata modifier (introduced with PTX 8.5) (vllm-project#5136)
Commits on May 30, 2024
Commits on May 29, 2024
[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (vllm-project#4837)