Closed as not planned
Labels
release (Related to new version release)
Description
Update (02/03/2025):
- This has been renamed to v0.7.3, as we are releasing v0.7.2 for MLA bug fixes, the transformers backend, and Qwen2.5-VL.
Update (01/31/2025):
- This has been renamed to v0.7.2, as we are releasing v0.7.1 for DeepSeek enhancements.
Blockers
- Support for Qwen-1M: "Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support" (#11844)
- Support for Baichuan-M1: "[Model] Enable Inference Support for the New Baichuan-M1 Model" (#12251)