Tags: vllm-project/vllm
Toggle v0.10.0rc1's commit message
Enable v1 metrics tests (#20953 )
Signed-off-by: Seiji Eicher <seiji@anyscale.com>
Toggle v0.9.2's commit message
Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (#20412 )"
This reverts commit e202dd2 .
Toggle v0.9.2rc2's commit message
Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (#20412 )"
This reverts commit e202dd2 .
Toggle v0.9.2rc1's commit message
[Misc] Remove _maybe_ignore_quant_config from GLM4.1v (#20432 )
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Toggle v0.9.1's commit message
[Misc] Slight improvement of the BNB (#19418 )
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Toggle v0.9.1rc2's commit message
[Misc] Slight improvement of the BNB (#19418 )
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Toggle v0.9.1rc1's commit message
[Misc] Fix a config typo in disable_hybrid_kv_cache_manager configura…
…tion (#19383 )
Signed-off-by: Siyuan Liu <lsiyuan@google.com>
Toggle v0.9.0.1's commit message
[BugFix] FA2 MLA Accuracy Issue (#18807 )
Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
Toggle v0.9.0's commit message
[Bugfix] Mistral tool calling when content is list (#18729 )
Signed-off-by: mgoin <mgoin64@gmail.com>
Toggle v0.8.5.post1's commit message
[BugFix][Attention] Fix sliding window attention in V1 giving incorre…
…ct results (#17574 )
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
You can’t perform that action at this time.