Issues: vllm-project/vllm
[Misc]: Will the kv-cache be computed and stored if max_tokens=1? (misc) #9902 · opened Nov 1, 2024 by donpromax
[help wanted]: add sliding window support for flashinfer (misc) #9854 · opened Oct 30, 2024 by youkaichao
[Misc]: Remove max_tokens field for chat completion requests when no longer supported by the OpenAI client (misc) #9845 · opened Oct 30, 2024 by gcalmettes
[Misc]: Eagle reformat checkpoint compatible with vLLM (misc) #9816 · opened Oct 29, 2024 by sssrijan-amazon
[Misc]: Unable to load Llama 3B model on A10 GPU (misc) #9753 · opened Oct 28, 2024 by hrsmanian
[Usage]: Qwen2VL model mrope implementation in CUDA graph (misc) #9546 · opened Oct 21, 2024 by gujiewen
[Misc]: offline inference gives inconsistent results for Qwen2-7B (misc) #9450 · opened Oct 17, 2024 by poppybrown
[Misc]: [Question] vLLM's model loading & instance contract: one model per vLLM instance, or multiple models per instance? (misc) #9429 · opened Oct 16, 2024 by yx-lamini
[Misc]: I'm trying to host my fine-tuned Llama-3-8B-Instruct in vLLM (misc) #9361 · opened Oct 15, 2024 by preethiisenthil
[Misc]: remove dropout-related code from the Triton flash attention kernel (misc) #9322 · opened Oct 13, 2024 by HaiShaw
[help wanted]: write tests for Python-only development (misc) #9315 · opened Oct 12, 2024 by youkaichao
[Misc]: Debugging the paged attention issue for customized LLMs (misc) #9231 · opened Oct 10, 2024 by protossw512
[Misc]: Segmentation Fault in vLLM API Server during Model Initialization (NCCL Error: Unhandled System Error) (misc) #9156 · opened Oct 8, 2024 by shreyasp-07
[Misc]: Need to understand support for torch.compile in Q4 roadmap (misc) #9072 · opened Oct 4, 2024 by amd-abhikulk
[Question]: Apply LoRA adapter on quantized model (misc) #8945 · opened Sep 29, 2024 by Tejaswgupta
[Misc]: Strange "leaked shared_memory" warnings reported by multiprocessing when using vLLM (misc) #8803 · opened Sep 25, 2024 by shaoyuyoung
[Tracking Issue][Help Wanted]: FlashInfer backend improvements (help wanted, misc) #8786 · opened Sep 24, 2024 by comaniac
[Misc]: Enable Dependabot to help manage known vulnerabilities in dependencies (misc) #8734 · opened Sep 23, 2024 by fcanogab