Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Benchmark] Report prefix cache hit rate in vllm bench serve performance Performance-related issues
#46938 opened Jun 28, 2026 by yuyz-cyber Loading…
[Bugfix][GB10] Fix negative CUDA graph memory estimate on unified-memory GPUs (#44740) bug Something isn't working nvidia v1
#46932 opened Jun 27, 2026 by WindChimeRan Contributor Loading…
3 of 4 tasks
[Hardware][AMD][CI] Tweak mirrored tests; improve CI base dependency change detection ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#46930 opened Jun 27, 2026 by mawong-amd Contributor Loading…
4 tasks
docs: add OpenAI server production hardening checklist documentation Improvements or additions to documentation
#46922 opened Jun 27, 2026 by alexchenyu Loading…
[Bugfix] Package example Jinja chat templates in wheels bug Something isn't working ci/build frontend
#46921 opened Jun 27, 2026 by kainoj Loading…
4 tasks done
fix(flashinfer): guard trtllm MoE behind x86_64 check nvidia
#46917 opened Jun 27, 2026 by matdou Loading…
5 of 8 tasks
[communication] [bugfix] fix quickreduce acc error in cudagraph mode bug Something isn't working nvidia
#46913 opened Jun 27, 2026 by haoyangli0109 Contributor Loading…
[Perf] Fuse DFlash cache insert kernel qwen Related to Qwen models
#46911 opened Jun 27, 2026 by gcanlin Contributor Loading…
4 tasks
AFD deepseek Related to DeepSeek models frontend v1
#46909 opened Jun 27, 2026 by zingercode Loading…
4 tasks
[Bugfix] Handle list slot mappings in attention context bug Something isn't working
#46908 opened Jun 27, 2026 by zupengwang Loading…
[CPU][Bugfix] Build cpu_fused_moe on Apple Silicon bug Something isn't working ci/build cpu Related to CPU backends
#46907 opened Jun 27, 2026 by yuyz-cyber Loading…
3 of 4 tasks
Feat/oscar kv documentation Improvements or additions to documentation nvidia v1
#46903 opened Jun 27, 2026 by pranavthakur0-0 Loading…
Bump the minor-update group across 1 directory with 149 updates ci/build dependencies Pull requests that update a dependency file nvidia rocm Related to AMD ROCm
#46902 opened Jun 27, 2026 by dependabot Bot Loading…
[MoE] [MoE Refactor] Migrate int8 w4a8int8 oracle 37753
#46901 opened Jun 27, 2026 by qyYue1389 Contributor Loading…
[Docs] Add Phi-4-mini-instruct to batch invariance tested models documentation Improvements or additions to documentation
#46900 opened Jun 27, 2026 by CHIPMUNK-T0T Loading…
4 tasks done
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.