Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[V0 deprecation] Guided decoding ci/build frontend needs-rebase ready ONLY add when PR is ready to merge/full CI is needed structured-output v1
#21347 opened Jul 22, 2025 by rzabarazesh Loading…
1 of 4 tasks
Convert tests to ruff-format deepseek Related to DeepSeek models llama Related to Llama models multi-modality Related to multi-modality (#4194) needs-rebase performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output tool-calling tpu Related to Google TPUs v1
#21129 opened Jul 17, 2025 by hmellor Loading…
[benchmark] add max-concurrency in result table performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed structured-output
#21095 opened Jul 17, 2025 by panpan0000 Loading…
2 of 3 tasks
[V0 deprecation] Removal V0 structured outputs needs-rebase ready ONLY add when PR is ready to merge/full CI is needed structured-output
#20928 opened Jul 14, 2025 by aarnphm Loading… v0.10.0
[WIP][RC] Update PyTorch to 2.8.0 ci/build deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend llama Related to Llama models multi-modality Related to multi-modality (#4194) new-model Requests to new models performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output tool-calling v1
#20358 opened Jul 2, 2025 by huydhn Draft
Add support for Prithvi geospatial model in serving mode documentation Improvements or additions to documentation frontend multi-modality Related to multi-modality (#4194) needs-rebase new-model Requests to new models structured-output v1
#20307 opened Jul 1, 2025 by mgazz Draft
1 of 4 tasks
Draft: WIP NixlConnector allow configurable handshake backend +HTTP ci/build documentation Improvements or additions to documentation frontend llama Related to Llama models multi-modality Related to multi-modality (#4194) needs-rebase performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm structured-output tool-calling v1
#19447 opened Jun 10, 2025 by wseaton Loading…
3 of 7 tasks
[Frontend] Added support for HermesToolParser for models without special tokens ci/build documentation Improvements or additions to documentation frontend llama Related to Llama models multi-modality Related to multi-modality (#4194) performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output tool-calling v1
#16890 opened Apr 20, 2025 by minpeter Loading…
Adding Share Expert Fusion for DeepSeek ci/build deepseek Related to DeepSeek models needs-rebase performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm speculative-decoding structured-output v1
#15502 opened Mar 25, 2025 by DiegoD94 Loading…
[Frontend] Skip stop in reasoning content documentation Improvements or additions to documentation frontend needs-rebase structured-output
#14550 opened Mar 10, 2025 by gaocegege Loading…
2
8
ProTip! What’s not been updated in a month: updated:<2025-06-23.