-
-
Notifications
You must be signed in to change notification settings - Fork 8.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[V0 deprecation] Guided decoding
ci/build
frontend
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
v1
#21347
opened Jul 22, 2025 by
rzabarazesh
Loading…
1 of 4 tasks
Convert Related to DeepSeek models
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
needs-rebase
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
speculative-decoding
structured-output
tool-calling
tpu
Related to Google TPUs
v1
tests
to ruff-format
deepseek
#21129
opened Jul 17, 2025 by
hmellor
Loading…
[benchmark] add max-concurrency in result table
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
#21095
opened Jul 17, 2025 by
panpan0000
Loading…
2 of 3 tasks
[V0 deprecation] Removal V0 structured outputs
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
structured-output
[WIP][RC] Update PyTorch to 2.8.0
ci/build
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
frontend
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
new-model
Requests to new models
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
speculative-decoding
structured-output
tool-calling
v1
Add support for Prithvi geospatial model in serving mode
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
needs-rebase
new-model
Requests to new models
structured-output
v1
Draft: WIP NixlConnector allow configurable handshake backend +HTTP
ci/build
documentation
Improvements or additions to documentation
frontend
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
needs-rebase
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
structured-output
tool-calling
v1
#19447
opened Jun 10, 2025 by
wseaton
Loading…
3 of 7 tasks
[Deprecation] Remove Improvements or additions to documentation
frontend
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
structured-output
v1
prompt_token_ids
arg fallback in LLM.generate
and LLM.embed
documentation
#18800
opened May 28, 2025 by
DarkLight1337
•
Draft
[V1][Feat] Fail request if FSM fails to advance
structured-output
v1
#18780
opened May 27, 2025 by
atbe
Loading…
[Fix] Auto-detect XGrammar compiler threads based on CPU cores.
documentation
Improvements or additions to documentation
needs-rebase
structured-output
v1
#17737
opened May 6, 2025 by
Ubospica
Loading…
[RFC][core][V1] generalize structured output manager and backends
needs-rebase
structured-output
tpu
Related to Google TPUs
v1
#17503
opened Apr 30, 2025 by
william-baker-inflection
Loading…
[benchmark][structured output] Add offline benchmark script for structured output
needs-rebase
performance
Performance-related issues
structured-output
#17440
opened Apr 30, 2025 by
lk-chen
Loading…
[Feature][Refactor][CLI] Rename guided to structured outputs, and Improvements or additions to documentation
frontend
needs-rebase
structured-output
tool-calling
v1
--structured-outputs-config
documentation
#17420
opened Apr 29, 2025 by
aarnphm
Loading…
[Frontend] Added support for HermesToolParser for models without special tokens
ci/build
documentation
Improvements or additions to documentation
frontend
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
performance
Performance-related issues
qwen
Related to Qwen models
rocm
Related to AMD ROCm
speculative-decoding
structured-output
tool-calling
v1
#16890
opened Apr 20, 2025 by
minpeter
Loading…
Enable Outlines with JSON Sub-Schema References
frontend
needs-rebase
structured-output
#15627
opened Mar 27, 2025 by
theobjectivedad
Loading…
Adding Share Expert Fusion for DeepSeek
ci/build
deepseek
Related to DeepSeek models
needs-rebase
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
speculative-decoding
structured-output
v1
#15502
opened Mar 25, 2025 by
DiegoD94
Loading…
[V1][Experimental] Jump-forward decoding
needs-rebase
qwen
Related to Qwen models
structured-output
v1
[Frontend]Reduce vLLM's import time
ci/build
deepseek
Related to DeepSeek models
frontend
multi-modality
Related to multi-modality (#4194)
needs-rebase
speculative-decoding
structured-output
v1
#15128
opened Mar 19, 2025 by
Chen-0210
Loading…
[Frontend] Skip Improvements or additions to documentation
frontend
needs-rebase
structured-output
stop
in reasoning content
documentation
#14550
opened Mar 10, 2025 by
gaocegege
Loading…
[Frontend] Adding the "User Defined Custom Tool Calling" parser for the Llama models
ci/build
documentation
Improvements or additions to documentation
frontend
llama
Related to Llama models
multi-modality
Related to multi-modality (#4194)
needs-rebase
speculative-decoding
structured-output
tool-calling
v1
#12752
opened Feb 4, 2025 by
lulmer
Loading…
ProTip!
What’s not been updated in a month: updated:<2025-06-23.