Issues: vllm-project/vllm
Issues list
[Core] Cast multimodal input in hf processor
ci/build
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
tpu
Related to Google TPUs
v1
#18862
opened May 28, 2025 by
lgeiger
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets
documentation
Improvements or additions to documentation
speculative-decoding
#18847
opened May 28, 2025 by
ekagra-ranjan
[CI] change spell checker from codespell to typos
documentation
frontend
multi-modality
speculative-decoding
tool-calling
tpu
v1
#18711
opened May 26, 2025 by
andyxning
[Model][Speculative Decoding] Integrate PARD into vLLM
speculative-decoding
#18541
opened May 22, 2025 by
zihaoanllm
[Bugfix] Fix spec decode on non-cuda platforms
speculative-decoding
#18501
opened May 21, 2025 by
rand-fly
[Core] Add support for sampling penalties to v1 ngram speculative decoding
speculative-decoding
v1
#18441
opened May 20, 2025 by
pooyadavoodi
[V1] [Spec decode] Llama4 type eagle support in v1
speculative-decoding
v1
#18369
opened May 19, 2025 by
RonaldBXu
[RFC]: Enabling Suffix Decoding, LSTM Speculator, Sequence Parallelism from Arctic Inference
RFC
speculative-decoding
#18037
opened May 13, 2025 by
sfc-gh-aqiao
1 task done
[V1] LogitsProcessor programming model
ci/build
documentation
frontend
multi-modality
speculative-decoding
structured-output
tool-calling
v1
#16728
opened Apr 16, 2025 by
afeldman-nm
[V1] Add request-level, per-step acceptance counts tracking for spec dec.
documentation
needs-rebase
speculative-decoding
v1
#16367
opened Apr 9, 2025 by
luyuzhe111
[V1][Spec Decode] Add random seed for EAGLE and its test script
needs-rebase
speculative-decoding
v1
#16235
opened Apr 8, 2025 by
wwl2755
[V1][Spec Decode] Non greedy sample with EAGLE / Reduce memory allocation for Rejection Sampler
documentation
needs-rebase
speculative-decoding
v1
#16077
opened Apr 4, 2025 by
ekagra-ranjan
2 tasks done
[SpecDecode] Support EAGLE in V1
speculative-decoding
v1
#15901
opened Apr 1, 2025 by
WoosukKwon
7 of 10 tasks
[Misc] Disable pin_memory in AsyncMetricsCollector for spec decode tensor allocation
needs-rebase
speculative-decoding
#15886
opened Apr 1, 2025 by
esmeetu
[Misc] Improve cli help show
ci/build
needs-rebase
speculative-decoding
#15455
opened Mar 25, 2025 by
reidliu41
[Bugfix]: Fix Prometheus spec decode counter sum-of-sums
speculative-decoding
v0
#15415
opened Mar 24, 2025 by
alugowski
[SpecDecode] Make spec decoding extensible to different backends
ci/build
speculative-decoding
#15195
opened Mar 20, 2025 by
MengqingCao
[Spec Decode] Make speculative decoding compatible with pipeline parallelism
needs-rebase
speculative-decoding
#15173
opened Mar 20, 2025 by
xyang16
[Frontend]Reduce vLLM's import time
ci/build
frontend
multi-modality
needs-rebase
speculative-decoding
structured-output
v1
#15128
opened Mar 19, 2025 by
Chen-0210
[Bugfix] Fix hidden_states reshape failed and no_proposals error when…
speculative-decoding
#15032
opened Mar 18, 2025 by
ptkang
[Feature] Eagle Chunked Prefill Support
speculative-decoding
#14922
opened Mar 17, 2025 by
luyuzhe111