-
-
Notifications
You must be signed in to change notification settings - Fork 5k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Misc] Move some multimodal utils to modality-specific modules
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#11494
opened Dec 25, 2024 by
DarkLight1337
Loading…
[Model][LoRA]LoRA support added for MolmoForCausalLM
ci/build
documentation
Improvements or additions to documentation
frontend
needs-rebase
#11439
opened Dec 23, 2024 by
ayylemao
Loading…
fix: add missing bos_token to example templates
ci/build
needs-rebase
#11432
opened Dec 23, 2024 by
toslunar
Loading…
[Misc] Add image repeat option to benchmark_serving.py (to test hit/miss of MM cache)
needs-rebase
#11177
opened Dec 13, 2024 by
alexm-neuralmagic
Loading…
[V1] Supports scheduling asynchronousization on V1 version
needs-rebase
#11133
opened Dec 12, 2024 by
lixiaolx
Loading…
[Core] Support offloading KV cache to CPU
frontend
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#10874
opened Dec 3, 2024 by
ApostaC
Loading…
[V1] VLM prefix caching: Add hashing of images
needs-rebase
#10497
opened Nov 20, 2024 by
alexm-neuralmagic
•
Draft
Update outlines support to v0.1.4
ci/build
needs-rebase
#10490
opened Nov 20, 2024 by
Treparme
Loading…
[V1] Replace traversal search with lookup table
needs-rebase
#10486
opened Nov 20, 2024 by
Abatom
Loading…
Add support for reporting metrics in completion response headers in o…
frontend
needs-rebase
#10484
opened Nov 20, 2024 by
coolkp
Loading…
Compressed tensors w8a8 tpu
needs-rebase
#10435
opened Nov 18, 2024 by
robertgshaw2-neuralmagic
•
Draft
[Core] Interface for accessing model from engine
needs-rebase
#10353
opened Nov 15, 2024 by
DarkLight1337
Loading…
Rahul quant merged
ci/build
needs-rebase
#10341
opened Nov 14, 2024 by
robertgshaw2-neuralmagic
•
Draft
[Kernel] Add CUTLASS sparse support, heuristics, and torch operators
ci/build
needs-rebase
#10340
opened Nov 14, 2024 by
Faraz9877
Loading…
[Feature] enable host memory for kv cache
needs-rebase
#10330
opened Nov 14, 2024 by
YZP17121579
•
Draft
[Core][Frontend] Add faster-outlines as guided decoding backend
ci/build
needs-rebase
#10277
opened Nov 13, 2024 by
unaidedelf8777
Loading…
[V1] TPU Prototype
ci/build
needs-rebase
#10241
opened Nov 12, 2024 by
robertgshaw2-neuralmagic
•
Draft
6 tasks
[Frontend][Core] Add Guidance backend for guided decoding
ci/build
needs-rebase
#10217
opened Nov 11, 2024 by
JC1DA
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.