Issues: vllm-project/vllm
Issues list
[Encoder Decoder] Add flash_attn kernel support for encoder-decoder models
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#9559
opened Oct 21, 2024 by
sroy745
Pytorch hete spec
needs-rebase
ready
#9266
opened Oct 11, 2024 by
jiqing-feng
[Core][VLM] Add support for placeholder token content hashes
needs-rebase
#8348
opened Sep 10, 2024 by
petersalas
[Core][VLM] Add precise multi-modal placeholder tracking
needs-rebase
#8346
opened Sep 10, 2024 by
petersalas
[Hardware][Ascend] Add Ascend NPU backend
needs-rebase
#8054
opened Aug 31, 2024 by
wangshuai09
Draft
11 of 12 tasks
[Misc] p90 and p95 for serving benchmark
needs-rebase
stale
#7062
opened Aug 2, 2024 by
UranusSeven
[Frontend] Add security scheme to server
frontend
needs-rebase
stale
#7021
opened Aug 1, 2024 by
g-parki
[CI/Build] Adding timeout in CPU CI to avoid CPU test queue blocking
ci/build
needs-rebase
ready
stale
#6892
opened Jul 29, 2024 by
bigPYJ1151
Add required libcuda.so
ci/build
needs-rebase
ready
#6864
opened Jul 27, 2024 by
sdake
[Model] Add support for Qwen2 for embeddings
needs-rebase
stale
#5611
opened Jun 17, 2024 by
mgoin
[Kernel][Core][WIP] Tree attention and parallel decoding
needs-rebase
stale
#4325
opened Apr 24, 2024 by
yukavio
[Model] Add moondream vision language model
documentation
Improvements or additions to documentation
needs-rebase
stale
#4228
opened Apr 20, 2024 by
vikhyat
[Frontend] Entrypoint for hosting local Kobold Lite chat interface
frontend
needs-rebase
stale
#4096
opened Apr 15, 2024 by
mgoin
[Model] Adding sliding window support for block table [#3665]
needs-rebase
stale
#3967
opened Apr 10, 2024 by
ruthe98
[Bugfix] Add Prefix Caching Warmup Step
action-required
needs-rebase
stale
#3901
opened Apr 7, 2024 by
robertgshaw2-neuralmagic
Dynamic Multi LoRA Load \ Delete Support
frontend
needs-rebase
stale
#3496
opened Mar 19, 2024 by
gauravkr2108