Issues: vllm-project/vllm
Issues list
Update amd-installation.md
ci/build
documentation
Improvements or additions to documentation
#11470
opened Dec 24, 2024 by
johnnynunez
[Frontend] improve hermes_tool_parser.py
ci/build
frontend
#11453
opened Dec 24, 2024 by
paulcx
[Platform] More consistent entrypoints across different platforms
ci/build
#11448
opened Dec 24, 2024 by
terrytangyuan
[Model][LoRA]LoRA support added for MolmoForCausalLM
ci/build
documentation
frontend
needs-rebase
#11439
opened Dec 23, 2024 by
ayylemao
fix: add missing bos_token to example templates
ci/build
#11432
opened Dec 23, 2024 by
toslunar
Bump helm/kind-action from 1.10.0 to 1.11.0
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#11424
opened Dec 23, 2024 by
dependabot
bot
[V1] Optimize block table transfer from CPU to GPU
ci/build
#11401
opened Dec 22, 2024 by
WoosukKwon
Draft
[VLM] Support caching in merged multi-modal processor
ci/build
documentation
#11396
opened Dec 21, 2024 by
DarkLight1337
Update Dockerfile.tpu pin to nightly torch_xla
ci/build
#11309
opened Dec 18, 2024 by
ManfeiBai
[CI/Build] Adds Modal runners for performance benchmark
ci/build
#11239
opened Dec 16, 2024 by
erik-dunteman
[Platform] Add platform pluggable framework
ci/build
documentation
#11222
opened Dec 16, 2024 by
wangxiyuan
[Hardware][CPU] Multi-LoRA implementation for the CPU backend
ci/build
documentation
frontend
#11100
opened Dec 11, 2024 by
Akshat-Tripathi
Avoid mistakenly picking Gaudi/HPU if XPU is requested.
ci/build
#11018
opened Dec 9, 2024 by
janimo
[CI]add genai-perf benchmark in nightly benchmark
ci/build
nightly-benchmarks
perf-benchmarks
#10704
opened Nov 27, 2024 by
jikunshang
[Core] Integrate Fastsafetensor loader for loading model weights
ci/build
documentation
#10647
opened Nov 26, 2024 by
manish-sethi
Draft