-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Frontend] Rename and auto-detect Improvements or additions to documentation
frontend
--chat-template-text-format
documentation
#9919
opened Nov 1, 2024 by
DarkLight1337
•
Draft
[Frontend]: enable state callbacks for offline inference
ci/build
frontend
#9780
opened Oct 29, 2024 by
sethkimmel3
Loading…
[Model] add tool parser for openbmb/MiniCPM3-4B
documentation
Improvements or additions to documentation
frontend
#9762
opened Oct 28, 2024 by
Cppowboy
Loading…
[Model] Add support for H2OVL-Mississippi models
documentation
Improvements or additions to documentation
frontend
#9747
opened Oct 28, 2024 by
cooleel
Loading…
Adds method to read the pooling types from model's files
frontend
#9506
opened Oct 18, 2024 by
flaviabeo
Loading…
【Frontend】Add sampler_priority and repetition_penalty_range
frontend
#9485
opened Oct 18, 2024 by
ZeroYuJie
Loading…
[Feature] [Spec decode]: Combine chunked prefill with speculative decoding
ci/build
documentation
Improvements or additions to documentation
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#9291
opened Oct 11, 2024 by
NickLucche
Loading…
3 of 5 tasks
[Frontend] Tool calling parser for Granite 3.0 models
ci/build
documentation
Improvements or additions to documentation
frontend
#9027
opened Oct 2, 2024 by
maxdebayser
Loading…
[Frontend] Add security scheme to server
frontend
needs-rebase
stale
#7021
opened Aug 1, 2024 by
g-parki
Loading…
[Frontend] Entrypoint for hosting local Kobold Lite chat interface
frontend
needs-rebase
stale
#4096
opened Apr 15, 2024 by
mgoin
Loading…
Dynamic Multi LoRA Load \ Delete Support
frontend
needs-rebase
stale
#3496
opened Mar 19, 2024 by
gauravkr2108
Loading…
[Core] Support thread-based async tokenizer pools
frontend
needs-rebase
stale
#3449
opened Mar 16, 2024 by
njhill
Loading…
[Frontend] support new lora module to a live server in OpenAI Entrypoints
frontend
needs-rebase
stale
#3446
opened Mar 16, 2024 by
AlphaINF
Loading…
Implement structured engine for parsing json grammar by token with
response_format: {type: json_object}
frontend
needs-rebase
stale
#3328
opened Mar 12, 2024 by
pathorn
Loading…
feat: quadratic + cubic sampling
frontend
needs-rebase
stale
#3167
opened Mar 3, 2024 by
AlpinDale
Loading…
Fix: Echo without asking for new tokens or logprobs in OpenAI Completions API
frontend
needs-rebase
stale
#2995
opened Feb 22, 2024 by
matheper
Loading…
FIX: llm entry, for enable lora
frontend
needs-rebase
stale
#2889
opened Feb 16, 2024 by
suncl1990
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.