Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Misc] Remove SqueezeLLM ready ONLY add when PR is ready to merge/full CI is needed
#8220 opened Sep 6, 2024 by dsikka Loading…
[BugFix] Fix Granite model configuration ready ONLY add when PR is ready to merge/full CI is needed
#8216 opened Sep 5, 2024 by njhill Loading…
[Misc] Upgrade vllm-flash-attn to v2.6.2 ready ONLY add when PR is ready to merge/full CI is needed
#8211 opened Sep 5, 2024 by WoosukKwon Loading…
Fix shutdown problem
#8209 opened Sep 5, 2024 by Bye-legumes Loading…
[Model] Adding Granite MoE.
#8206 opened Sep 5, 2024 by shawntan Loading…
[CI/Build] Increasing timeout for multiproc worker tests ready ONLY add when PR is ready to merge/full CI is needed
#8203 opened Sep 5, 2024 by alexeykondrat Loading…
Reshape cache to be XQA kernel compatible
#8200 opened Sep 5, 2024 by wenscarl Loading…
[Frontend] Add --logprobs argument to benchmark_serving.py ready ONLY add when PR is ready to merge/full CI is needed
#8191 opened Sep 5, 2024 by afeldman-nm Loading…
[Core/Bugfix] pass VLLM_ATTENTION_BACKEND to ray workers ready ONLY add when PR is ready to merge/full CI is needed
#8172 opened Sep 4, 2024 by SolitaryThinker Loading…
[Model] Allow loading from original Mistral format ready ONLY add when PR is ready to merge/full CI is needed
#8168 opened Sep 4, 2024 by patrickvonplaten Loading…
[Misc] remove peft as dependency for prompt models ready ONLY add when PR is ready to merge/full CI is needed
#8162 opened Sep 4, 2024 by prashantgupta24 Loading…
[Misc] add iteration_tokens metric ready ONLY add when PR is ready to merge/full CI is needed
#8140 opened Sep 4, 2024 by LucasWilkinson Loading…
[CI/Build] Use python 3.12 in cuda image ready ONLY add when PR is ready to merge/full CI is needed
#8133 opened Sep 3, 2024 by joerunde Loading…
[CI/Build] Enabling kernels tests for AMD, ignoring some of then that fail ready ONLY add when PR is ready to merge/full CI is needed rocm
#8130 opened Sep 3, 2024 by alexeykondrat Loading…
ProTip! Adding no:label will show everything without a label.