[Fix] Fix bugs in scheduler #1727

linotfan · 2023-11-20T13:54:55Z

Fixes #1673 (llm.generate hangs for long prompts): if a sequence group requires more GPU blocks than the limit allows, it is never scheduled and stays in the waiting queue forever.
Fixes: 'seq_lens' can be empty, in which case 'max(seq_lens)' raises an error.
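
For illustration only, here is a minimal sketch of the two failure modes described above. It is not the code changed in this PR: `ToySeqGroup`, `schedule_waiting`, and `max_seq_len` are hypothetical names, and the block accounting is simplified to plain integers.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class ToySeqGroup:
    """Hypothetical stand-in for a sequence group; not a vLLM class."""
    request_id: str
    blocks_needed: int


def schedule_waiting(waiting, total_blocks, free_blocks):
    """Admit waiting groups in FIFO order.

    Returns (scheduled, ignored). A group that could never fit, even with
    every block free, is moved to `ignored` instead of blocking the queue.
    """
    scheduled, ignored = [], []
    while waiting:
        group = waiting[0]
        if group.blocks_needed > total_blocks:
            # Can never be satisfied: remove it so it does not wait forever.
            ignored.append(waiting.popleft())
            continue
        if group.blocks_needed > free_blocks:
            # Fits in principle, but not right now: retry on a later step.
            break
        free_blocks -= group.blocks_needed
        scheduled.append(waiting.popleft())
    return scheduled, ignored


def max_seq_len(seq_lens):
    """Guarded maximum: max([]) raises ValueError, so supply a default."""
    return max(seq_lens, default=0)
```

Without the `total_blocks` check, the oversized group would stay at the head of the queue and nothing behind it would ever be scheduled. Likewise, calling `max` on an empty list without a default (or an explicit emptiness check) raises `ValueError`.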

zhuohan123

LGTM! The first bug is fixed in #1534 and I'll merge the second bug fix. Thank you for your contribution!

linotfan added 3 commits November 17, 2023 20:01

Add a word: #FIXME(woosuk): Do 'not' use internal method

91e2c9b

fix bugs

53f8bea

format

d482df6

linotfan closed this Nov 20, 2023

linotfan reopened this Nov 20, 2023

linotfan added 2 commits November 20, 2023 22:10

format

ed33323

remove unnecessary parens

cfae741

simon-mo requested review from zhuohan123 and WoosukKwon November 20, 2023 19:57

zhuohan123 added 2 commits November 21, 2023 00:07

Merge branch 'main' into linotfan/main

a1b6e92

merge with main

8c7fd87

zhuohan123 approved these changes Nov 21, 2023

zhuohan123 merged commit 19849db into vllm-project:main Nov 21, 2023

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

[Fix] Fix bugs in scheduler (vllm-project#1727)

d5d2a79
