Conversation

@luccafong
Collaborator

@luccafong luccafong commented Sep 22, 2025

Summary: Allow skipping the ready check for bench serve through `--ready-check-timeout-sec 0`

Test Plan: `vllm bench serve --ready-check-timeout-sec 0`

Differential Revision: D82995002

@facebook-github-bot

@luccafong has exported this pull request. If you are a Meta employee, you can view the originating diff in D82995002.

@mergify mergify bot added the performance Performance-related issues label Sep 22, 2025
@luccafong luccafong changed the title allow skip ready check for bench serve [benchmarks]allow skip ready check for bench serve Sep 22, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a --skip-ready-check flag for the bench serve command, allowing users to bypass the initial endpoint readiness check. The implementation is clear and correctly adds the new command-line argument and conditional logic. I've added one comment regarding user-facing log messages to improve clarity when the check is skipped. Additionally, while the change is functionally correct, it would benefit from corresponding test cases to verify the new flag's behavior and prevent future regressions.
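As a hypothetical sketch of what such a flag and its conditional logic could look like (the parser, function name, and return values here are illustrative, not the actual vLLM code):

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Illustrative sketch of registering the flag; not the real
    # vllm bench serve argument parser.
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--skip-ready-check",
        action="store_true",
        help="Skip the initial endpoint readiness check before benchmarking.",
    )
    return parser


def maybe_wait_for_endpoint(args: argparse.Namespace) -> str:
    # Conditional logic: only perform the readiness check when not skipped.
    if args.skip_ready_check:
        return "skipped"
    return "waited"
```

With `store_true`, the flag defaults to `False`, so existing invocations keep the readiness check unchanged.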

@yeqcharlotte
Collaborator

We can achieve the same purpose by setting `--ready-check-timeout-sec=0`?

@luccafong
Collaborator Author

We can achieve the same purpose by setting `--ready-check-timeout-sec=0`?

I think that will raise an error directly.
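For context on why a zero timeout raises rather than skips: in a deadline-based wait loop, a timeout of zero means the deadline has already passed on entry, so the check fails immediately even if the server is up. A minimal illustrative sketch (not the actual vLLM code):

```python
import time


def wait_for_ready(is_ready, timeout_sec: float) -> None:
    # Illustrative deadline loop: with timeout_sec=0 the deadline has
    # already passed on entry, so the loop body never runs and the
    # check raises instead of being skipped.
    deadline = time.monotonic() + timeout_sec
    while time.monotonic() < deadline:
        if is_ready():
            return
        time.sleep(0.1)
    raise TimeoutError("endpoint not ready within timeout")
```

This is why a dedicated skip flag behaves differently from simply zeroing the timeout.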

Lucia (Lu) Fang and others added 2 commits September 22, 2025 16:05
Summary: Allow skipping the ready check for bench serve through `--skip-ready-check`

Test Plan: `vllm bench serve --skip-ready-check`

Differential Revision: D82995002

Signed-off-by: Lu Fang <fanglu@fb.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lu Fang <fanglu@fb.com>
Member

@ywang96 ywang96 left a comment


What's the use case for this? I actually found the wait_for_endpoint util pretty handy (now I don't need to fire the vllm serve and vllm bench serve commands sequentially).

@luccafong
Collaborator Author

What's the use case for this? I actually found the wait_for_endpoint util pretty handy (now I don't need to fire the vllm serve and vllm bench serve commands sequentially).

When the workload is huge, we don't want to wait for one request to complete.

@ywang96
Member

ywang96 commented Sep 23, 2025

When the workload is huge, we don't want to wait for one request to complete.

Ah ok - then I think it's probably better to modify wait_for_endpoint so that it pings /health instead of sending one dummy request. (And maybe we add another flag `--do-validate-benchmark-dataset`.) Do you think that makes more sense?

I also don't have a strong opinion on this current PR, so I'm going to approve it.
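The suggested change could look roughly like the following sketch, which polls /health rather than issuing a dummy completion request (the function name, URL handling, and polling intervals are illustrative assumptions, not the actual wait_for_endpoint implementation):

```python
import time
import urllib.error
import urllib.request


def wait_for_health(base_url: str, timeout_sec: float = 600.0) -> bool:
    # Poll the server's /health endpoint until it answers 200, instead
    # of waiting for a full dummy request to complete. Returns True once
    # the server is up, False if the deadline expires first.
    deadline = time.monotonic() + timeout_sec
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(f"{base_url}/health", timeout=5) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            pass  # server not up yet; keep polling
        time.sleep(1.0)
    return False
```

A /health probe only confirms the server is listening, so it stays cheap even when the first real request would be expensive to serve, which addresses the huge-workload concern above.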

@ywang96 ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 23, 2025
@ywang96 ywang96 enabled auto-merge (squash) September 23, 2025 01:38
@minosfuture
Contributor

When the workload is huge, we don't want to wait for one request to complete.

Ah ok - then I think it's probably better to modify wait_for_endpoint so that it pings /health instead of sending one dummy request. (And maybe we add another flag `--do-validate-benchmark-dataset`.) Do you think that makes more sense?

I also don't have a strong opinion on this current PR, so I'm going to approve it.

Agree that pinging /health would be a better solution, so we don't completely skip the readiness check (but just skip the single-request test).

@ywang96 ywang96 merged commit eea1783 into vllm-project:main Sep 23, 2025
42 checks passed
@ywang96
Member

ywang96 commented Sep 23, 2025

When the workload is huge, we don't want to wait for one request to complete.

Ah ok - then I think it's probably better to modify wait_for_endpoint so that it pings /health instead of sending one dummy request. (And maybe we add another flag `--do-validate-benchmark-dataset`.) Do you think that makes more sense?
I also don't have a strong opinion on this current PR, so I'm going to approve it.

Agree that pinging /health would be a better solution, so we don't completely skip the readiness check (but just skip the single-request test).

@minosfuture Feel free to make a follow-up PR!

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
Signed-off-by: charlifu <charlifu@amd.com>
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
gjc0824 pushed a commit to gjc0824/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
Signed-off-by: gaojc <1055866782@qq.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
sducouedic pushed a commit to sducouedic/vllm that referenced this pull request Oct 16, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Lu Fang <fanglu@fb.com>
Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia (Lu) Fang <fanglu@meta.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Labels

performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed

5 participants