
Conversation

@DarkLight1337 (Member) commented Oct 22, 2025

Purpose

Sorry I didn't run the SLA script after the latest changes in #27168. This should be fixed now.

cc @lengrongfu

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
mergify bot added the performance (Performance-related issues) label Oct 22, 2025
@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request fixes an initialization bug in the SweepServeSLAArgs.from_cli_args method. The previous implementation incorrectly used super(), which led to a TypeError because the base class factory method attempted to instantiate the subclass with incomplete arguments. The fix, which calls the base class factory SweepServeArgs.from_cli_args directly, is correct and resolves the issue. The code change is sound and I have no further suggestions.
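
For context, here is a minimal sketch of the failure mode and the fix described above. The field names (`serve_cmd`, `bench_cmd`, `sla_params`) are illustrative placeholders, not the actual vLLM definitions:

```python
from argparse import Namespace
from dataclasses import dataclass, fields


@dataclass
class SweepServeArgs:
    # Base sweep-serve arguments (illustrative fields).
    serve_cmd: str
    bench_cmd: str

    @classmethod
    def from_cli_args(cls, args: Namespace) -> "SweepServeArgs":
        # `cls(...)` builds whichever class the classmethod was invoked on.
        return cls(serve_cmd=args.serve_cmd, bench_cmd=args.bench_cmd)


@dataclass
class SweepServeSLAArgs(SweepServeArgs):
    # SLA-specific arguments (illustrative field).
    sla_params: str

    @classmethod
    def from_cli_args(cls, args: Namespace) -> "SweepServeSLAArgs":
        # Buggy version: `super().from_cli_args(args)` keeps `cls` bound to
        # SweepServeSLAArgs, so the base factory effectively calls
        # SweepServeSLAArgs(serve_cmd=..., bench_cmd=...) without the
        # SLA-specific fields and raises a TypeError.
        #
        # Fix: invoke the base factory on the base class itself, then layer
        # the SLA-specific fields on top.
        base = SweepServeArgs.from_cli_args(args)
        base_kwargs = {f.name: getattr(base, f.name) for f in fields(base)}
        return cls(**base_kwargs, sla_params=args.sla_params)


# Usage sketch:
# args = Namespace(serve_cmd="vllm serve ...", bench_cmd="vllm bench ...",
#                  sla_params="p99_e2el_ms:500")
# sla_args = SweepServeSLAArgs.from_cli_args(args)
```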

DarkLight1337 modified the milestones: v0.11.1, v0.11.0 Oct 22, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
vllm-bot merged commit 6738e4a into vllm-project:main Oct 23, 2025
5 checks passed
DarkLight1337 deleted the fix-sla branch October 23, 2025 03:43
usberkeley pushed a commit to usberkeley/vllm that referenced this pull request Oct 23, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
albertoperdomo2 pushed a commit to albertoperdomo2/vllm that referenced this pull request Oct 23, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
845473182 pushed a commit to raindaywhu/vllm that referenced this pull request Oct 24, 2025
…o step_forward

* 'step_forward' of https://github.com/raindaywhu/vllm: (148 commits)
  [Model] Add MoE support for NemotronH (vllm-project#25863)
  [Metrics] [KVConnector] Add connector prefix cache hit rate stats (vllm-project#26245)
  [CI] Reorganize entrypoints tests (vllm-project#27403)
  add SLA information into comparison graph for vLLM Benchmark Suite (vllm-project#25525)
  [CI/Build] Fix AMD CI: test_cpu_gpu.py (vllm-project#27388)
  [Bugfix] Fix args settings for guided decoding args (vllm-project#27375)
  [CI/Build] Fix Prithvi plugin test (vllm-project#27393)
  [Chore] Remove duplicate `has_` functions in vllm.utils (vllm-project#27372)
  [Model] Add num_cached_tokens for PoolingRequestOutput (vllm-project#27378)
  [V1][spec decode] return logprobs for spec decoding (vllm-project#26060)
  [CORE] Support Prefix Caching with Prompt Embeds (vllm-project#27219)
  [Bugfix][Core] running queue index leakage exception (vllm-project#26754)
  [Bugfix] Fix incorrect kv cache metrics in grafana.json (vllm-project#27133)
  [Bugfix] Fix SLA tuner initialization (vllm-project#27355)
  [Bugfix] Fix deepseek-ocr multi-image inference and add `merge_by_field_config=True` with tensor schema support (vllm-project#27361)
  [MLA] Bump FlashMLA (vllm-project#27354)
  [Chore] Separate out system utilities from vllm.utils (vllm-project#27201)
  [BugFix] bugfix for Flash Attention MLA with full cuda graph IMA following pr-25490 (vllm-project#27128)
  [Feature] publisher default set zmq in kv_event config (vllm-project#26915)
  [Prefix Cache] Use LoRA name for consistent KV-cache block hashing (vllm-project#27211)
  ...
kingsmad pushed a commit to kingsmad/vllm that referenced this pull request Oct 25, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>