[V0 Deprecation] Remove V0 Spec Decode workers #21152

WoosukKwon · 2025-07-18T00:01:51Z

This PR removes the speculative decoding code in vLLM V0, which has been largely superseded by vLLM V1.

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

github-actions · 2025-07-18T00:01:59Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request is a large-scale cleanup that removes the V0 implementation for speculative decoding. The changes consist almost entirely of deleting the V0 spec decode workers and their associated tests. The only logical modification is in vllm/platforms/cuda.py, which now correctly raises a NotImplementedError if an attempt is made to use speculative decoding with the V0 engine. This is a clean and effective way to remove a deprecated feature. The changes appear correct and align with the PR's objective.
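The guard described above can be sketched roughly as follows. This is a minimal illustration, not the actual code in vllm/platforms/cuda.py: the `EngineConfig` and `SpeculativeConfig` classes, the `use_v1` flag, the `select_worker_cls` function, and the returned worker names are all hypothetical stand-ins chosen to show the shape of the check.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SpeculativeConfig:
    """Hypothetical stand-in for a speculative decoding configuration."""
    num_speculative_tokens: int


@dataclass
class EngineConfig:
    """Hypothetical stand-in for the engine configuration."""
    use_v1: bool
    speculative_config: Optional[SpeculativeConfig] = None


def select_worker_cls(config: EngineConfig) -> str:
    """Pick a worker name, rejecting speculative decoding on the V0 engine."""
    if config.speculative_config is not None and not config.use_v1:
        # The V0 spec decode workers no longer exist, so fail loudly
        # instead of silently falling back to another worker.
        raise NotImplementedError(
            "Speculative decoding is not supported on the V0 engine.")
    # Placeholder names; the real module paths are not shown here.
    return "v1_worker" if config.use_v1 else "v0_worker"
```

Under these assumptions, a V0 configuration that sets `speculative_config` fails at worker selection time, while V1 configurations and plain V0 configurations pass through unchanged.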

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon added 2 commits July 17, 2025 16:53

[V0 Deprecation] Remove V0 Spec Decode workers

0763950

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Remove tests

92a6742

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon requested review from njhill and LiuXiaoxuanPKU as code owners July 18, 2025 00:01

mergify bot added the speculative-decoding label Jul 18, 2025

gemini-code-assist bot reviewed Jul 18, 2025


Remove metrics

45474d3

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon requested review from zhuohan123, youkaichao, alexm-redhat and comaniac as code owners July 18, 2025 00:20

mergify bot added the ci/build and rocm labels Jul 18, 2025

WoosukKwon added the ready label Jul 18, 2025

WoosukKwon added this to V0 Deprecation Jul 18, 2025

remove more

8f78f9b

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon requested review from simon-mo, robertgshaw2-redhat, mgoin, tlrmchlsmth, houseroad and hmellor as code owners July 18, 2025 02:26

WoosukKwon added 2 commits July 17, 2025 21:19

Merge branch 'main' into woosuk/remove-v0-specdec

2fc910b

fix test

63865ce

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

mergify bot added the new-model label Jul 18, 2025

WoosukKwon mentioned this pull request Jul 18, 2025

[WIP] Support relaxed acceptance for thinking tokens in speculative decoding #21157

Closed

4 tasks

Remove EAGLEModel

fe02f89

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon requested review from DarkLight1337 and ywang96 as code owners July 18, 2025 05:03

fix

6810e2f

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

hmellor moved this to In Progress in V0 Deprecation Jul 18, 2025

WoosukKwon added 3 commits July 18, 2025 05:51

fix

8f733d6

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

Merge branch 'main' into woosuk/remove-v0-specdec

93198b5

fix test

464b73f

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

mergify bot added the v1 label Jul 18, 2025
