[V1][Spec Decode] Add random seed for EAGLE and its test script #16235

wwl2755 · 2025-04-08T06:35:33Z

This PR adds and tests a seed-based random generator in EAGLE for reproducibility, as mentioned in task 5 of #15901.

Signed-off-by: wwl2755 <wangwenlong2755@gmail.com>

github-actions · 2025-04-08T06:35:41Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀


wwl2755 · 2025-04-10T18:05:32Z

This should be a minor fix that won't affect the codebase much. Any comments/reviews are appreciated!

cc: @LiuXiaoxuanPKU @WoosukKwon

ekagra-ranjan · 2025-04-10T18:36:14Z

vllm/v1/spec_decode/eagle.py

+    batch_size = probs.size(0)
+    for i in range(batch_size):
+        generator = sampling_metadata.generators.get(i, None)
+        q[i].exponential_(generator=generator)


Is it possible to use q.exponential_(generator=generator) to avoid the for loop and leverage vectorization?
The generator will depend on the batch index of the request anyway, so it cannot be reproducible in every situation, but having a single generator for the entire batch means that the order of the sequences in the batch doesn't matter.

I just checked torch.Tensor.exponential_(), and it seems it can only take a single generator rather than one generator per row.

If this is only for reproducibility, I think using one representative generator should be fine. But I'm concerned about use cases where different sequences in the batch require differently seeded generators.
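One possible middle ground between the two options discussed above is sketched below: fill the whole tensor in a single vectorized call, then overwrite only the rows that actually carry a seeded generator. This is an illustration under assumptions, not the code in this PR: `fill_exponential` and `make_generators` are hypothetical helpers, and the `generators` dict (batch index to `torch.Generator`) only mimics the shape of `sampling_metadata.generators` as it is used in the diff.

```python
import torch


def fill_exponential(q: torch.Tensor,
                     generators: dict[int, torch.Generator]) -> torch.Tensor:
    """Fill q in place with Exp(1) noise (hypothetical helper)."""
    # One vectorized call for the whole batch, using the default global RNG.
    q.exponential_()
    # Only rows that have their own seeded generator are redrawn, so the
    # Python loop runs over seeded requests rather than the full batch.
    for i, generator in generators.items():
        q[i].exponential_(generator=generator)
    return q


def make_generators(seeds: dict[int, int]) -> dict[int, torch.Generator]:
    """Build one CPU generator per seeded batch index (hypothetical helper)."""
    return {i: torch.Generator().manual_seed(s) for i, s in seeds.items()}


# Row 1 is seeded; rows 0, 2 and 3 use the global RNG.
seeds = {1: 1234}
q_a = fill_exponential(torch.empty(4, 8), make_generators(seeds))
q_b = fill_exponential(torch.empty(4, 8), make_generators(seeds))

# The seeded row is identical across the two runs.
print(torch.equal(q_a[1], q_b[1]))
```

With this layout a seeded request gets the same noise regardless of what the other rows in the batch draw, while unseeded requests pay no per-row loop cost.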

ekagra-ranjan · 2025-04-10T18:39:26Z

vllm/v1/spec_decode/eagle.py

+    for i in range(batch_size):
+        generator = sampling_metadata.generators.get(i, None)
+        q[i].exponential_(generator=generator)
+
    next_token_ids = probs.div_(q).argmax(dim=-1).view(-1)


Do you know why probs needs to be divided by q, which is randomly initialized from an exponential distribution?

I'm not quite sure about the rigorous mathematical proof behind it, but I believe it is a simplified way of sampling from probs with randomness instead of always picking the max probability.
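For reference, the standard argument is the "exponential race": if each q_i is drawn independently from Exp(1), then q_i / p_i is exponential with rate p_i, and the index with the smallest q_i / p_i (equivalently, the largest p_i / q_i) is k with probability p_k / sum_j p_j. So argmax(probs / q) is a draw from the categorical distribution defined by probs. The sketch below checks this empirically in plain Python; `sample_via_exponential` is a hypothetical name used only for illustration, and the example probabilities are made up.

```python
import random
from collections import Counter


def sample_via_exponential(probs: list[float], rng: random.Random) -> int:
    """Return argmax_i probs[i] / q_i with q_i ~ Exp(1) (hypothetical helper)."""
    # Draw fresh Exp(1) noise for every entry, mirroring q.exponential_().
    ratios = [p / rng.expovariate(1.0) for p in probs]
    # The winning index is a sample from the categorical distribution probs.
    return max(range(len(probs)), key=ratios.__getitem__)


probs = [0.1, 0.2, 0.7]  # illustrative values only
rng = random.Random(0)
n = 20000

counts = Counter(sample_via_exponential(probs, rng) for _ in range(n))
freqs = [counts[i] / n for i in range(len(probs))]

# Empirical frequencies should be close to probs.
print(freqs)
```

Because the randomness enters only through q, seeding the generator that fills q is enough to make the sampled token reproducible.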

mergify · 2025-04-23T15:56:41Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @wwl2755.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

add random seed for EAGLE and its test script

5cad8e9

Signed-off-by: wwl2755 <wangwenlong2755@gmail.com>

wwl2755 requested review from WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners April 8, 2025 06:35

mergify bot added the v1 label Apr 8, 2025

add probs in unit test

275f296

Signed-off-by: wwl2755 <wangwenlong2755@gmail.com>

ekagra-ranjan reviewed Apr 10, 2025


mergify bot added the needs-rebase label Apr 23, 2025

markmc added the speculative-decoding label May 8, 2025
