[Misc] Add penalties sampling parameters to serve tool #25974

southfreebird · 2025-09-30T18:27:15Z

Purpose

Adding the frequency_penalty, presence_penalty, and repetition_penalty sampling parameters to the serve tool. It allows enabling them for performance measurement.

Test Plan

Example for frequency_penalty:

vllm bench serve --model $MODEL_NAME --dataset-name sharegpt --num-prompts 200 --dataset-path $DATASET_PATH --seed 0 --frequency-penalty 1.0

Test Result

gemini-code-assist

Code Review

This pull request adds support for frequency_penalty, presence_penalty, and repetition_penalty sampling parameters to the vllm bench serve tool. The implementation is correct in passing these parameters to the backend. However, there is a lack of client-side validation for the values of these new parameters. I've added a comment to suggest adding validation to improve user experience and prevent benchmark failures due to invalid inputs.

gemini-code-assist · 2025-09-30T18:28:40Z

vllm/benchmarks/serve.py

+    sampling_group.add_argument(
+        "--frequency-penalty",
+        type=float,
+        default=None,
+        help="Frequency penalty sampling parameter. Only has effect on "
+        "openai-compatible backends.",
+    )
+    sampling_group.add_argument(
+        "--presence-penalty",
+        type=float,
+        default=None,
+        help="Presence penalty sampling parameter. Only has effect on "
+        "openai-compatible backends.",
+    )
+    sampling_group.add_argument(
+        "--repetition-penalty",
+        type=float,
+        default=None,
+        help="Repetition penalty sampling parameter. Only has effect on "
+        "openai-compatible backends.",
+    )


The newly added penalty parameters (frequency_penalty, presence_penalty, repetition_penalty) are parsed as floats without any validation. The OpenAI API, and vLLM's implementation of it, has specific valid ranges for these parameters:

frequency_penalty: between -2.0 and 2.0.

presence_penalty: between -2.0 and 2.0.

repetition_penalty: must be a positive float.

Passing values outside these ranges will cause requests to fail at the server level, which could be confusing for users running benchmarks. It would be better to add client-side validation for these parameters to provide immediate and clear feedback on invalid inputs. This could be done using a custom type function with argparse.

Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com>

…-to-serve

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com> Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com> Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com>

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com>

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

mergify bot added the performance Performance-related issues label Sep 30, 2025

gemini-code-assist bot reviewed Sep 30, 2025

View reviewed changes

simon-mo approved these changes Sep 30, 2025

View reviewed changes

simon-mo enabled auto-merge (squash) September 30, 2025 22:36

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 30, 2025

auto-merge was automatically disabled October 1, 2025 09:32
Head branch was pushed to by a user without write access

southfreebird force-pushed the feature/add-frequency-penalties-to-serve branch 2 times, most recently from a9082b2 to fe78868 Compare October 1, 2025 14:48

Add frequency_penalty, presence_penalty and repetition_penalty to serve

7199ec1

Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com>

southfreebird force-pushed the feature/add-frequency-penalties-to-serve branch from fe78868 to 7199ec1 Compare October 1, 2025 19:03

Merge branch 'vllm-project:main' into feature/add-frequency-penalties…

f1e54db

…-to-serve

simon-mo merged commit b71fcd4 into vllm-project:main Oct 3, 2025
46 checks passed

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Misc] Add penalties sampling parameters to serve tool (vllm-project#…

2fe8327

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[Misc] Add penalties sampling parameters to serve tool (vllm-project#…

b90dcad

…25974) Signed-off-by: Sergei Skvortsov <sergeyskv@nebius.com> Co-authored-by: Sergei Skvortsov <sergeyskv@nebius.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Misc] Add penalties sampling parameters to serve tool #25974

[Misc] Add penalties sampling parameters to serve tool #25974

Uh oh!

southfreebird commented Sep 30, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Sep 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Misc] Add penalties sampling parameters to serve tool #25974

[Misc] Add penalties sampling parameters to serve tool #25974

Uh oh!

Conversation

southfreebird commented Sep 30, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

southfreebird commented Sep 30, 2025 •

edited by github-actions bot

Loading