Skip to content

Conversation

sjmonson
Copy link
Collaborator

@sjmonson sjmonson commented Mar 5, 2025

The throughput profile limits its max requests based on the number of successful requests in the previous synchronous profile run. This seems to be left over from previous behavior.

Fixes #88

@sjmonson sjmonson requested a review from markurtz March 5, 2025 20:44
@markurtz markurtz merged commit 55c65c4 into vllm-project:main Mar 6, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Throughput run in sweep mode runs for too short of time
2 participants