You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-4Lines changed: 2 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -92,10 +92,8 @@ Notes:
92
92
93
93
- The `--target` flag specifies the server hosting the model. In this case, it is a local vLLM server.
94
94
- The `--model` flag specifies the model to evaluate. The model name should match the name of the model deployed on the server
95
-
- The `--rate-type` flag specifies what load generation pattern GuideLLM will use when sending requests to the server.
96
-
If `sweep` is specified GuideLLM will run multiple performance evaluations across different request rates.
97
-
- By default GuideLLM will run over a fixed workload of 1000 requests configurable by `--max-requests`.
98
-
If `--max-seconds` is set GuideLLM will instead run over a fixed time.
95
+
- The `--rate-type` flag specifies what load generation pattern GuideLLM will use when sending requests to the server. If `sweep` is specified GuideLLM will run multiple performance evaluations across different request rates.
96
+
- By default GuideLLM will run over a fixed workload of 1000 requests configurable by `--max-requests`. If `--max-seconds` is set GuideLLM will instead run over a fixed time.
0 commit comments