Commit 0bba09a

docs: fix the README link to the perf.sh file (#1501)
1 parent 8585c30 commit 0bba09a

2 files changed: 2 additions, 2 deletions

examples/llm/benchmarks/README.md

Lines changed: 1 addition & 1 deletion
@@ -266,7 +266,7 @@ For more information see [Collecting Performance Numbers](#collecting-performanc
## Collecting Performance Numbers

- Currently, there is no consistent way of obtaining the configuration of deployment service. Hence, we need to provide this information to the script in form of command line arguments. The benchmarking script `/workspace/examples/llm/benchmarks/perf.sh` uses GenAI-Perf tool to collect the performance numbers at various different request concurrencies. The perf.sh script can be run multiple times to collect numbers for various different deployments. Each script execution will create a new artifacts directory in `artifacts_root` and dump these numbers in it. See [Plotting Pareto Graphs](#plotting-pareto-graphs) to learn how to convert the data from this `artifacts_root` to generate pareto graphs for the performance.
+ Currently, there is no consistent way of obtaining the configuration of deployment service. Hence, we need to provide this information to the script in form of command line arguments. The benchmarking script `/workspace/benchmarks/llm/perf.sh` uses GenAI-Perf tool to collect the performance numbers at various different request concurrencies. The perf.sh script can be run multiple times to collect numbers for various different deployments. Each script execution will create a new artifacts directory in `artifacts_root` and dump these numbers in it. See [Plotting Pareto Graphs](#plotting-pareto-graphs) to learn how to convert the data from this `artifacts_root` to generate pareto graphs for the performance.

Note: As each `perf.sh` adds a new artifacts directory in the `artifacts_root` always, proper care should be taken that we are starting experiment with clean `artifacts_root` so we include only results from runs that we want to compare.


examples/tensorrt_llm/README.md

Lines changed: 1 addition & 1 deletion
@@ -286,7 +286,7 @@ See [close deployment](../../docs/guides/dynamo_serve.md#close-deployment) secti
### Benchmarking

To benchmark your deployment with GenAI-Perf, see this utility script, configuring the
- `model` name and `host` based on your deployment: [perf.sh](../llm/benchmarks/perf.sh)
+ `model` name and `host` based on your deployment: [perf.sh](../../benchmarks/llm/perf.sh)

### Future Work
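
The relative-link change in `examples/tensorrt_llm/README.md` can be checked by resolving each link against the directory of the README that contains it. The following is a minimal sketch, assuming the paths are relative to the repository root as shown in the file headers of this diff; `resolve_link` is a hypothetical helper written for illustration and is not part of the repository.

```python
import posixpath


def resolve_link(readme_path: str, link: str) -> str:
    """Resolve a relative Markdown link against the README's own directory.

    Hypothetical helper for illustration; paths are treated as
    repository-root-relative POSIX paths.
    """
    base_dir = posixpath.dirname(readme_path)
    return posixpath.normpath(posixpath.join(base_dir, link))


readme = "examples/tensorrt_llm/README.md"

# Link before the commit: still points into the examples/ tree.
old_target = resolve_link(readme, "../llm/benchmarks/perf.sh")

# Link after the commit: climbs two levels to the repository root first.
new_target = resolve_link(readme, "../../benchmarks/llm/perf.sh")

print(old_target)  # examples/llm/benchmarks/perf.sh
print(new_target)  # benchmarks/llm/perf.sh
```

Under that assumption, the new link resolves to `benchmarks/llm/perf.sh`, which matches the path used in the first file's updated text if `/workspace` is where the repository root is mounted (also an assumption).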
