[misc] Add Benchmark Automation Script by zhuofan1123 · Pull Request #73 · taco-project/FlexKV

zhuofan1123 · 2025-12-02T08:27:25Z

Add FlexKV Benchmark Automation Script

This PR add an automated benchmark script (run_benchmark.sh) for running vLLM server with FlexKV and executing multi-turn conversation benchmarks. See scripts/README_zh.md.

Key Features

End-to-end automation: Dataset preparation, server launch, benchmark execution, and cleanup
Flexible configuration: Supports customizable vLLM, FlexKV, and benchmark parameters
Optional profiling: Integrated Nsight Systems profiling support
Comprehensive logging: Server logs, benchmark results, and profiling reports with timestamps

Usage

bash scripts/run_benchmark.sh --vllm-path <path> --model-path <path> [options]

Output

Generated log files in the log directory:

vllm_server_YYYYMMDD_HHMMSS.log
benchmark_YYYYMMDD_HHMMSS.log
vllm_profile_YYYYMMDD_HHMMSS.nsys-rep (if profiling enabled)

linhu-nv

LGTM. Thanks!

peaceforeverCN

LGTM

* add KVCacheEngineClient APIs * basic implementation for KVCacheEngineClient * initial transfer manager * init transfer handle * init kv engine * refactor kvmanager * update kvmanager * some refactor * kv response * add benchmark * serialize graph * fix bugs * ready check * update * rename * rename benchmark * use numpy instead of tensor * small fix * remove transfer descriptor * rename to kvmanager * update api * add gpu-kvcache-verifier, draft * update * create a new tp-worker process and create gpu blocks for verification * rename * the test_kvmanager works now * fix virtual op initialize * fix verifier bug when tp > 1 and mla enabled * fix * remove task id && some fix * only create one h2d op * pass slotmapping for launch * quick fix --------- Co-authored-by: linhu-nv <linhu@nvidia.com> Co-authored-by: Fei Liang <hanyueh@nvidia.com>

zhuofan1123 added 10 commits December 2, 2025 00:07

add scripts for e2e benchmark

d5f49f7

add log level filtering

874822e

process unexpected exit

6815071

print summary

480117e

optimize log output && format code

bbf695b

add README for scripts

2cbb124

fix readme

16a8e37

fix bugs in benchmark scripts

ff174d5

modify default concurrency

43dfaad

support nsys profiling

6ab2a89

zhuofan1123 requested review from linhu-nv and peaceforeverCN December 2, 2025 08:27

linhu-nv approved these changes Dec 2, 2025

View reviewed changes

remove config

407437f

peaceforeverCN approved these changes Dec 4, 2025

View reviewed changes

zhuofan1123 merged commit 17180d1 into main Dec 4, 2025
0 of 2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[misc] Add Benchmark Automation Script#73

[misc] Add Benchmark Automation Script#73
zhuofan1123 merged 11 commits intomainfrom
feat/scripts

zhuofan1123 commented Dec 2, 2025 •

edited

Loading

Uh oh!

linhu-nv left a comment

Uh oh!

peaceforeverCN left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

zhuofan1123 commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add FlexKV Benchmark Automation Script

Key Features

Usage

Output

Uh oh!

linhu-nv left a comment

Choose a reason for hiding this comment

Uh oh!

peaceforeverCN left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zhuofan1123 commented Dec 2, 2025 •

edited

Loading