[RL] reinforce learning benchmark framework #10619

SdeeRK · 2025-05-19T13:11:30Z

PR types

[ New features]

PR changes

[Benchmark Tools]

Description

This PR introduces a comprehensive benchmark testing framework with three core components:

torch_infer.py - PyTorch model inference performance testing
paddle_infer.py - PaddlePaddle model inference benchmarking
api_serve.py - API service performance testing

The framework provides standardized metrics collection including:

Inference throughput (tokens/second)
Generated token length analysis
Token generation time statistics

Execution Scripts
Corresponding launch scripts are available in the scripts/ directory:

api_serve.sh - API service testing
paddle_infer.sh - PaddlePaddle benchmark
torch_infer.sh - PyTorch benchmark

merge upstream develop

merge

…velop merge reinforce leanring benchmark framework

paddle-bot · 2025-05-19T13:11:35Z

Thanks for your contribution!

gongel · 2025-05-19T13:18:19Z

添加下使用文档；2. 函数注释统一换成英文

codecov · 2025-05-19T13:47:44Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 46.91%. Comparing base (5c482b6) to head (b507eeb).
Report is 8 commits behind head on develop.

❌ Your project check has failed because the head coverage (46.91%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop   #10619      +/-   ##
===========================================
- Coverage    46.91%   46.91%   -0.01%     
===========================================
  Files          799      799              
  Lines       132452   132460       +8     
===========================================
+ Hits         62137    62140       +3     
- Misses       70315    70320       +5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

llm/benchmark/rl/torch_infer.py

JunnYu · 2025-05-21T10:18:24Z

llm/benchmark/rl/torch_infer.py

+
+
+@contextmanager
+def switch_level_context(level="ERROR"):


torch不需要这个，删除了

JunnYu · 2025-05-21T10:19:38Z

llm/benchmark/rl/api_serve.py

+import pandas as pd
+from openai import AsyncOpenAI
+from tqdm import tqdm
+from transformers import logging


用paddlenlp的logger吧，看一下paddle infer的代码

SdeeRK and others added 13 commits May 12, 2025 15:48

增加rl目录下的init.py文件

7151f1f

增加copy rightb并格式化

9b595b1

Merge remote-tracking branch 'upstream/develop' into develop

b324777

merge upstream develop

Merge remote-tracking branch 'paddle/develop'

4443562

merge

add reinforcement learning benchmark framework

161ab74

Merge branch 'PaddlePaddle:develop' into develop

28957a5

Merge branch 'develop' of https://github.com/SdeeRK/PaddleNLP

76f29eb

merge

update copyright

d5c8465

update

6db62c0

Merge branch 'PaddlePaddle:develop' into develop

c52e604

add reinforce learning benchmark framework

98f9dbd

Merge branch 'develop' of https://github.com/SdeeRK/PaddleNLP into de…

2530919

…velop merge reinforce leanring benchmark framework

Merge branch 'PaddlePaddle:develop' into develop

45758bf

paddle-bot bot added the contributor label May 19, 2025

paddle-bot bot assigned gongel May 19, 2025

SdeeRK added 2 commits May 20, 2025 09:52

add elapsed time metrics in api serve framework

ad585eb

fix some bug

32e9136

JunnYu reviewed May 20, 2025

View reviewed changes

llm/benchmark/rl/torch_infer.py Outdated Show resolved Hide resolved

JunnYu reviewed May 20, 2025

View reviewed changes

llm/benchmark/rl/torch_infer.py Outdated Show resolved Hide resolved

SdeeRK added 3 commits May 20, 2025 19:09

add reinforce learning framework with scripts file

7cde16d

remove vllm quant type

7e75765

reformat rl benchmark code and scripts

2c139d4

JunnYu reviewed May 21, 2025

View reviewed changes

llm/benchmark/rl/torch_infer.py Outdated

@contextmanager

def switch_level_context(level="ERROR"):

Copy link

Member

JunnYu May 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

torch不需要这个，删除了

JunnYu reviewed May 21, 2025

View reviewed changes

JunnYu changed the title ~~reinforce learning benchmark framework~~ [RL] reinforce learning benchmark framework May 21, 2025

add readme file

b507eeb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RL] reinforce learning benchmark framework #10619

[RL] reinforce learning benchmark framework #10619

SdeeRK commented May 19, 2025 •

edited

Loading

paddle-bot bot commented May 19, 2025

gongel commented May 19, 2025

codecov bot commented May 19, 2025 •

edited

Loading

JunnYu May 21, 2025

JunnYu May 21, 2025

[RL] reinforce learning benchmark framework #10619

Are you sure you want to change the base?

[RL] reinforce learning benchmark framework #10619

Conversation

SdeeRK commented May 19, 2025 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented May 19, 2025

gongel commented May 19, 2025

codecov bot commented May 19, 2025 • edited Loading

Codecov Report

JunnYu May 21, 2025

Choose a reason for hiding this comment

JunnYu May 21, 2025

Choose a reason for hiding this comment

SdeeRK commented May 19, 2025 •

edited

Loading

codecov bot commented May 19, 2025 •

edited

Loading