-
Notifications
You must be signed in to change notification settings - Fork 3k
[RL] reinforce learning benchmark framework #10619
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
merge upstream develop
…velop merge reinforce leanring benchmark framework
Thanks for your contribution! |
|
Codecov ReportAll modified and coverable lines are covered by tests ✅
❌ Your project check has failed because the head coverage (46.91%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage. Additional details and impacted files@@ Coverage Diff @@
## develop #10619 +/- ##
===========================================
- Coverage 46.91% 46.91% -0.01%
===========================================
Files 799 799
Lines 132452 132460 +8
===========================================
+ Hits 62137 62140 +3
- Misses 70315 70320 +5 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
llm/benchmark/rl/torch_infer.py
Outdated
|
||
|
||
@contextmanager | ||
def switch_level_context(level="ERROR"): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
torch不需要这个,删除了
llm/benchmark/rl/api_serve.py
Outdated
import pandas as pd | ||
from openai import AsyncOpenAI | ||
from tqdm import tqdm | ||
from transformers import logging |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
用paddlenlp的logger吧,看一下paddle infer的代码
PR types
[ New features]
PR changes
[Benchmark Tools]
Description
This PR introduces a comprehensive benchmark testing framework with three core components:
The framework provides standardized metrics collection including:
Execution Scripts
Corresponding launch scripts are available in the scripts/ directory: