-
Notifications
You must be signed in to change notification settings - Fork 46
[Dataset]: Iterate through benchmark dataset once #48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@philschmid Hey there. Could you install the version of the branch |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, minor NIT on removing typing overrides and we should add in a test case for the main pathway for this.
Two additional points on testing:
- For automation, we should add in a quick test to ensure for the sweep case that we looped the dataset correctly and on start of a new benchmark it is the first item from the dataset -- can be merged in with the previous main tests
- For manual, please validate locally that calling with a hugging face dataset as well as a simple local csv file (can download the csv from the files for the linked dataset) will work properly
- Running a constant rate will execute the exact number of requests as in the dataset along with the first and last prompts sent matching the first and last in the dataset
- Running a sweep with a smaller (10 prompt) csv file will properly cycle and each benchmark run starts with the same beginning prompt and ends with the end prompt
9c077ab
to
51ceaa8
Compare
* Main CLI parameters are updated * `MaxRequestsType(ParamType)` is used as a custom `click` param type * Request generators are updated * Unit testes are added * Smoke tests are added
a02561f
to
656804f
Compare
Hey Guys, any idea on when we could get this merged? |
issue link: #44
Summary:
--max-requests
CLI parameter now supports thedataset
value.MaxRequestsType(ParamType)
is used as a customclick
param type to validate input dataRequestGenerator(ABC)
now includes the abstract__len__
that corresponds to the length of the dataset if it is supported.guidellm/utils/text.py
is fixed. There is no reason to check if the file start with the'http'
string. Basically this is a Bug since you never read a text file properly because of oldif/else
condition.