-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: need to pass skip_prepare_dataset for pretokenized dataset due to breaking change in HF SFTTrainer #326
fix: need to pass skip_prepare_dataset for pretokenized dataset due to breaking change in HF SFTTrainer #326
Conversation
…o breaking change in HF SFTTrainer Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com>
2d64294
to
d543f79
Compare
…ed dataset tests Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com>
d543f79
to
b6fc949
Compare
This seems to be the line in the SFTTrainer code that caused the error on our side |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! I'll try to get another set of eyes on this before merging though.
The issue is that trl created a release 5 days ago that our PRs are now picking up. v0.10.1 includes this line change https://github.com/huggingface/trl/blob/d57e4b726561e5ae58fdc335f34029052944a4a3/trl/trainer/sft_trainer.py#L348 where if packing=False (set by default in our code) then it expects that data preparation must be completed. For pretokenized datasets, data preparation is not needed, thus we need to set Angel had the idea that we could only add |
…o breaking change in HF SFTTrainer (foundation-model-stack#326) * fix: need to pass skip_prepare_dataset for pretokenized dataset due to breaking change in HF SFTTrainer Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com> * fix: wrong dataset paths, was using non-tokenized data in pre-tokenized dataset tests Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com> --------- Signed-off-by: Harikrishnan Balagopal <harikrishmenon@gmail.com> Signed-off-by: Angel Luu <angel.luu@us.ibm.com>
Description of the change
Fix test failure
Related issue number
Fixes #324 (comment)
How to verify the PR
make test
Was the PR tested