-
Notifications
You must be signed in to change notification settings - Fork 27.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[examples] fixes arguments for summarization finetune scripts #5157
[examples] fixes arguments for summarization finetune scripts #5157
Conversation
Codecov Report
@@ Coverage Diff @@
## master #5157 +/- ##
=======================================
Coverage 77.93% 77.94%
=======================================
Files 137 137
Lines 23475 23475
=======================================
+ Hits 18295 18297 +2
+ Misses 5180 5178 -2
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! Would love to not hardcode data_dir and output_dir, otherwise LGTM!
examples/summarization/finetune.sh
Outdated
@@ -11,6 +11,8 @@ export PYTHONPATH="../":"${PYTHONPATH}" | |||
|
|||
python finetune.py \ | |||
--model_name_or_path=facebook/bart-large \ | |||
--data_dir=./cnn-dailymail/cnn_dm \ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we leave this in the README
and add a comment in this file that the proper usage is documented in the README. I would rather not hardcode paths to data in this script, since people are using many different datasets stored in different places.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure, please check my 2nd commit, i also dropped cnn
prefix from default output dir in finetune.sh. Maybe it makes sense not to hardcode it as well? Will be glad to hear your feedback.
590e32b
to
7f0c46d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we don't need to mention OUTPUT_DIR in finetune.sh
. Otherwise, LGTM!
This PR fixes arguments in bash scripts for finetuning bart/bart_tiny/t5 models in summarization examples.
After changes that were merged in #4951 i noticed some small problems. That includes:
finetune.py
(--data_dir, output_dir)n_gpu
instead ofgpus
)model_type
that is not present in list of arguments and crashes the script