Fine-tuning bug fix #51
Conversation
As far as I understand, this wouldn't affect the behaviour when training crashed and was resumed, correct? That case would still continue by loading the optimiser parameters. (TBH, I haven't tested whether that case even works.)
Snakefile (Outdated)
```
@@ -91,14 +92,16 @@ align_dir = f"{data_dir}/alignment"

# models
models_dir = f"{data_root_dir}/models/{src}-{trg}/{experiment}"
teacher_dir = f"{models_dir}/teacher"
teacher_all_dir = f"{models_dir}/teacher-all"
teacher_parallel_dir = f"{models_dir}/teacher-parallel"
```
From reading the source, I don't understand what teacher_parallel_dir should contain. What is a parallel teacher model?
Teacher all - the model is trained on all available data.
Teacher parallel - an optional model that is fine-tuned on parallel data only, used if the data was augmented with back-translations.
Would it be easier to understand if I rename them to teacher and teacher-finetuned?
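A commented sketch of the directory variables from the diff above, with the roles as described in this reply (the comments are an interpretation of the explanation, not verified pipeline behaviour):

```python
# Illustrative only: placeholder values; in the Snakefile these come from the pipeline config.
data_root_dir, src, trg, experiment = "data", "en", "ru", "test"

models_dir = f"{data_root_dir}/models/{src}-{trg}/{experiment}"

# Teacher trained on all available data (original parallel corpus plus back-translations).
teacher_all_dir = f"{models_dir}/teacher-all"

# Optional teacher fine-tuned on the original parallel corpus only,
# used when the training data was augmented with back-translations.
teacher_parallel_dir = f"{models_dir}/teacher-parallel"
```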
Yes, that would be easier to understand, but this is a very minor point.
This will work because I removed protection from the output file. However, this is an irregular situation and not desirable. The pipeline is designed to work end to end without interruptions.
It causes fewer problems when teacher training on a parallel dataset happens in a separate directory. Model weights are initialized using the --pretrained-model Marian parameter (see the sketch at the end of this description).
Fixes: Teacher does not continue training if training on augmented data was early stopped #49
Fixes a new bug with student fine-tuning: weight initialization was missing (it was lost during refactoring).
Fixes usage of a pretrained backward model + vocab.
Fixes: Wrong tcol when cleaning with Bicleaner #56
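A minimal sketch of the fine-tuning step, assuming Marian's --pretrained-model option and hypothetical file and corpus paths (the real command is assembled by the pipeline scripts):

```python
import subprocess

# Hypothetical paths; in the pipeline these are derived from the Snakefile variables.
teacher_all_model = "data/models/en-ru/test/teacher-all/model.npz.best-ce-mean-words.npz"
teacher_parallel_dir = "data/models/en-ru/test/teacher-parallel"

# Fine-tune on the original parallel corpus only, initializing weights from the
# teacher trained on all (augmented) data. --pretrained-model loads parameters only;
# it does not restore optimizer or training state.
cmd = [
    "marian",
    "--model", f"{teacher_parallel_dir}/model.npz",
    "--pretrained-model", teacher_all_model,
    "--train-sets", "corpus.en.gz", "corpus.ru.gz",
    "--vocabs", "vocab.spm", "vocab.spm",
]
subprocess.run(cmd, check=True)
```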