Script finetuning on MSMarco #9

gangiswag · 2022-07-15T23:25:12Z

Thanks a lot for releasing the code and the scripts for pre-training.

I'm trying to reproduce the numbers on MS-Marco after fine-tuning and it would be great if you could also release the scripts for fine-tuning.

Specifically, I had questions about training the model after mining the hard negatives.

Is it initialized from the pre-trained contriever model or from the contriever model fine-tuned with random negatives?

gizacard · 2022-07-18T16:02:20Z

Hi! I've uploaded the script I used for finetuning here: https://github.com/facebookresearch/contriever/blob/main/finetuning.py. It does not support the ASAM optimizer that I used to finetune the English model. I'll try to add an example script when I have more time.
For hard negative mining, I first train contriever on supervised data, then mine hard negatives, then retrain the model with these hard negatives. I did the mining in a pretty manual way, which makes it hard to write a single script that pipelines all the different steps.
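
For readers following along, the mining step in the middle of that sequence can be sketched roughly as below. This is only an illustration, not the script used for the released models: the function name `mine_hard_negatives`, the toy embeddings, the `top_k` parameter, and the choice of inner-product scoring are all assumptions made for the example. The idea is to rank every passage for a query with the model from the first training round and keep the highest-scoring passages that are not labeled positive.

```python
from typing import Dict, List, Sequence, Set


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Inner product, used here as the query-passage similarity (an assumption)."""
    return sum(a * b for a, b in zip(u, v))


def mine_hard_negatives(
    query_embs: Dict[str, Sequence[float]],
    passage_embs: Dict[str, Sequence[float]],
    positives: Dict[str, Set[str]],
    top_k: int = 2,
) -> Dict[str, List[str]]:
    """Hypothetical helper: for each query, return the top_k highest-scoring
    passages that are not labeled as positives for that query."""
    mined: Dict[str, List[str]] = {}
    for qid, q in query_embs.items():
        # Rank all passage ids by similarity to the query, best first.
        ranked = sorted(
            passage_embs,
            key=lambda pid: dot(q, passage_embs[pid]),
            reverse=True,
        )
        gold = positives.get(qid, set())
        # Drop labeled positives, then keep the best-scoring remainder.
        mined[qid] = [pid for pid in ranked if pid not in gold][:top_k]
    return mined


# Toy embeddings standing in for the output of the first-round model.
query_embs = {"q1": [1.0, 0.0]}
passage_embs = {
    "p_gold": [0.9, 0.1],
    "p_close": [0.8, 0.3],
    "p_mid": [0.5, 0.5],
    "p_far": [-1.0, 0.0],
}
positives = {"q1": {"p_gold"}}

hard_negatives = mine_hard_negatives(query_embs, passage_embs, positives, top_k=2)
print(hard_negatives)  # {'q1': ['p_close', 'p_mid']}
```

On a corpus the size of MS MARCO, the brute-force ranking here would normally be replaced by a batched or approximate nearest-neighbor search; the filtering of labeled positives stays the same.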

gangiswag changed the title ~~Fine-tuning script on MS-Marco~~ Fine-tuning script on MSMarco Jul 15, 2022

gangiswag changed the title ~~Fine-tuning script on MSMarco~~ Script finetuning on MSMarco Jul 15, 2022
