Skip to content
This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Script finetuning on MSMarco #9

Open
gangiswag opened this issue Jul 15, 2022 · 1 comment
Open

Script finetuning on MSMarco #9

gangiswag opened this issue Jul 15, 2022 · 1 comment

Comments

@gangiswag
Copy link

Thanks a lot for releasing the code and the scripts for pre-training.

I'm trying to reproduce the numbers on MS-Marco after fine-tuning and it would be great if you could also release the scripts for fine-tuning.

Specifically, I had questions about training the model after mining the hard negatives.

Is it initialized to the pre-trained contriever model or the contriever model fine-tuned with random negatives?

@gangiswag gangiswag changed the title Fine-tuning script on MS-Marco Fine-tuning script on MSMarco Jul 15, 2022
@gangiswag gangiswag changed the title Fine-tuning script on MSMarco Script finetuning on MSMarco Jul 15, 2022
@gizacard
Copy link
Contributor

Hi! I've uploaded the script I used for finetuning here https://github.com/facebookresearch/contriever/blob/main/finetuning.py. There is no support for the ASAM optimizer that I used to finetune the English model. I'll try to add an example script when I'll have more time.
For hard negative mining, I first train contriever on supervised data, then mine hard negatives, then retrain the model with these hard negatives. Also for hard negative mining I did it in a pretty manual way, which makes it hard to write a single script pipelining all the different actions.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants