Replies: 1 comment
We tried further fine-tuning the ReProver checkpoint, but the results were not promising. I believe it may have something to do with lean-dojo/LeanDojo#5.
Hi,
We're experimenting with fine-tuning ReProver on a slightly modified Mathlib dataset. We simply used the training code in this repo with ReProver instead of ByT5 as the base model. It works, but I'm wondering whether this is the right approach: it likely starts with a high initial learning rate, which may cause the model to forget too much during the first epochs.
Do you think it would be better to continue from ReProver's training checkpoint? Or should we just use a lower learning rate?
Thanks for any tips!
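To illustrate the lower-learning-rate option, here is a minimal sketch in plain Python of a linear warmup-then-decay schedule with a reduced peak learning rate. All values here are hypothetical placeholders, not ReProver's actual training hyperparameters; the point is only that a small peak LR plus warmup limits how far the early updates move the model away from the pretrained checkpoint:

```python
def lr_at_step(step, peak_lr=1e-5, warmup_steps=2000, total_steps=100000):
    """Linear warmup to a reduced peak LR, then linear decay to zero.

    Hypothetical values: a peak_lr well below the original
    pretraining rate keeps early updates small, so the model
    drifts less from the fine-tuned checkpoint it started from.
    """
    if step < warmup_steps:
        # ramp up gradually instead of starting at full LR
        return peak_lr * step / warmup_steps
    # after warmup, decay linearly to zero at total_steps
    remaining = max(total_steps - step, 0)
    return peak_lr * remaining / (total_steps - warmup_steps)

print(lr_at_step(100))    # tiny LR early in training
print(lr_at_step(2000))   # peak LR right after warmup
```

Most trainer frameworks (e.g. the schedulers in HuggingFace `transformers`) expose an equivalent warmup setting, so in practice you would set the warmup steps and a lower peak LR in the training config rather than hand-rolling this.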