Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[KTO] learning rate recomentations for kto #2070

Merged
merged 5 commits into from
Sep 16, 2024
Merged

Conversation

kashif
Copy link
Collaborator

@kashif kashif commented Sep 15, 2024

What does this PR do?

recommendation for the KTO learning rate to be small. See: https://x.com/ethayarajh/status/1834780782126153763

@kashif kashif requested a review from lewtun September 15, 2024 10:31
@qgallouedec qgallouedec added the 🏋 KTO Related to KTO label Sep 15, 2024
@qgallouedec
Copy link
Member

What do you think about overriding the default lr of TrainingArgument in this case? It seems to be more convenient for the user.

cc @kawine @corbt

@dataclass
class KTOConfig(TrainingArguments):
    ...
    learning_rate: float = 5e-7

docs/source/kto_trainer.mdx Outdated Show resolved Hide resolved
@kawine
Copy link
Contributor

kawine commented Sep 16, 2024

thanks for taking the initiative @qgallouedec ! I just had one suggestion re: the docs.

@kashif
Copy link
Collaborator Author

kashif commented Sep 16, 2024

@qgallouedec done

@qgallouedec qgallouedec changed the title [KT0] learning rate recomentations for kto [KTO] learning rate recomentations for kto Sep 16, 2024
@qgallouedec
Copy link
Member

Thanks, for the record, we should extend this to all our configs, when necessary

@qgallouedec qgallouedec merged commit cdafc93 into main Sep 16, 2024
10 checks passed
@qgallouedec qgallouedec deleted the kto-learning-rate branch September 16, 2024 10:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🏋 KTO Related to KTO
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants