[WIP] Full fine-tune and LoRA fine-tune simplifications #2780

Andrei-Aksionov · 2025-06-02T19:37:36Z

Hey there 👋

Note

This is an implementation of the idea proposed in #2779

This is an implementation of my comment back from March.

The objective is to enhance the maintainability and, in certain aspects, the readability of recipes.

If we consider the differences between a full fine-tuning and LoRA variant, the primary distinctions should lie in:

The way a model is instantiated
The way a checkpoint is saved

This draft/PR proposes precisely this:

Utilizing the full fine-tuning recipe as a foundation
Overriding only the LoRA-specific methods
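As a rough illustration of the idea (not the actual torchtune code), the two points above can be sketched with a toy base recipe and a subclass that overrides only model instantiation and checkpoint saving. The class names, `ToyModel`, the parameter names, and the checkpoint layout are all hypothetical stand-ins. A real LoRA recipe may save more than the adapter weights.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ToyModel:
    """Hypothetical stand-in for a model: parameter name -> trainable flag."""

    params: Dict[str, bool] = field(default_factory=dict)

    def trainable(self) -> List[str]:
        return [name for name, flag in self.params.items() if flag]


class FullFinetuneRecipe:
    """Hypothetical base recipe that owns the shared setup/train flow."""

    def __init__(self, cfg: dict) -> None:
        self.cfg = cfg
        self.model: Optional[ToyModel] = None

    def setup(self) -> None:
        # Shared by both variants; only _setup_model differs.
        self.model = self._setup_model()

    def _setup_model(self) -> ToyModel:
        # Full fine-tuning: every base weight is trainable.
        return ToyModel({"attn.q_proj.weight": True, "mlp.w1.weight": True})

    def train(self) -> List[str]:
        # Placeholder for the shared training loop: report what would be updated.
        return self.model.trainable()

    def save_checkpoint(self) -> dict:
        # Full fine-tuning: save every weight.
        return {"model": sorted(self.model.params)}


class LoRAFinetuneRecipe(FullFinetuneRecipe):
    """Hypothetical LoRA variant: overrides only what is LoRA-specific."""

    def _setup_model(self) -> ToyModel:
        # Freeze the base weights, then add trainable adapter weights.
        base = super()._setup_model()
        frozen = {name: False for name in base.params}
        adapters = {
            "attn.q_proj.lora_a.weight": True,
            "attn.q_proj.lora_b.weight": True,
        }
        return ToyModel({**frozen, **adapters})

    def save_checkpoint(self) -> dict:
        # Simplification: save only the adapter (trainable) weights.
        return {"adapter": sorted(self.model.trainable())}


if __name__ == "__main__":
    recipe = LoRAFinetuneRecipe(cfg={})
    recipe.setup()
    print(recipe.train())
    print(recipe.save_checkpoint())
```

In this sketch `__init__`, `setup`, and `train` are inherited unchanged, so a fix made in the base recipe reaches the LoRA variant automatically.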

This approach offers several advantages:

More maintainable code: eliminating the need for identical modifications across multiple locations.
Up-to-date code: changes made to the full version are sometimes not carried over to the LoRA variant; with shared methods, the LoRA variant picks them up automatically.
More readable code: facilitating a clearer understanding of the unique aspects of LoRA fine-tuning (in terms of the recipe) for anyone interested in exploring it.

However, there are a few drawbacks to consider:

Increased readability challenges: while the single-file approach lets users read the whole recipe in one place, this approach requires opening two files (ideally side-by-side) to read the LoRA recipe.
Reduced hackability: if someone intends to use the LoRA recipe as a starting point and build upon it, the two files must first be merged. This process is straightforward, since methods are replaced as a whole; it requires minimal mental effort and takes a few minutes at most.

As with any solution, there are trade-offs. Despite these drawbacks, the approach may still be worthwhile.

pytorch-bot · 2025-06-02T19:37:39Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/2780

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

${GIT_USER_NAME} added 7 commits June 1, 2025 18:39

Drop all identical functions

4d30909

Reuse __init__ method

8ad4987

Reuse _setup_optimizer and _set_lr_scheduler

d2dbab0

Reuse train method

4e61ab2

Reuse setup method

d09bbaa

Comment out error when importing recipes

fad57db

Update configs for LoRA used in tests

27bf0dc

facebook-github-bot added the CLA Signed label Jun 2, 2025

Andrei-Aksionov mentioned this pull request Jun 2, 2025

Proposal: reuse methods in recipes #2779

Open

Andrei-Aksionov marked this pull request as draft June 2, 2025 19:40

Andrei-Aksionov changed the title ~~Full lora recipy simplification~~ [WIP] Full fine-tune and LoRA fine-tune simplifications Jun 3, 2025
