Conversation

Mingxue-Xu
Hi, I updated transformers to 4.48.0 and added the following supported models. Unfortunately, due to limited GPU resources, I could only fine-tune meta-llama/Llama-3.2-1B and facebook/opt-125m (both ran through successfully). All of the models, including previously supported ones such as microsoft/phi-2 and microsoft/Phi-3-mini-4k-instruct, ran successfully through run_slicegpt.py and run_lm_eval.py.

Newly supported models:

  • microsoft/phi-4
  • meta-llama/Llama-3.2-1B (-Instruct)
  • meta-llama/Llama-3.2-3B (-Instruct)
  • meta-llama/Llama-3.1-8B (-Instruct)

Can anyone please test whether all of the above models run through experiments/run_finetuning.py? Thanks!
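For anyone picking this up, a minimal sketch of a smoke-test loop over the newly listed models is below. The `--model` flag and the script's invocation are assumptions for illustration; check `python experiments/run_finetuning.py --help` in the repo for the actual argument names before running.

```python
import subprocess

# Models listed in this PR description to smoke-test.
MODELS = [
    "microsoft/phi-4",
    "meta-llama/Llama-3.2-1B",
    "meta-llama/Llama-3.2-3B",
    "meta-llama/Llama-3.1-8B",
]

def build_command(model: str) -> list[str]:
    # Hypothetical CLI shape; the real flags may differ, so verify
    # against the script's --help output first.
    return ["python", "experiments/run_finetuning.py", "--model", model]

if __name__ == "__main__":
    for model in MODELS:
        cmd = build_command(model)
        print("Would run:", " ".join(cmd))
        # subprocess.run(cmd, check=True)  # uncomment to actually launch each run
```

Instruct variants (e.g. meta-llama/Llama-3.2-1B-Instruct) can be appended to `MODELS` the same way.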

@Mingxue-Xu
Author

@microsoft-github-policy-service agree
