This project fine-tunes Llama 3 8B to generate structured JSON for arithmetic questions; the JSON output is then post-processed to perform the actual calculation. Training uses recent parameter-efficient fine-tuning techniques (QLoRA via Unsloth and PEFT), which speed up training and reduce the computational resources required.
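The post-processing step can be sketched as follows. This is a minimal illustration, not the project's actual code: the JSON schema here (an `operation` name plus two operands) is an assumption, since the real format is defined by `function_call.jsonl`.

```python
import json
import operator

# Hypothetical schema: the real field names depend on function_call.jsonl.
OPS = {
    "add": operator.add,
    "subtract": operator.sub,
    "multiply": operator.mul,
    "divide": operator.truediv,
}

def execute_call(model_output: str) -> float:
    """Parse the model's JSON output and perform the arithmetic."""
    call = json.loads(model_output)
    args = call["arguments"]
    return OPS[call["operation"]](args["a"], args["b"])

# Example: the fine-tuned model answers "What is 12 times 7?" with JSON.
print(execute_call('{"operation": "multiply", "arguments": {"a": 12, "b": 7}}'))  # → 84
```

Keeping the arithmetic outside the model means the LLM only has to produce a well-formed function call; the computation itself is exact.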
Note: a T4 (16 GB) GPU or better is required to run the code.
Live Colab notebook: https://drive.google.com/file/d/1NsSS1_M3pNAbkiBnPB3k5JKIkEQg3XNX/view?usp=sharing
- Download all the files in this repo.
- Run `Load_model.py` to load the libraries and Llama 3 8B.
- Run `Prepare_data.py` to load the `function_call.jsonl` dataset and prepare it for training.
- Run `Fine_tuning.py` to fine-tune the model.
- Run `Inference_n_save.py` to test the fine-tuned model and save it.
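Dataset preparation typically maps each JSONL record to an instruction-style training string. Below is a minimal sketch under assumed names: the `question`/`json_output` fields and the Alpaca-style template are hypothetical, not the actual schema of `function_call.jsonl` or the code in `Prepare_data.py`.

```python
import json

# Hypothetical instruction template; the real one lives in Prepare_data.py.
TEMPLATE = (
    "### Instruction:\nAnswer the arithmetic question as a JSON function call.\n"
    "### Input:\n{question}\n### Response:\n{json_output}"
)

def format_examples(jsonl_text: str) -> list[str]:
    """Turn raw JSONL records into prompt/response training strings."""
    examples = []
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue  # skip blank lines in the file
        record = json.loads(line)
        examples.append(TEMPLATE.format(**record))
    return examples

sample = ('{"question": "What is 2 plus 2?", '
          '"json_output": "{\\"operation\\": \\"add\\", \\"arguments\\": {\\"a\\": 2, \\"b\\": 2}}"}')
print(format_examples(sample)[0])
```

The resulting strings can then be tokenized and passed to the trainer; the same template (minus the response) is reused at inference time so the model sees a consistent format.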
This project is part of my research at the University of Chicago. The data comes from: https://github.com/rohanbalkondekar/finetune_llama2