Fine-tuning Google's FLAN-T5 model

The T5 (Text-to-Text Transfer Transformer) model is a powerful Transformer-based language model developed by Google.
It stands out by framing every NLP task as a text-to-text problem, meaning both the input and the output are text strings. This unified approach makes it straightforward to apply the model to a wide range of tasks, including machine translation, question answering, and summarization.

FLAN-T5 is an improved version of T5: for the same number of parameters, its checkpoints have been fine-tuned on more than 1,000 additional tasks covering more languages, which generally gives better zero-shot and few-shot performance.
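
As a quick illustration of the text-to-text setup, here is a minimal inference sketch (assuming the google/flan-t5-small checkpoint and a local transformers install; the prompt is just an example):

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")

# Every task is phrased as plain text in, plain text out.
prompt = "Q: What gas do plants absorb during photosynthesis? A:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))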

The goal here is to fine-tune FLAN-T5 on specific subject areas such as physics, chemistry, and biology.
The SciQ dataset contains 13,679 crowdsourced science exam questions covering Physics, Chemistry, and Biology, among other topics. The questions are in multiple-choice format with 4 answer options each, and for the majority of them an additional paragraph with supporting evidence for the correct answer is provided.
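
A minimal sketch for loading and inspecting SciQ (assuming the copy hosted on the Hugging Face Hub as allenai/sciq; the field names below come from that version):

from datasets import load_dataset

# SciQ ships with train / validation / test splits.
sciq = load_dataset("allenai/sciq")
sample = sciq["train"][0]

# Each example has a question, one correct answer, three distractors,
# and (for most questions) a supporting paragraph.
print(sample["question"])
print(sample["correct_answer"])
print(sample["support"])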

Fine-tuning approach

  1. Load the tokenizer and model from the transformers library.
  2. Add special 'think' and 'answer' tokens to the tokenizer (see the sketch after this list).
  3. Training all 77M parameters of the model can easily result in an Out-Of-Memory (OOM) error, so Low-Rank Adaptation (LoRA) is used to train only a small fraction of the parameters.
from peft import LoraConfig, get_peft_model

# Apply LoRA adapters to the attention query and value projections only.
lora_config = LoraConfig(
    task_type="SEQ_2_SEQ_LM",
    r=2,
    target_modules=["q", "v"])

LoRA_model = get_peft_model(model, lora_config)
LoRA_model.print_trainable_parameters()  # prints the summary itself, no outer print() needed

Output: trainable params: 86,016 || all params: 77,022,592 || trainable%: 0.1117

Note: target_modules can be changed (for example, to cover all of the model's linear layers instead of only the 'q' and 'v' projections), which changes the number of trainable parameters.

  4. Restructure the data, tokenize it, and convert it into a Dataset object (see the sketch after this list).
  5. Run supervised fine-tuning first, then improve performance with reinforcement learning methods.
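
A minimal sketch of steps 1, 2, and 4 under the assumptions above (the google/flan-t5-small checkpoint, the allenai/sciq dataset, and '<think>'/'<answer>' as the exact token strings and prompt/target format, which are illustrative choices rather than taken from the repository's code):

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
from datasets import load_dataset

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")

# Step 2: add the reasoning tokens and grow the embedding matrix to match.
tokenizer.add_tokens(["<think>", "<answer>"])
model.resize_token_embeddings(len(tokenizer))

# Step 4: map each SciQ example to an input/target text pair and tokenize.
def preprocess(example):
    prompt = f"Answer the science question.\nQuestion: {example['question']}"
    target = f"<think> {example['support']} <answer> {example['correct_answer']}"
    model_inputs = tokenizer(prompt, max_length=512, truncation=True)
    labels = tokenizer(text_target=target, max_length=256, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

sciq = load_dataset("allenai/sciq")
tokenized = sciq.map(preprocess, remove_columns=sciq["train"].column_names)

The resulting tokenized dataset can then be passed, together with the LoRA_model from the snippet above, to a Seq2SeqTrainer for the supervised stage, before any reinforcement learning refinement.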
