Adds VLM Training support to SFTTrainer + VSFT script #1518

edbeeching · 2024-04-10T07:18:03Z

Modifies the SFTTrainer so that it can be used with a vision-language model (VLM) SFT dataset
Adds an example script, vsft.py, to train the LLaVA 1.5 model on an instruct dataset

TODO:

Test PEFT support
Run full training
Add example to docs
Add tests

Example usage:

python examples/scripts/vsft.py \
    --model_name_or_path="llava-hf/llava-1.5-7b-hf" \
    --report_to="wandb" \
    --learning_rate=1.4e-5 \
    --per_device_train_batch_size=8 \
    --gradient_accumulation_steps=1 \
    --output_dir="data/vsft-llava-1.5-7b-hf" \
    --logging_steps=5 \
    --num_train_epochs=1 \
    --push_to_hub \
    --gradient_checkpointing \
    --remove_unused_columns=False \
    --torch_dtype=float16 \
    --fp16=True \
    --dataset_name=HuggingFaceH4/llava-instruct-mix-vsft

HuggingFaceDocBuilderDev · 2024-04-10T07:22:11Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

lewtun

Very clean implementation @edbeeching ! Just a few nits and a few unit/integration tests needed and this should be good to merge

examples/scripts/vsft.py

lewtun · 2024-04-10T14:55:41Z

trl/trainer/sft_trainer.py

    ):
        if dataset is None:
            raise ValueError("The dataset should not be None")

        # check if torch dataset / dataloader and do nothing
-        if isinstance(dataset, (torch.utils.data.IterableDataset, torch.utils.data.Dataset, ConstantLengthDataset)):
+        if skip_prepare_dataset or isinstance(


Let's add an integration test for this case under the existing SFTTrainer tests, along with an example training a tiny random llava model
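
For readers following the diff above, here is a minimal sketch of the control flow the new `skip_prepare_dataset` flag appears to introduce. It is not the TRL implementation: `prepare_dataset`, `tokenize`, and `passthrough_types` are hypothetical names used only for illustration, and the real method does considerably more work on the default path.

```python
from typing import Any, Callable, Tuple


def prepare_dataset(
    dataset: Any,
    tokenize: Callable[[Any], Any],
    skip_prepare_dataset: bool = False,
    passthrough_types: Tuple[type, ...] = (),
) -> Any:
    """Hypothetical stand-in for the trainer's dataset preparation step."""
    if dataset is None:
        raise ValueError("The dataset should not be None")

    # Return the dataset untouched when the caller opts out, or when it is
    # already a type that should not be re-processed (assumed list of types).
    if skip_prepare_dataset or isinstance(dataset, passthrough_types):
        return dataset

    # Default path: apply text tokenization to every example.
    return [tokenize(example) for example in dataset]


raw = [{"text": "a b"}, {"text": "c"}]


def split_text(example):
    # Toy tokenizer used only for this sketch.
    return example["text"].split()


prepared = prepare_dataset(raw, split_text)
untouched = prepare_dataset(raw, split_text, skip_prepare_dataset=True)
```

An integration test along the lines requested here could assert that the skipped dataset is the very same object that was passed in, which is the property a custom VLM data collator would rely on.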

examples/scripts/vsft.py

pcuenca

Looking great!

examples/scripts/vsft.py

lewtun · 2024-04-11T11:40:24Z

examples/scripts/vsft_llava.py

+    --use_peft \
+    --lora_r=64 \
+    --lora_alpha=16
+"""


Suggested change

"""
# to evaluate, first install the lmms-eval framework: pip install git+https://github.com/EvolvingLMMs-Lab/lmms-eval.git
# then run:
accelerate launch --num_processes=8 -m lmms_eval \
    --model llava_hf \
    --model_args pretrained=llava-hf/llava-1.5-7b-hf \
    --tasks mmbench \
    --batch_size 1 \
    --output_path ./logs/ \
    --log_sample
"""

younesbelkada

Thanks ! We should be good to go after fixing the merge conflict on main ! 🚀

* adds option to skip dataset preparation in SFTTrainer
* before changing the template
* adds support for new schema
* a few fixes to data collator to support new schema
* updates args
* precommit
* adds sys prompt to chat template and other fixes
* updates template, fixes collator for multiple images
* precommit
* rename vsft to vstf_llava
* adding integration tests
* adds integration test for vsft
* precommit
* adds back chat template
* docs
* typo
* adds eval, precommit
* adds peft launch args
* formatting
* fixes no deps tests by checking if PIL lib exists
* Update __init__.py

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

adds option to skip dataset preparation in SFTTrainer

43b78e2

edbeeching added 5 commits April 10, 2024 11:53

before changing the template

d98b66d

adds support for new schema

142727a

a few fixes to data collator to support new schema

c683d89

updates args

4c0457d

precommit

2bec041

edbeeching requested a review from lewtun April 10, 2024 14:35

edbeeching marked this pull request as ready for review April 10, 2024 14:37

lewtun reviewed Apr 10, 2024
