
Support pre-training 8*V100 (32G) gpus with xformers #411

Merged · 3 commits · Nov 4, 2023

Conversation

guanlaoda
Contributor

FlashAttention does not support V100 GPUs, so LLaVA cannot be trained on V100s, yet most of the cards in use are still V100s. This PR brings the FastChat solution to LLaVA: xFormers' memory-efficient attention replaces FlashAttention. As with FastChat (https://github.com/lm-sys/FastChat), you can now train LLaVA on V100s.
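A minimal sketch of the core idea (not the exact code in this PR): xFormers computes the same attention result without materializing the full attention matrix, and it runs on V100 (sm70), which FlashAttention does not support.

```python
# Minimal sketch, not the exact code in this PR: the memory-efficient kernel
# computes softmax(q @ k^T / sqrt(d)) @ v without materializing the full
# [seq_len, seq_len] attention matrix.
import xformers.ops as xops

def memory_efficient_self_attention(q, k, v, causal=True):
    # q, k, v: [batch, seq_len, num_heads, head_dim] tensors
    attn_bias = xops.LowerTriangularMask() if causal else None
    return xops.memory_efficient_attention(q, k, v, attn_bias=attn_bias)
```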

@guanlaoda
Contributor Author

[Training log screenshots: llava-7B-v1.1 model]

haotian-liu changed the title from "Pre-training supports 8*V100 (32G) gpus" to "Support pre-training 8*V100 (32G) gpus with xformers" on Sep 1, 2023
@haotian-liu
Owner

Thank you for your contribution! It looks good to me, and thank you for providing the training logs as well. One minor request: can you remove xformers from pyproject.toml and add a short instruction on how to install xformers? Similar to the one in FastChat:

If you are using V100 which is not supported by FlashAttention, you can use the memory-efficient attention implemented in xFormers. Install xformers and replace fastchat/train/train_mem.py above with fastchat/train/train_xformers.py.
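For reference, a hedged sketch of that pattern — the helper name below is illustrative, not necessarily what FastChat or this PR uses; xformers itself is installed separately with `pip install xformers` rather than pinned in pyproject.toml:

```python
# Illustrative sketch of the monkey-patch behind a train_xformers.py entry
# point (the helper name and details are assumptions, not the merged code).
# Install xformers separately: pip install xformers
import transformers

def replace_llama_attn_with_xformers_attn(xformers_forward):
    # Swap LLaMA's attention forward for one built on
    # xformers.ops.memory_efficient_attention *before* the model is built,
    # so the rest of the training script stays unchanged.
    transformers.models.llama.modeling_llama.LlamaAttention.forward = xformers_forward
```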

@guanlaoda
Contributor Author

OK, I'll modify it as soon as possible and submit it.

@guanlaoda
Contributor Author

I've completed the modification and submitted it.

@guanlaoda
Contributor Author

[Screenshots attached]

@guanlaoda
Contributor Author

No response yet. Is there anything else that needs to be modified?

@tingxueronghua

> No response yet. Is there anything else that needs to be modified?

Thanks for your effort. I want to ask about the GPU requirements: after applying xformers as FastChat does, could LLaVA 13B be fine-tuned on 8 V100s?

@guanlaoda
Contributor Author

I have tested it, and it hits GPU OOM. I'll test again tomorrow and see whether I can adjust the parameters.

@guanlaoda
Contributor Author

Test results on 8*V100 (32G) GPUs with xformers: pre-training works for 7B, but fine-tuning 7B does not work and hits OOM.
Do I need to try to resolve this issue?
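For anyone hitting the same OOM, the usual knobs are the standard Hugging Face TrainingArguments memory settings: a smaller per-device batch size with more gradient accumulation, plus gradient checkpointing. A hedged sketch with illustrative values (not the settings used in this PR):

```python
# Illustrative memory-saving settings via standard transformers
# TrainingArguments; the values are examples, not this PR's configuration.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="./checkpoints",
    per_device_train_batch_size=4,   # smaller micro-batch per GPU
    gradient_accumulation_steps=8,   # preserves the effective global batch size
    gradient_checkpointing=True,     # recompute activations to save memory
    fp16=True,                       # V100 has no bf16 support
)
```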

@tingxueronghua

> Test results on 8*V100 (32G) GPUs with xformers: pre-training works for 7B, but fine-tuning 7B does not work and hits OOM. Do I need to try to resolve this issue?

Thanks! I think I will just use LoRA. Thanks for your effort and patience!
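For reference, the LoRA route can look roughly like this with the PEFT library (a sketch, not the LLaVA fine-tuning script; the base model and target modules are illustrative assumptions):

```python
# Rough LoRA sketch with PEFT; model name and target modules are assumptions.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",      # illustrative 7B base model
    torch_dtype=torch.float16,
)
lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)  # only the LoRA adapters are trainable
model.print_trainable_parameters()
```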

@haotian-liu
Owner

Sorry for the late response, and thank you for the modification. I will review and merge this week.

guanlaoda reopened this on Nov 1, 2023
@hojisu

hojisu commented Nov 2, 2023

Is there any way to pre-train on 4*V100 (32GB) GPUs?

@aileenliao03

> Test results on 8*V100 (32G) GPUs with xformers: pre-training works for 7B, but fine-tuning 7B does not work and hits OOM. Do I need to try to resolve this issue?

Hi, is there a way to fine-tune 7B on 8*V100s now, by any chance? Thanks!

haotian-liu merged commit 5da9716 into haotian-liu:main on Nov 4, 2023
choics2623 pushed a commit to choics2623/LLaVA that referenced this pull request Dec 16, 2023
Support pre-training 8*V100 (32G) gpus with xformers
@pritamqu

pritamqu commented Feb 1, 2024

Could you please confirm which version of xformers you used?

@zfr00

zfr00 commented Jun 14, 2024

So there's no way to do full-parameter fine-tuning on V100s, right?
