Open
Description
Hi!
Could you let me know the minimum GPU resources needed to fine-tune the 32B model?
I tried training the 32B model on 8 L40 GPUs, but the process was interrupted during model loading.
It seems the model was never fully loaded onto the GPUs.
However, training the 3B model worked fine.
I'd like to know whether I can train the 32B model with 8 L40 GPUs (8 × 40 GB).
Thanks!
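For reference, here is a rough back-of-envelope estimate of why loading can fail at this scale. It assumes full fine-tuning with Adam in mixed precision (bf16 weights and gradients plus fp32 optimizer states) and ideal 8-way sharding; the byte counts are standard assumptions, not measurements of this repo's trainer.

```python
# Rough memory estimate for full fine-tuning a 32B-parameter model with
# Adam in mixed precision (bf16 weights/grads + fp32 optimizer states).
# Byte counts per parameter are illustrative assumptions, not measured values.
PARAMS = 32e9

BYTES_PER_PARAM = {
    "bf16 weights": 2,
    "bf16 gradients": 2,
    "fp32 master weights": 4,
    "fp32 Adam momentum": 4,
    "fp32 Adam variance": 4,
}

GIB = 1024**3

total_gib = PARAMS * sum(BYTES_PER_PARAM.values()) / GIB
per_gpu_gib = total_gib / 8  # fully sharded across 8 GPUs (ZeRO-3 style)

print(f"total state: {total_gib:.0f} GiB, per GPU (8-way shard): {per_gpu_gib:.0f} GiB")
```

Even before counting activations and CUDA overhead, the sharded optimizer state alone (~60 GiB per GPU under these assumptions) exceeds a 40 GB card, which would explain an interruption during model loading; parameter-efficient methods such as LoRA/QLoRA or CPU offloading reduce this dramatically.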
