How to train VLM model like Qwen2.5 VL? #19

Open

opened

on Aug 31, 2025

I would like to learn how to train a VLM such as Qwen2.5-VL, including how to prepare multimodal data (text + image).

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests