- LLaVA-OneVision, Cambrian-1, Pixmo, Docmatix
- Coyo-700M, LAION-5B, Object365, SA-1B, DenseFusion-1M, ShareGPT4o, ShareGPT4V, TextCaps
- BLIP3-OCR, COCO-Text, TextOCR
- Pixel Parsing, UReader, SynthDoG, FUNSD, DUDE, Vary, pdfa-eng-wds, idl-wds
- Chart-to-Text, MMC-Instruction, MultiUI
- Osprey-724K, MDVP-Data, ADE-20K, Flickr-30K, GranD