Famous Vision Language Models and Their Architectures
awesome
awesome-list
kosmos
clip
image-encoder
vlm
blip
multimodal
text-encoder
vision-language-model
llava
internlm
cogvlm
qwen-vl
-
Updated
Sep 8, 2024 - Markdown