LLaVA 系列模型结构详解 - Zhang #242
Replies: 2 comments
-
|
强 |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
为啥看不到评论? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
LLaVA 系列模型结构详解 - Zhang
从事 LLM 推理部署、视觉算法开发、模型压缩部署以及算法SDK开发工作,终身学习践行者。Transformer多模态大模型 MLLM 架构通常都是 LLM + 视觉编码器 + 映射层的组合。本文详细总结了 LLaVA 系列多模态模型的模型结构,以及视觉编码器如何支持高分辨率输入图像。
https://www.armcvai.cn/2024-11-28/llava-structure.html
Beta Was this translation helpful? Give feedback.
All reactions