Add Mini-Gemini all pretrained models
yanwei-li committed Apr 9, 2024
1 parent 7b331cc commit c220c57
Showing 1 changed file with 6 additions and 3 deletions.
9 changes: 6 additions & 3 deletions README.md
@@ -79,11 +79,14 @@ We provide all our fully finetuned models on Stage 1 and 2 data for Mini-Gemini:
| Mini-Gemini-8x7B-HD | 672 | 1536 | Mixtral-8x7B-Instruct-v0.1 | CLIP-L | MiniGemini-Instruct | full_ft-1e | [ckpt](https://huggingface.co/YanweiLi/Mini-Gemini-8x7B-HD) |
| Mini-Gemini-34B-HD | 672 | 1536 | Nous-Hermes-2-Yi-34B | CLIP-L | MiniGemini-Instruct | full_ft-1e | [ckpt](https://huggingface.co/YanweiLi/Mini-Gemini-34B-HD) |

- <!--
- Here are the pretrained weights (text decoder + context attention + projector) on Stage 1 data only:
+ Here are the pretrained weights on Stage 1 data only:
| Model | LR | HR | Base LLM | Vision Encoder | Finetuning Data | Finetuning schedule | Download |
|----------|----------|----------|----------|----------------|---------------|--------------------|------------------|
- | Mini-Gemini-2B | 336 | 768 | Gemma-2B | CLIP-L | MiniGemini-Pretrain | full_pt-1e | [ckpt]() | -->
+ | Mini-Gemini-2B | 336 | 768 | Gemma-2B | CLIP-L | MiniGemini-Pretrain | 1e | [ckpt](https://huggingface.co/YanweiLi/Mini-Gemini-Pretrain) |
+ | Mini-Gemini-7B | 336 | 768 | Vicuna-7B-v1.5 | CLIP-L | MiniGemini-Pretrain | 1e | [ckpt]() |
+ | Mini-Gemini-13B | 336 | 768 | Vicuna-13B-v1.5 | CLIP-L | MiniGemini-Pretrain | 1e | [ckpt]() |
+ | Mini-Gemini-8x7B | 336 | 768 | Mixtral-8x7B-Instruct-v0.1 | CLIP-L | MiniGemini-Pretrain | 1e | [ckpt]() |
+ | Mini-Gemini-34B | 336 | 768 | Nous-Hermes-2-Yi-34B | CLIP-L | MiniGemini-Pretrain | 1e | [ckpt]() |

## Preparation
### Dataset