Skip to content

Commit 051e636

Browse files
mudlergithub-actions[bot]
authored andcommitted
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
1 parent 495c4ee commit 051e636

File tree

1 file changed

+25
-0
lines changed

1 file changed

+25
-0
lines changed

gallery/index.yaml

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22938,3 +22938,28 @@
2293822938
- filename: ReForm-32B.i1-Q4_K_M.gguf
2293922939
sha256: a7f69d6e2efe002368bc896fc5682d34a1ac63669a4db0f42faf44a29012dc3f
2294022940
uri: huggingface://mradermacher/ReForm-32B-i1-GGUF/ReForm-32B.i1-Q4_K_M.gguf
22941+
- !!merge <<: *qwen3vl
22942+
name: "gelato-30b-a3b"
22943+
urls:
22944+
- https://huggingface.co/noctrex/Gelato-30B-A3B-GGUF
22945+
description: |
22946+
**Gelato-30B-A3B** is a state-of-the-art vision-language model specifically designed for GUI computer-use tasks, such as precise UI element localization and interaction. Built on the **Qwen3-VL-30B-A3B-Instruct** foundation, it was fine-tuned on the large-scale **Click-100k** dataset, which captures real-world user interactions with graphical interfaces.
22947+
22948+
Key Features:
22949+
- **High Accuracy**: Achieves **63.88%** on ScreenSpot-Pro and **73.40%** on OS-World-G, outperforming prior specialized models and larger VLMs.
22950+
- **Efficient Inference**: With only **3.3 GB activated memory**, it delivers strong performance on low-resource setups.
22951+
- **Open Source & Accessible**: Fully open-source under the Apache 2.0 license, enabling research and deployment in agentic systems.
22952+
- **Application Ready**: Designed for use in AI agents that interact with desktop GUIs—ideal for automation, assistive technologies, and human-computer interaction research.
22953+
22954+
Use Case: Perfect for building AI agents that understand and interact with software interfaces by interpreting visual cues and user instructions.
22955+
22956+
👉 **Model**: [mlfoundations-cua-dev/Gelato-30B-A3B](https://huggingface.co/mlfoundations-cua-dev/Gelato-30B-A3B)
22957+
👉 **Dataset**: [Click-100k](https://huggingface.co/datasets/mlfoundations/clicks-100k)
22958+
👉 **Training & Evaluation**: [GitHub Repo](https://github.com/mlfoundations/grounding-model-os-world)
22959+
overrides:
22960+
parameters:
22961+
model: Gelato-30B-A3B-Q4_K_M.gguf
22962+
files:
22963+
- filename: Gelato-30B-A3B-Q4_K_M.gguf
22964+
sha256: d44af330b10adaa291dced867ae90bbbdc8d502f97d994d7209828efe6343824
22965+
uri: huggingface://noctrex/Gelato-30B-A3B-GGUF/Gelato-30B-A3B-Q4_K_M.gguf

0 commit comments

Comments
 (0)