Skip to content

Commit dd2845a

Browse files
authored
chore(model gallery): add qwen3-embedding-4b (#5632)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
1 parent 2e7db01 commit dd2845a

File tree

1 file changed

+30
-0
lines changed

1 file changed

+30
-0
lines changed

gallery/index.yaml

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1065,6 +1065,36 @@
10651065
- filename: OpenBuddy_OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview0-QAT-Q4_K_M.gguf
10661066
sha256: 4862bc5841f34bd7402a66b2149d6948465fef63e50499ab2d07c89f77aec651
10671067
uri: huggingface://bartowski/OpenBuddy_OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview0-QAT-GGUF/OpenBuddy_OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview0-QAT-Q4_K_M.gguf
1068+
- !!merge <<: *qwen3
1069+
name: "qwen3-embedding-4b"
1070+
tags:
1071+
- qwen3
1072+
- embedding
1073+
- gguf
1074+
- gpu
1075+
- cpu
1076+
urls:
1077+
- https://huggingface.co/Qwen/Qwen3-Embedding-4B-GGUF
1078+
description: |
1079+
The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B). This series inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills of its foundational model. The Qwen3 Embedding series represents significant advancements in multiple text embedding and ranking tasks, including text retrieval, code retrieval, text classification, text clustering, and bitext mining.
1080+
**Exceptional Versatility**: The embedding model has achieved state-of-the-art performance across a wide range of downstream application evaluations. The 8B size embedding model ranks **No.1** in the MTEB multilingual leaderboard (as of June 5, 2025, score **70.58**), while the reranking model excels in various text retrieval scenarios.
1081+
**Comprehensive Flexibility**: The Qwen3 Embedding series offers a full spectrum of sizes (from 0.6B to 8B) for both embedding and reranking models, catering to diverse use cases that prioritize efficiency and effectiveness. Developers can seamlessly combine these two modules. Additionally, the embedding model allows for flexible vector definitions across all dimensions, and both embedding and reranking models support user-defined instructions to enhance performance for specific tasks, languages, or scenarios.
1082+
**Multilingual Capability**: The Qwen3 Embedding series offer support for over 100 languages, thanks to the multilingual capabilites of Qwen3 models. This includes various programming languages, and provides robust multilingual, cross-lingual, and code retrieval capabilities.
1083+
**Qwen3-Embedding-4B-GGUF** has the following features:
1084+
- Model Type: Text Embedding
1085+
- Supported Languages: 100+ Languages
1086+
- Number of Paramaters: 4B
1087+
- Context Length: 32k
1088+
- Embedding Dimension: Up to 2560, supports user-defined output dimensions ranging from 32 to 2560
1089+
- Quantization: q4_K_M, q5_0, q5_K_M, q6_K, q8_0, f16
1090+
overrides:
1091+
embeddings: true
1092+
parameters:
1093+
model: Qwen3-Embedding-4B-Q4_K_M.gguf
1094+
files:
1095+
- filename: Qwen3-Embedding-4B-Q4_K_M.gguf
1096+
sha256: aaeddb737110a166dbc7155753bb60d8c3ba9a93e69938c18bf3fdd7f23f0381
1097+
uri: huggingface://Qwen/Qwen3-Embedding-4B-GGUF/Qwen3-Embedding-4B-Q4_K_M.gguf
10681098
- &gemma3
10691099
url: "github:mudler/LocalAI/gallery/gemma.yaml@master"
10701100
name: "gemma-3-27b-it"

0 commit comments

Comments
 (0)