Closed
Description
🚀 The feature, motivation and pitch
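Both models are built on the Qwen3 core architecture and come in three sizes (0.6B, 4B, 8B).
The reranker needs model-specific scoring logic: it takes the logits of the "yes" and "no" tokens, applies a softmax over the two, and uses the resulting "yes" probability as the relevance score: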
https://huggingface.co/Qwen/Qwen3-Reranker-0.6B
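
A minimal sketch of the scoring step described above, assuming the "yes" and "no" logits have already been read from the model output for each query-document pair. The function names `yes_no_relevance` and `rank_documents` are hypothetical and only illustrate the arithmetic; they are not part of any library API.

```python
import math


def yes_no_relevance(yes_logit: float, no_logit: float) -> float:
    """Return P("yes") from a softmax over the two logits."""
    # Subtract the max before exponentiating for numerical stability.
    m = max(yes_logit, no_logit)
    e_yes = math.exp(yes_logit - m)
    e_no = math.exp(no_logit - m)
    return e_yes / (e_yes + e_no)


def rank_documents(logit_pairs):
    """Rank documents by P("yes"), highest first.

    logit_pairs: list of (yes_logit, no_logit), one per document.
    Returns (order, scores), where order holds document indices.
    """
    scores = [yes_no_relevance(y, n) for y, n in logit_pairs]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order, scores


if __name__ == "__main__":
    # Made-up logits for three candidate documents.
    order, scores = rank_documents([(0.0, 1.0), (2.0, 0.0), (1.0, 1.0)])
    print(order, scores)
```

Because only two logits enter the softmax, the score equals a sigmoid of `yes_logit - no_logit`, so the ranking depends only on that difference.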
The embedding model is more standard:
https://huggingface.co/Qwen/Qwen3-Embedding-0.6B
Alternatives
No response
Additional context
No response