-
Notifications
You must be signed in to change notification settings - Fork 178
Closed
Description
Model description
Currently mixedbread-ai/mxbai-rerank-large-v2 does not support rerank. Reason: the loaded model cannot fullfill rerank. Options are 'embed'.
infinity_emb v2 --port 7997 \
--model-id mixedbread-ai/mxbai-rerank-large-v2 --batch-size 8 --revision "refs/heads/main"curl --location 'http://localhost:7997/v1/rerank' \
--header 'Accept: application/json' \
--header 'Content-Type: application/json' \
--data '{
"model": "mixedbread-ai/mxbai-rerank-large-v2",
"query": "What is the capital of the United States?",
"documents": [
"Carson City is the capital city of the American state of Nevada.",
"The Commonwealth of the Northern Mariana Islands is a group of islands in the Pacific Ocean. Its capital is Saipan.",
"Washington, D.C. is the capital of the United States.",
"Capital punishment has existed in the United States since before it was a country."
],
"top_n": 3
}'{"error":{"message":"ModelNotDeployedError: model=`mixedbread-ai/mxbai-rerank-large-v2` does not support `rerank`. Reason: the loaded moded cannot fullyfill `rerank`. Options are {'embed'}.","type":null,"param":null,"code":400}}Open source status & huggingface transformers.
- The model implementation is available on transformers
- The model weights are available on huggingface-hub
- I verified that the model is currently not running in the latest version
pip install infinity_emb[all] --upgrade - I made the authors of the model aware that I want to use it with infinity_emb & check if they are aware of the issue.
Metadata
Metadata
Assignees
Labels
No labels