Skip to content

mxbai-rerank-large-v2 #552

@qdrddr

Description

@qdrddr

Model description

Currently mixedbread-ai/mxbai-rerank-large-v2 does not support rerank. Reason: the loaded model cannot fullfill rerank. Options are 'embed'.

infinity_emb v2 --port 7997 \
  --model-id mixedbread-ai/mxbai-rerank-large-v2 --batch-size 8 --revision "refs/heads/main"
curl --location 'http://localhost:7997/v1/rerank' \
--header 'Accept: application/json' \
--header 'Content-Type: application/json' \
--data '{
    "model": "mixedbread-ai/mxbai-rerank-large-v2",
    "query": "What is the capital of the United States?",
    "documents": [
        "Carson City is the capital city of the American state of Nevada.",
        "The Commonwealth of the Northern Mariana Islands is a group of islands in the Pacific Ocean. Its capital is Saipan.",
        "Washington, D.C. is the capital of the United States.",
        "Capital punishment has existed in the United States since before it was a country."
    ],
    "top_n": 3
  }'
{"error":{"message":"ModelNotDeployedError: model=`mixedbread-ai/mxbai-rerank-large-v2` does not support `rerank`. Reason: the loaded moded cannot fullyfill `rerank`. Options are {'embed'}.","type":null,"param":null,"code":400}}

Open source status & huggingface transformers.

  • The model implementation is available on transformers
  • The model weights are available on huggingface-hub
  • I verified that the model is currently not running in the latest version pip install infinity_emb[all] --upgrade
  • I made the authors of the model aware that I want to use it with infinity_emb & check if they are aware of the issue.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions