Closed
Description
🚀 The feature, motivation and pitch
The latest Qwen3-Rerank-8B model is not supported by accessing the online serving API.
When I run vllm serve Qwen/Qwen3-Reranker-8B --task score --port 7001, and request the score API service similar to https://github.com/vllm-project/vllm/blob/main/examples/online_serving/openai_cross_encoder_score.py, the error occurred is {'object': 'error', 'message': 'The model does not support Score API', 'type': 'BadRequestError', 'param': None, 'code': 400}.
Alternatives
Use HF implementation instead for now.
Additional context
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.