feat: support xinference rerank model #1466

Merged
merged 1 commit into from
Jul 11, 2024

Conversation

hwzhuhao
Contributor

@hwzhuhao hwzhuhao commented Jul 11, 2024

What problem does this PR solve?

Adds support for Xinference rerank models.
#1455

Type of change

  • Bug Fix (non-breaking change which fixes an issue)
  • New Feature (non-breaking change which adds functionality)
  • Documentation Update
  • Refactoring
  • Performance Improvement
  • Other (please describe):

@KevinHuSh KevinHuSh merged commit 009e18f into infiniflow:main Jul 11, 2024
1 check passed
@jay-jjwu

What is the base URL for the Xinference rerank model?

@jay-jjwu

What is the base URL for the Xinference rerank model?

I always get the error "Fail to access model(bge-reranker-v2-m3).'results'". Any ideas?
The Xinference API itself works; I tested it with Postman.
By the way, the embedding models work fine too.
What is wrong with the rerank models?
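
One plausible reading of that message, offered as a guess and not a confirmed diagnosis: a bare 'results' is what Python prints for a KeyError raised when code reads the "results" key from a response body that does not contain it, for example an error payload returned from the wrong path. The sketch below reproduces the text; `extract_scores` and the `error_body` shape are hypothetical and not taken from the project.

```python
def extract_scores(response: dict) -> list:
    """Hypothetical helper: read relevance scores from a rerank response."""
    return [item["relevance_score"] for item in response["results"]]


# Assumed shape of a body returned when the URL does not match a route.
error_body = {"detail": "Not Found"}

try:
    extract_scores(error_body)
except KeyError as exc:
    # str(KeyError("results")) is the quoted key name and nothing else.
    message = f"Fail to access model(bge-reranker-v2-m3).{exc}"
    print(message)
```

If this is the cause, the raw response body is more informative than the key name, so logging it before indexing would help.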

@jay-jjwu

image

@morler
Contributor

morler commented Jul 19, 2024

You need to append the rerank path to the address, like this: http://host:port/v1/rerank.
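
A minimal sketch of calling that address directly, useful for checking the URL outside the application. It assumes the endpoint accepts a JSON body with `model`, `query`, and `documents` fields and answers with a top-level `results` list; both are assumptions to verify against your Xinference version. The helper names `build_rerank_url` and `rerank` are hypothetical.

```python
import json
import urllib.request


def build_rerank_url(base_url: str) -> str:
    """Normalize a base address to the form http://host:port/v1/rerank."""
    base = base_url.rstrip("/")
    if base.endswith("/rerank"):
        return base
    if not base.endswith("/v1"):
        base += "/v1"
    return base + "/rerank"


def rerank(base_url: str, model: str, query: str, documents: list,
           timeout: float = 30.0) -> list:
    """POST a rerank request and return the 'results' list.

    The request and response field names are assumptions, not confirmed
    by this thread.
    """
    payload = json.dumps(
        {"model": model, "query": query, "documents": documents}
    ).encode("utf-8")
    request = urllib.request.Request(
        build_rerank_url(base_url),
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # Fail with the whole body instead of a bare KeyError on 'results'.
    if "results" not in body:
        raise ValueError(f"unexpected rerank response: {body}")
    return body["results"]
```

For example, `rerank("http://host:port", "bge-reranker-v2-m3", "question", ["doc a", "doc b"])` would post to http://host:port/v1/rerank, with host and port replaced by real values.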

@chinamerp
Contributor

You need to add a rerank directory to the address, like this: http://host:port/v1/rerank.

This does not work with xinference:v0.15.3.
