
feat: support litellm-sdk as reranker and embeddings#357

Merged
nicoloboschi merged 14 commits into main from litellm-1
Feb 12, 2026

nicoloboschi (Collaborator) commented Feb 12, 2026

Fix #339

qdrddr (Contributor) commented Feb 12, 2026

I think it makes sense to use LiteLLM-SDK for embedding models in addition to the reranker, so that users can choose from a wider range of models and providers. @nicoloboschi

- Add LiteLLMSDKEmbeddings class for direct API access without proxy
- Support multiple providers: Cohere, OpenAI, Together AI, HuggingFace, Voyage AI
- Automatic dimension detection via test embedding
- Provider-specific API key mapping
- Batch processing support (configurable batch size)
- Comprehensive test coverage (17 unit tests)
- Update documentation with configuration examples

Implements embeddings in same PR as reranker per user request
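
For readers skimming the commit message above, here is a minimal sketch of how provider-specific key mapping, batching, and dimension detection via a test embedding could fit together. It is not the code from this PR: `PROVIDER_ENV_KEYS`, `resolve_api_key`, `embed_in_batches`, and `detect_dimension` are hypothetical names, the environment variable names are assumptions, and `embed_fn` stands in for whatever call actually reaches the provider (for example a thin wrapper around `litellm.embedding`).

```python
import os
from typing import Callable, Dict, List, Mapping, Optional, Sequence

# Assumed mapping from model-name prefix to the env var holding its API key.
PROVIDER_ENV_KEYS: Dict[str, str] = {
    "cohere": "COHERE_API_KEY",
    "openai": "OPENAI_API_KEY",
    "together_ai": "TOGETHERAI_API_KEY",
    "huggingface": "HUGGINGFACE_API_KEY",
    "voyage": "VOYAGE_API_KEY",
}

# Hypothetical signature: (model, texts, api_key) -> one vector per text.
EmbedFn = Callable[[str, List[str], Optional[str]], List[List[float]]]


def resolve_api_key(model: str, env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Look up the API key for a 'provider/model' string, or return None."""
    env = os.environ if env is None else env
    provider, sep, _ = model.partition("/")
    if not sep:
        return None  # no provider prefix, nothing to map
    var = PROVIDER_ENV_KEYS.get(provider)
    return env.get(var) if var else None


def embed_in_batches(
    embed_fn: EmbedFn,
    model: str,
    texts: Sequence[str],
    api_key: Optional[str] = None,
    batch_size: int = 100,
) -> List[List[float]]:
    """Send texts to embed_fn in chunks of at most batch_size."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    vectors: List[List[float]] = []
    for start in range(0, len(texts), batch_size):
        batch = list(texts[start:start + batch_size])
        vectors.extend(embed_fn(model, batch, api_key))
    return vectors


def detect_dimension(embed_fn: EmbedFn, model: str, api_key: Optional[str] = None) -> int:
    """Embed a single probe string and report the vector length."""
    return len(embed_in_batches(embed_fn, model, ["test"], api_key)[0])
```

Injecting `embed_fn` keeps the batching logic testable without network access; a real implementation would presumably call the SDK directly.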
nicoloboschi changed the title from "feat: support litellm-sdk as reranker" to "feat: support litellm-sdk as reranker and embeddings" Feb 12, 2026
- Mock get_config() from its source module (hindsight_api.config)
- Fixes factory tests that were returning LocalSTEmbeddings instead of LiteLLMSDKEmbeddings
- All 17 unit tests now passing
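
As an illustration of the mocking fix described above, the sketch below patches a config getter at its source module so that a factory resolving the name at call time sees the mock. The module `demo_config` and the function `create_embeddings` are hypothetical stand-ins; how the real factory imports `get_config` from `hindsight_api.config` is an assumption here.

```python
import sys
import types
from unittest.mock import patch

# Hypothetical stand-in for a config module such as hindsight_api.config.
config_mod = types.ModuleType("demo_config")
config_mod.get_config = lambda: {"embeddings_provider": "local"}
sys.modules["demo_config"] = config_mod


def create_embeddings() -> str:
    """Toy factory: returns the name of the class it would instantiate."""
    # Lazy import: the name is looked up on the source module at every call,
    # so patching "demo_config.get_config" takes effect here.
    from demo_config import get_config

    provider = get_config()["embeddings_provider"]
    return "LiteLLMSDKEmbeddings" if provider == "litellm-sdk" else "LocalSTEmbeddings"


def run_with_patched_config() -> str:
    """Call the factory while the source module's get_config is mocked."""
    with patch("demo_config.get_config", return_value={"embeddings_provider": "litellm-sdk"}):
        return create_embeddings()
```

If the patch targeted a different module than the one the factory reads from, the factory would keep seeing the real config and fall back to the local class, which matches the symptom described in the commit message.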
- Catch initialization errors and skip tests instead of failing
- Prevents CI failures when COHERE_API_KEY is set but invalid
- Integration tests now properly skip when authentication fails
- Add same error handling as embeddings tests
- Prevents CI failures when COHERE_API_KEY is set but invalid
- Tests now properly skip when authentication fails
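
The skip-on-failure pattern from the two commits above can be sketched as follows. This is a standard-library illustration, not the PR's test code: `AuthError`, `make_client`, and `client_or_skip` are hypothetical, and in a pytest suite `pytest.skip(...)` would play the role that `unittest.SkipTest` plays here.

```python
import unittest


class AuthError(Exception):
    """Hypothetical stand-in for a provider authentication error."""


def make_client(api_key: str) -> dict:
    """Hypothetical constructor that validates the key during initialization."""
    if api_key != "valid-key":
        raise AuthError("invalid API key")
    return {"api_key": api_key}


def client_or_skip(api_key: str) -> dict:
    """Return a client, or skip the calling test if setup cannot succeed."""
    if not api_key:
        raise unittest.SkipTest("API key not set")
    try:
        return make_client(api_key)
    except Exception as exc:
        # A key that is set but invalid should skip the test, not fail CI.
        raise unittest.SkipTest(f"could not initialize client: {exc}") from exc
```

The trade-off is that a genuinely broken initializer would also be reported as a skip, so a suite using this pattern may want unit tests that cover initialization separately.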
- Add api_key parameter to arerank(), rerank(), aembedding(), and embedding() calls
- Prevents authentication issues in multi-process environments (pytest-xdist)
- More reliable than relying solely on environment variables
- Update test assertions to expect api_key parameter
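
A rough sketch of the explicit-key approach from the last commit, together with the kind of assertion a test might make. `rerank_with_key` is a hypothetical wrapper, and `rerank_fn` stands in for a call such as `litellm.rerank`; the keyword names mirror the ones mentioned in this PR but the wrapper itself is an assumption.

```python
from typing import Any, Callable, List
from unittest.mock import MagicMock


def rerank_with_key(
    rerank_fn: Callable[..., Any],
    model: str,
    query: str,
    documents: List[str],
    api_key: str,
) -> Any:
    """Forward the key explicitly instead of relying on environment variables."""
    return rerank_fn(model=model, query=query, documents=documents, api_key=api_key)


# Test-style usage: a mock replaces the SDK call so no network is needed.
fake_rerank = MagicMock(return_value={"results": []})
rerank_with_key(fake_rerank, "cohere/rerank-english-v3.0", "query", ["a", "b"], api_key="k")
fake_rerank.assert_called_once_with(
    model="cohere/rerank-english-v3.0",
    query="query",
    documents=["a", "b"],
    api_key="k",
)
```

Passing the key per call means each worker process carries its own credentials, which is presumably why it behaves more predictably under pytest-xdist than environment lookups alone.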
nicoloboschi merged commit e408b7e into main Feb 12, 2026
28 checks passed

Development

Successfully merging this pull request may close these issues.

[Feature]: DeepInfra provider for Qwen3-reranker

2 participants