Use Case
I'd like to use the DeepInfra provider with the SOTA Qwen3-reranker model
Problem Statement
It's cheaper than Cohere and has great quality
How This Feature Would Help
Reduce costs
Proposed Solution
Implement LiteLLM Client SDK so I could easily switch between providers and models
Alternatives Considered
No response
Priority
Important - affects my workflow
Additional Context
No response
Checklist