add example & check for vLLM hosted inference server #66
tisnik merged 1 commit into lightspeed-core:main
Walkthrough
Adds hosted_vllm as a new LLM provider across docs and code. Updates README and config comments to document the provider and its env vars. Implements provider-specific handling in LLMManager and adds environment validation for HOSTED_VLLM_API_KEY and HOSTED_VLLM_API_BASE.
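As a rough illustration of the environment validation described above, here is a minimal sketch. Only the provider name `hosted_vllm` and the two variable names come from this PR; the `REQUIRED_ENV_VARS` mapping, the `MissingEnvError` class, and the function signature are hypothetical and may not match the actual code in `env_validator.py`.

```python
import os
from typing import Mapping, Optional

# Hypothetical lookup table; only the hosted_vllm entry is taken from the PR walkthrough.
REQUIRED_ENV_VARS = {
    "hosted_vllm": ("HOSTED_VLLM_API_KEY", "HOSTED_VLLM_API_BASE"),
}


class MissingEnvError(RuntimeError):
    """Raised when a provider's required environment variables are not set (hypothetical)."""


def validate_provider_env(provider: str, env: Optional[Mapping[str, str]] = None) -> None:
    """Check that every variable required by `provider` is present and non-empty."""
    env = os.environ if env is None else env
    required = REQUIRED_ENV_VARS.get(provider, ())
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise MissingEnvError(
            f"Provider '{provider}' requires: {', '.join(missing)}"
        )
```

Passing `env` explicitly keeps the check easy to test without mutating the real process environment; in normal use the function would fall back to `os.environ`.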
Sequence Diagram(s)
sequenceDiagram
participant User as User
participant LLMManager as LLMManager
participant Env as EnvValidator
participant Backend as Hosted vLLM API
User->>LLMManager: init(provider="hosted_vllm", model)
LLMManager->>Env: validate_provider_env("hosted_vllm")
Env-->>LLMManager: OK (HOSTED_VLLM_API_KEY, HOSTED_VLLM_API_BASE)
LLMManager->>LLMManager: build model name "hosted_vllm/{model}"
LLMManager->>Backend: request with constructed model and API base/key
Backend-->>LLMManager: response
LLMManager-->>User: return response
rect rgba(230, 245, 255, 0.6)
note over LLMManager,Env: New/changed path for provider "hosted_vllm"
end
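The model-name step in the diagram can be sketched as follows. This is an assumption-laden illustration, not the code from `manager.py`: the helper names `build_model_name` and `build_request_params` and the shape of the returned dictionary are hypothetical; only the `hosted_vllm/{model}` prefix and the two environment variable names come from the PR.

```python
from typing import Dict, Mapping


def build_model_name(provider: str, model: str) -> str:
    """Prefix the model with the provider name for hosted_vllm (hypothetical helper)."""
    if provider == "hosted_vllm":
        return f"hosted_vllm/{model}"
    # Other providers are out of scope for this sketch and pass through unchanged.
    return model


def build_request_params(provider: str, model: str, env: Mapping[str, str]) -> Dict[str, str]:
    """Collect the values a client would need to call the hosted vLLM server."""
    params = {"model": build_model_name(provider, model)}
    if provider == "hosted_vllm":
        # Both variables are assumed to have been validated before this point.
        params["api_base"] = env["HOSTED_VLLM_API_BASE"]
        params["api_key"] = env["HOSTED_VLLM_API_KEY"]
    return params
```

For example, calling `build_request_params("hosted_vllm", "my-model", os.environ)` after a successful environment check would yield a dictionary whose `model` entry is `hosted_vllm/my-model`; here `my-model` is a placeholder, not a model named in the PR.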
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~25 minutes
Pre-merge checks: ✅ Passed checks (3 passed)
Code graph analysis (2)
src/lightspeed_evaluation/core/system/env_validator.py (1)
src/lightspeed_evaluation/core/llm/manager.py (1)
@tisnik @VladimirKadlec PTAL - example for vLLM-based inference server
Add example & check for vLLM hosted inference server