An intelligent LLM inference gateway that dynamically routes user queries to optimal model tiers (Llama-3.1 8B/70B) based on real-time complexity, reasoning depth, and ambiguity analysis.
Updated Jan 17, 2026 - Python
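The tiered-routing idea above can be sketched as a small scoring function. This is a hypothetical illustration, not the gateway's actual implementation: the feature list (length, reasoning markers, ambiguity markers), the weights, and the threshold are all assumptions standing in for the project's real-time complexity analysis.

```python
# Hypothetical sketch of complexity-based tier routing: score a query on
# rough proxies for complexity, reasoning depth, and ambiguity, then pick
# a model tier. Markers, weights, and threshold are illustrative only.
REASONING_MARKERS = ("why", "prove", "derive", "compare", "trade-off", "step by step")
AMBIGUITY_MARKERS = ("it", "this", "something like")

def complexity_score(query: str) -> float:
    q = query.lower()
    score = min(len(q.split()) / 50.0, 1.0)           # longer prompts lean complex
    score += 0.5 * sum(m in q for m in REASONING_MARKERS)
    score += 0.25 * sum(m in q for m in AMBIGUITY_MARKERS)
    return score

def route(query: str, threshold: float = 0.6) -> str:
    # Cheap tier for simple queries, large tier for deep reasoning.
    return "llama-3.1-70b" if complexity_score(query) >= threshold else "llama-3.1-8b"
```

A simple lookup like `route("What time is it?")` stays on the 8B tier, while a multi-step reasoning request crosses the threshold to 70B.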
Extensible request routing service with a modular processing pipeline and a structured-logging abstraction, built around the Chain of Responsibility design pattern.
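The Chain of Responsibility pattern mentioned above links handlers so each processes a request and passes it along. A minimal sketch, with hypothetical handler names (the repo's actual pipeline stages are not shown here):

```python
from abc import ABC, abstractmethod
from typing import Optional

class Handler(ABC):
    """One stage in a Chain of Responsibility pipeline."""
    def __init__(self) -> None:
        self._next: Optional["Handler"] = None

    def set_next(self, handler: "Handler") -> "Handler":
        self._next = handler
        return handler  # returning the new handler allows fluent chaining

    def handle(self, request: dict) -> dict:
        request = self.process(request)
        return self._next.handle(request) if self._next else request

    @abstractmethod
    def process(self, request: dict) -> dict: ...

class AuthHandler(Handler):
    def process(self, request: dict) -> dict:
        request["authenticated"] = bool(request.get("token"))
        return request

class LoggingHandler(Handler):
    def process(self, request: dict) -> dict:
        # Stand-in for a structured-logging abstraction: attach a record.
        request.setdefault("log", []).append({"stage": "logged"})
        return request

# Assemble the pipeline; new stages can be inserted without touching others.
pipeline = AuthHandler()
pipeline.set_next(LoggingHandler())
```

Calling `pipeline.handle({"token": "abc"})` runs the request through every stage in order, which is what makes the service extensible: adding a stage is just another `set_next` call.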
Semantic model router with parallel LLM classification, prompt caching, and vision short-circuiting. Optimizes request routing with sub-100ms overhead for Open WebUI.
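The three techniques named above (prompt caching, vision short-circuiting, parallel classification) compose naturally. A hedged sketch with stubbed classifiers, assumed model names, and an in-memory cache; the real router presumably calls LLM classifiers and integrates with Open WebUI:

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sketch: cache routing verdicts by prompt hash, send
# image-bearing requests straight to a vision model (short-circuit),
# and run classifier calls in parallel on a cache miss.
_cache: dict[str, str] = {}

def _classify_complexity(prompt: str) -> str:
    # Stand-in for an LLM classification call.
    return "complex" if len(prompt.split()) > 20 else "simple"

def _classify_domain(prompt: str) -> str:
    # Stand-in for a second, independent LLM classifier.
    return "code" if "def " in prompt or "```" in prompt else "general"

def route(prompt: str, has_image: bool = False) -> str:
    if has_image:
        return "vision-model"            # short-circuit: skip classification
    key = hashlib.sha256(prompt.encode()).hexdigest()
    if key in _cache:                    # prompt cache hit: no classifier calls
        return _cache[key]
    with ThreadPoolExecutor() as pool:   # run both classifiers in parallel
        complexity, domain = pool.map(lambda f: f(prompt),
                                      (_classify_complexity, _classify_domain))
    model = "large-model" if complexity == "complex" or domain == "code" else "small-model"
    _cache[key] = model
    return model
```

The cache keeps repeated prompts off the classifiers entirely, and the vision branch never pays classification latency at all, which is how a router like this can stay within a tight per-request overhead budget.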