This is an experimental LLM serving system, forked from and built on top of SGLang SRT. It supports SwissAI Model Serving and is used by the following projects:
- SwissAI - as the primary serving engine for LLMs.
- HexGen-Flow - as an LLM execution simulator for Text-to-SQL applications.