☸️ Easy, advanced inference platform for large language models on Kubernetes
Updated Nov 12, 2024 - Go
A guide to structured generation using constrained decoding
AI-based search engine done right
Examples of serving LLMs on Modal.