forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Doc] add KubeAI to serving integrations (vllm-project#10837)
Signed-off-by: Sam Stoelinga <sammiestoel@gmail.com>
- Loading branch information
Showing
2 changed files
with
18 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,17 @@ | ||
.. _deploying_with_kubeai: | ||
|
||
Deploying with KubeAI | ||
===================== | ||
|
||
`KubeAI <https://github.com/substratusai/kubeai>`_ is a Kubernetes operator that enables you to deploy and manage AI models on Kubernetes. It provides a simple and scalable way to deploy vLLM in production. Functionality such as scale-from-zero, load based autoscaling, model caching, and much more is provided out of the box with zero external dependencies. | ||
|
||
|
||
Please see the Installation Guides for environment specific instructions: | ||
|
||
* `Any Kubernetes Cluster <https://www.kubeai.org/installation/any/>`_ | ||
* `EKS <https://www.kubeai.org/installation/eks/>`_ | ||
* `GKE <https://www.kubeai.org/installation/gke/>`_ | ||
|
||
Once you have KubeAI installed, you can | ||
`configure text generation models <https://www.kubeai.org/how-to/configure-text-generation-models/>`_ | ||
using vLLM. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters