[LLM] OSS LLM Serving #50643
Signed-off-by: Gene Su <e870252314@gmail.com>
All tests green! Especially this one: https://buildkite.com/ray-project/premerge/builds/35010#019517cc-1117-4c01-ba25-77e015f856b6/186-935
Why is the `python/__init__.py` file required?
Oh, good call. Maybe that got created by my IDE somehow. Let me drop it.
(and does not seem to have anything substantial that needs my review?)
Correct, this doesn't contain any dependency-related changes after all.
This PR adds all the components required to run llm serving application. Main components added are:
- `build_vllm_deployment`
- `build_openai_app`
- `VLLMDeployment`
- `LLMModelRouterDeployment`
Signed-off-by: Gene Su <e870252314@gmail.com>
Signed-off-by: 400Ping <43886578+400Ping@users.noreply.github.com>
Why are these changes needed?
This PR adds all the components required to run an LLM serving application. The main components added are:
- `build_vllm_deployment`
- `build_openai_app`
- `VLLMDeployment`
- `LLMModelRouterDeployment`
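The description names the components but not their signatures. As a rough illustration of the general pattern the names suggest (a builder that places per-model servers behind a router keyed by model id), here is a plain-Python sketch. Every class and function in it (`ToyLLMConfig`, `ToyModelServer`, `ToyRouter`, `build_toy_app`) is hypothetical and is not the Ray Serve API added by this PR; it only assumes that a router dispatches requests by model id.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ToyLLMConfig:
    """Hypothetical config: only carries the model id used for routing."""
    model_id: str


class ToyModelServer:
    """Hypothetical stand-in for a per-model deployment."""

    def __init__(self, config: ToyLLMConfig) -> None:
        self.config = config

    def chat(self, prompt: str) -> str:
        # A real deployment would call an inference engine here;
        # this toy version just echoes the prompt.
        return f"[{self.config.model_id}] echo: {prompt}"


class ToyRouter:
    """Hypothetical stand-in for a router that dispatches by model id."""

    def __init__(self, servers: List[ToyModelServer]) -> None:
        self._servers: Dict[str, ToyModelServer] = {
            s.config.model_id: s for s in servers
        }

    def list_models(self) -> List[str]:
        return sorted(self._servers)

    def chat(self, model: str, prompt: str) -> str:
        if model not in self._servers:
            raise KeyError(f"unknown model: {model}")
        return self._servers[model].chat(prompt)


def build_toy_app(configs: List[ToyLLMConfig]) -> ToyRouter:
    """Hypothetical builder: one server per config, all behind one router."""
    return ToyRouter([ToyModelServer(c) for c in configs])


app = build_toy_app([ToyLLMConfig("model-a"), ToyLLMConfig("model-b")])
reply = app.chat("model-b", "hello")
```

The builder keeps wiring out of the caller's hands: callers pass configs and get back a single entry point, which is one plausible reason to expose builder functions alongside the deployment classes.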
Things to follow up:
- `DeploymentConfig` and `AutoscalingConfig` in `LLMConfig`
Related issue number
Checks
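- I've signed off every commit (by using the -s flag, i.e., `git commit -s`) in this PR.
- I've run `scripts/format.sh` to lint the changes in this PR.
- I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file.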