Serverless LLM Deployment Examples

This repository contains a set of hackable examples for serverless deployment of Large Language Models (LLMs). We explore and analyze three services: Modal Labs, Beam Cloud, and RunPod, each of which abstracts the deployment process at a different level.

Service	Blogpost	Implementation
Modal Labs	Tutorial Blogpost	Modal Labs Deployment
Beam Cloud	Tutorial Blogpost	Beam Cloud Deployment
RunPod	Tutorial Blogpost	RunPod Deployment

We provide blog posts for each service, as well as dedicated repositories containing full code examples and instructions on how to run and test them.

Test Deployed Model

If you have followed our tutorials and deployed a model with any of the services above, you can test the deployment from this repository. Testing currently supports streaming responses only, but feel free to adapt the test script if you need something else. Before getting started, install the requirements:

pip install -r requirements.txt

Now, assuming you deployed your model using any of the services, you can run test.py as shown below:

For Modal and Beam Cloud:

python3 test.py modal --url <YOUR-DEPLOYED-MODAL/BEAM-URL> --prompt "hello"
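Since testing targets streaming responses, it can help to see how a client might consume one. The sketch below is not the repository's test.py; it is a minimal illustration, using only the standard library, of turning raw byte chunks from a streamed response into text as they arrive. The function names `iter_text` and `collect` are hypothetical, and the sketch assumes the endpoint streams plain UTF-8 text.

```python
import codecs
from typing import Iterable, Iterator


def iter_text(chunks: Iterable[bytes], encoding: str = "utf-8") -> Iterator[str]:
    """Yield decoded text pieces from raw byte chunks as they arrive."""
    # An incremental decoder buffers partial multi-byte characters that
    # happen to be split across two network chunks.
    decoder = codecs.getincrementaldecoder(encoding)(errors="replace")
    for chunk in chunks:
        text = decoder.decode(chunk)
        if text:
            yield text
    # Flush anything still buffered once the stream ends.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


def collect(chunks: Iterable[bytes]) -> str:
    """Print each piece as soon as it is decoded and return the full text."""
    pieces = []
    for piece in iter_text(chunks):
        print(piece, end="", flush=True)
        pieces.append(piece)
    print()
    return "".join(pieces)
```

Any iterable of bytes works as input. With the `requests` library, for example, the chunks could come from `response.iter_content(chunk_size=None)` on a request made with `stream=True`; the request payload your deployment expects depends on how you deployed it.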

For RunPod, you also need to provide the service ID:

python3 test.py modal \
    --url <YOUR-DEPLOYED-RUNPOD-URL> \
    --prompt "hello" \
    --runpod_id <RUNPOD-ID>

Replace <RUNPOD-ID> with your own ID. It looks something like 80r0eh3jel99f8 (an example ID, not a real one).
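If you want to adapt the test script, the flags used in the commands above can be parsed along these lines. This is only a sketch of a compatible command-line interface, not the actual contents of test.py: the flag names `--url`, `--prompt`, and `--runpod_id` come from the commands above, while the function names, the help texts, and the choice to treat the first argument as a free-form service name are assumptions.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that accepts the flags shown in the commands above."""
    parser = argparse.ArgumentParser(
        description="Send a prompt to a deployed model."
    )
    # Assumption: the first positional argument names the service.
    parser.add_argument("service", help="service name, e.g. 'modal'")
    parser.add_argument("--url", required=True, help="URL of the deployed model")
    parser.add_argument("--prompt", required=True, help="prompt to send")
    # Only needed for RunPod, so it is optional here.
    parser.add_argument("--runpod_id", default=None, help="RunPod ID, if any")
    return parser


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse argv, or sys.argv[1:] when argv is None."""
    return build_parser().parse_args(argv)
```

Calling `parse_args(["modal", "--url", "<URL>", "--prompt", "hello"])` returns a namespace whose `runpod_id` is `None`, which a script could use to decide whether the RunPod-specific path applies.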

premAI-io/serverless-examples
