Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud.
-
Updated
Oct 29, 2024 - Python
Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud.
Model Deployment at Scale on Kubernetes 🦄️
Pybind11 bindings for Whisper.cpp
BentoML Example Projects 🎨
A simple web application that lets you replace any part of an image with an image generated based on your description.
This repository contains instructions, template source code and examples on how to serve/deploy machine learning models using various frameworks and applications such as Docker, Flask, FastAPI, BentoML, Streamlit, MLflow and even code on how to deploy your machine learning model as an android app.
A bentoML-powered API to transcribe audio and make sense of it
My repo for the Machine Learning Engineering bootcamp 2022 by DataTalks.Club
Generate novel text - novel finetuned from skt KoGPT2 base v2 - 한국어
Miscellaneous codes and writings for MLOps
MLOps Implementing "Brain Computer Interface" on Kubernetes
API serving for your diffusers models
DKT Project Served by Airflow / BentoML / Docker Swarm
Helm Chart for installing Yatai on Kubernetes ⎈
Deploying machine learning model using 10+ different deployment tools
Add a description, image, and links to the bentoml topic page so that developers can more easily learn about it.
To associate your repository with the bentoml topic, visit your repo's landing page and select "manage topics."