A self-hosted AI chatbot that runs locally on your machine using vLLM for inference and Gradio for the frontend. This project is containerized with Docker, making it easy to set up and run.
- Local Inference: No need for external APIs—everything runs on your machine.
- GPU Support: Optimized for CUDA-enabled GPUs for faster inference.
- User-Friendly Interface: A simple and intuitive chat interface powered by Gradio.
- Dockerized: Easy to set up and run with Docker.
vLLM Server:
- The vLLM server runs the `facebook/opt-125m` model and exposes an OpenAI-compatible API endpoint at `http://localhost:8000/v1`.
- It processes user prompts and generates responses using the model (a minimal client sketch follows below).
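Because the endpoint is OpenAI-compatible, any HTTP client can talk to it. Below is a minimal sketch using `requests`; the route and payload shape follow vLLM's OpenAI-compatible completions API, while the prompt and sampling parameters are illustrative values, not settings taken from this project.

```python
import requests

# Ask the vLLM server (assumed to be running locally on port 8000)
# for a completion via its OpenAI-compatible REST API.
response = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "facebook/opt-125m",
        "prompt": "Hello! How are you today?",  # illustrative prompt
        "max_tokens": 64,                       # illustrative sampling settings
        "temperature": 0.7,
    },
    timeout=60,
)
response.raise_for_status()

# The response mirrors the OpenAI schema: generated text is in choices[0]["text"].
print(response.json()["choices"][0]["text"])
```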
Gradio Frontend:
- The Gradio frontend provides a web-based chat interface.
- It sends user messages to the vLLM server and displays the generated responses (a frontend sketch follows below).
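To show how little code this takes, here is a minimal sketch of such a frontend built on `gr.ChatInterface`; it assumes the vLLM endpoint above and is not necessarily the exact code shipped in this repository.

```python
import gradio as gr
import requests

VLLM_URL = "http://localhost:8000/v1/completions"  # endpoint exposed by the vLLM server


def respond(message, history):
    # Forward the user's message to the vLLM server and return the generated text.
    resp = requests.post(
        VLLM_URL,
        json={"model": "facebook/opt-125m", "prompt": message, "max_tokens": 128},
        timeout=60,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["text"]


# gr.ChatInterface wraps the function in a ready-made chat UI.
gr.ChatInterface(respond).launch(server_name="0.0.0.0", server_port=7860)
```

Binding to `0.0.0.0` makes the UI reachable from outside a container; 7860 is Gradio's default port.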
Docker Container:
- The entire system is packaged into a Docker container for easy deployment (an example build-and-run command follows below).
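As an illustration, building and running such a container typically looks like the following; the image tag and port mappings are assumptions for this sketch, so check the project's actual Dockerfile and documentation for the real values.

```bash
# Build the image (the tag name is assumed for illustration).
docker build -t self-hosted-chatbot .

# Run with GPU access; omit --gpus all to fall back to CPU.
# 8000 = vLLM API, 7860 = Gradio UI (assumed mappings).
docker run --gpus all -p 8000:8000 -p 7860:7860 self-hosted-chatbot
```

Note that `--gpus all` requires the NVIDIA Container Toolkit on the host.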
Prerequisites:
- Docker installed on your machine.
- NVIDIA GPU with CUDA support (optional but recommended for faster inference).
- Clone the Repository:

  ```bash
  git clone https://github.com/your-username/self-hosted-chatbot.git
  cd self-hosted-chatbot
  ```