Run, benchmark, and serve Large Language Models locally with llama.cpp on GPU.
This repo gives you one-command scripts, persistent model management, OpenAI-compatible serving, and a repeatable benchmarking pipeline with plots.
- Local inference with CUDA via Docker (`ghcr.io/ggerganov/llama.cpp:full-cuda`)
- OpenAI-compatible server (`/v1/chat/completions`) for easy app integration
- Self-contained model workflow: the first run downloads the GGUF into `models/`, later runs reuse it
- Benchmarks that matter: automated sweeps + CSV + Markdown summary + charts
- Polished automation: `Makefile` + `venv` so anyone can reproduce the results
```bash
# 1) Clone
git clone https://github.com/shuvanon/local-llm-setup.git
cd local-llm-setup

# 2) Set up the Python env for analysis & plots (matplotlib, pandas)
make venv
source .venv/bin/activate

# 3) Try a single prompt (downloads the model on first run)
./scripts/run_llm.sh "Write an intro about federated learning." 64
```
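The exact contents of `scripts/run_llm.sh` aren't reproduced here, but given the image named above it presumably wraps a `docker run` call roughly like the following. The model filename and GPU layer count are placeholders, not the repo's actual values:

```bash
# Hypothetical sketch of what scripts/run_llm.sh might invoke (not the actual script).
docker run --rm --gpus all \
  -v "$PWD/models:/models" \
  ghcr.io/ggerganov/llama.cpp:full-cuda \
  --run -m /models/your-model.gguf \
  -p "Write an intro about federated learning." \
  -n 64 --n-gpu-layers 99
```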
```bash
# 4) Start API server (OpenAI-compatible)
./scripts/serve_llm.sh
```
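Similarly, `scripts/serve_llm.sh` likely starts llama.cpp's built-in HTTP server through the same image. A rough sketch, assuming the default port 8080 and a placeholder model file:

```bash
# Hypothetical sketch of what scripts/serve_llm.sh might invoke (not the actual script).
docker run --rm --gpus all \
  -p 8080:8080 \
  -v "$PWD/models:/models" \
  ghcr.io/ggerganov/llama.cpp:full-cuda \
  --server -m /models/your-model.gguf \
  --host 0.0.0.0 --port 8080 --n-gpu-layers 99
```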
# in another terminal:
curl http://localhost:8080/v1/chat/completions \
-H "Content-Type: application/json" \
-d @examples/chat_request.json
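`examples/chat_request.json` isn't reproduced here, but the endpoint accepts the standard OpenAI chat-completions schema, so a payload of that shape can also be sent inline (the prompt and sampling values below are illustrative):

```bash
# Same call with an inline body instead of the JSON file (values are illustrative).
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "Write an intro about federated learning."}
    ],
    "max_tokens": 64,
    "temperature": 0.7
  }'
```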
```bash
# 5) Run a batch sweep + summarize (CSV + Markdown + charts)
make benchmark
```
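Not part of the repo's own tooling, but a quick way to confirm the GPU is actually doing the work during the sweep is to watch utilization in a second terminal:

```bash
# Generic check (not specific to this repo): report GPU utilization/memory once per second.
nvidia-smi --query-gpu=utilization.gpu,memory.used,memory.total --format=csv -l 1
```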