Stop burning money on LLM tokens.
Get 70-95% cost reduction through local RAG, intelligent routing, and containerized security.
LLMC is a local-first RAG (Retrieval-Augmented Generation) engine and intelligent router designed to drastically reduce the cost of using Large Language Models with your codebase.
Instead of sending your entire codebase to Claude or GPT-4, LLMC indexes your code locally, finds the exact relevant snippets (functions, classes, docs), and sends only what matters.
```mermaid
graph LR
A[User Query] --> B(LLMC Router);
B --> C{Local Index};
C -->|Search| D[Relevant Context];
D -->|Trim & Pack| E[Optimized Prompt];
E --> F[LLM API];
F --> G[Answer];
style B fill:#f9f,stroke:#333,stroke-width:2px
style E fill:#bfb,stroke:#333,stroke-width:2px
```
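In rough pseudocode, the flow in the diagram above is: search the local index (no tokens spent), pack the best matches into a fixed budget, and only then build a prompt for the API. A minimal sketch — the keyword scoring, `search`, and `trim` helpers here are illustrative stand-ins, not LLMC's actual API:

```python
# Toy end-to-end sketch of the retrieve -> trim -> prompt pipeline.
# All names and scoring below are illustrative, not LLMC internals.

def search(index, query, top_k=3):
    """Score snippets by naive keyword overlap with the query."""
    words = query.lower().split()
    scored = [(sum(w in text.lower() for w in words), text) for text in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for score, text in scored[:top_k] if score > 0]

def trim(snippets, budget_words):
    """Pack snippets into a fixed budget; drop whatever doesn't fit."""
    packed, used = [], 0
    for s in snippets:
        n = len(s.split())
        if used + n <= budget_words:
            packed.append(s)
            used += n
    return "\n".join(packed)

index = [
    "def check_token(req): return verify(req.headers['Auth'])",
    "class AuthMiddleware: wraps every request with check_token",
    "Makefile target for building docs",
]
context = trim(search(index, "authentication middleware"), budget_words=20)
prompt = f"Context:\n{context}\n\nQuestion: how does auth work?"
```

A real implementation would swap the keyword overlap for embedding similarity and count tokens with the target model's tokenizer, but the shape — search locally, trim to budget, then call out — is the same.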
Get up and running in seconds.

```bash
# One-line install
curl -sSL https://raw.githubusercontent.com/vmlinuzx/llmc/main/install.sh | bash

# Or via pip
pip install "git+https://github.com/vmlinuzx/llmc.git#egg=llmcwrapper[rag,tui,agent]"
```

```bash
cd /path/to/your/project
llmc-cli repo register .

# Search without using ANY tokens
llmc-cli search "authentication middleware"

# Launch the visual dashboard
llmc-cli tui
```

| Feature | Description |
|---|---|
| 💸 Massive Savings | Reduces token usage by 70-95% by sending only relevant context. |
| 🔒 Security First | New in v0.7.0: "Hybrid Mode" for trusted clients (host access) vs. Container Isolation for untrusted LLMs. |
| 🧠 Polyglot RAG | Smart parsing (TreeSitter) for Python, TS, JS, Go, Java, and technical docs. |
| 🕸️ GraphRAG | Understands your code structure (imports, calls, inheritance) to find related files automatically. |
| 🖥️ TUI Dashboard | Terminal UI to monitor indexing, search results, and costs. |
| 🔌 MCP Support | Full Model Context Protocol server to integrate seamlessly with Claude Desktop. |
### 🛠️ Core RAG Engine
- Local SQLite Index: Stores text + metadata without external dependencies.
- Smart Embeddings: Caches embeddings to avoid re-computing unchanged files.
- Context Trimmer: Packs the most relevant spans into a fixed token budget.
- Enrichment: Uses small local models to tag and summarize code for better retrieval.
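The embedding cache above can be pictured as keying vectors by a hash of the file content, so unchanged files are never re-embedded. A minimal sketch — `fake_embed` and the cache layout are illustrative, not LLMC's internals:

```python
import hashlib

def fake_embed(text):
    """Stand-in for a real embedding model call (the expensive part)."""
    return [float(len(text)), float(sum(map(ord, text)) % 97)]

class EmbeddingCache:
    def __init__(self):
        self.store = {}   # content hash -> vector
        self.misses = 0   # how many times we actually paid to embed

    def get(self, text):
        key = hashlib.sha256(text.encode()).hexdigest()
        if key not in self.store:
            self.misses += 1                  # only new/changed content costs anything
            self.store[key] = fake_embed(text)
        return self.store[key]

cache = EmbeddingCache()
cache.get("def login(): ...")
cache.get("def login(): ...")   # unchanged file: cache hit, no recompute
```

Because the key is the content hash rather than the filename, a file that is touched but not modified still hits the cache.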
### 🛡️ Security & MCP
- Hybrid Mode: Trusted clients get direct host access (~76% cheaper than routing through Docker).
- Container Isolation: Untrusted inputs run in Docker/nsjail.
- Defense in Depth: Even if an LLM is "jailbroken" by prompt injection, it can't escape the container.
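As a rough illustration of the isolation idea, an untrusted tool call can be wrapped in a locked-down `docker run` invocation: no network, read-only filesystem, capped resources, all capabilities dropped. The flags below are standard Docker options chosen for illustration; they are not necessarily the exact set LLMC uses:

```python
def sandboxed_cmd(image, command):
    """Build a docker run argv that denies network, writes, and extra privileges."""
    return [
        "docker", "run", "--rm",
        "--network", "none",        # no exfiltration path for a jailbroken model
        "--read-only",              # immutable root filesystem
        "--memory", "256m",         # resource caps
        "--pids-limit", "64",       # no fork bombs
        "--cap-drop", "ALL",        # drop all Linux capabilities
        image,
    ] + command

argv = sandboxed_cmd("python:3.12-slim", ["python", "-c", "print('hi')"])
```

Even if prompt injection convinces the model to emit hostile commands, they execute inside a container with no network and no writable filesystem — that is the defense-in-depth claim above.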
### 📊 Analytics & Routing
- Intelligent Failover: Cascades from Local → Cheap Cloud → Premium Models.
- Cost Tracking: Hard budget caps to prevent surprise bills.
- Rate Limiting: Automatic token bucket throttling for API providers.
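The token-bucket throttling mentioned above refills an allowance at a fixed rate and refuses requests when the bucket is empty. A textbook token bucket, not LLMC's implementation (real code would use a monotonic clock and a lock):

```python
class TokenBucket:
    def __init__(self, rate, capacity):
        self.rate = rate            # tokens added per second
        self.capacity = capacity
        self.tokens = capacity      # start full to allow an initial burst
        self.last = 0.0

    def allow(self, now, cost=1):
        # Refill based on elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False

bucket = TokenBucket(rate=2.0, capacity=5)
burst = [bucket.allow(now=0.0) for _ in range(6)]   # 5 allowed, then throttled
later = bucket.allow(now=1.0)                        # 2 tokens refilled by t=1s
```

The `capacity` bounds burst size while `rate` bounds sustained throughput — which is why providers with strict per-minute quotas are a natural fit for this scheme.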
Full documentation is available in the DOCS/ directory:
- Getting Started — Installation and quickstart
- User Guide — Configuration and daily usage
- Operations — Running the daemon and MCP integration
- Architecture — System design and internals
- Reference — CLI, config, and MCP tool reference
Originally created by David Carroll (the worst paragliding pilot in the TX Panhandle) after burning through his weekly API limits in days, LLMC was born of the need to code more while spending less.
We welcome PRs! Please check CONTRIBUTING.md before starting.
- Fork the repo
- Create your feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add some amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
Current Release: v0.9.1 "Back From Vacation"