Stop guessing. Run the right pattern.
A CLI for executable agent knowledge.
Install · Demo · What's new in v0.4 · Why kairos · Quickstart · How it works · Commands · Configuration
v0.4.0 replaces the single llm-mcp shim with direct backends and makes kairos an MCP server. Highlights:
- Direct model backends -
stub,ollama,openai,anthropic, andopenai_compat. - MCP server mode -
kairos mcp serveexposes seven tools to Cursor, Claude Desktop, and custom MCP clients. - Safer default -
KAIROS_LLM_BACKENDdefaults tostub;mcpis removed with a clear migration error. - Audit gates closed - Ruff, mypy, Bandit, strict
pip-audit, Radon, Semgrep, MCP smoke, live Ollama, and KAI2 regression report are required CI checks.
Full migration notes: docs/UPGRADING.md. MCP setup: docs/MCP.md.
pip install kairos-agentThat's it. By default kairos uses the deterministic stub backend. Point it at Ollama, OpenAI, Anthropic, or any OpenAI-compatible server when you want live model calls.
# or, with uv
uv tool install kairos-agent# Windows
irm https://raw.githubusercontent.com/vinothhacks/kairos/v0.4.0/install.ps1 | iex# macOS / Linux
curl -fsSL https://raw.githubusercontent.com/vinothhacks/kairos/v0.4.0/install.sh | sh$ kairos init my-wiki && cd my-wiki
$ kairos run "Search the docs for caching and summarize" --dry
top-3 techniques for: Search the docs for caching...
┌──────┬───────────┬───────┬──────────────────────────────────┐
│ rank │ technique │ score │ rationale │
├──────┼───────────┼───────┼──────────────────────────────────┤
│ 1 │ rag │ 0.70 │ keyword boost 0.50, overlap x4 │
│ 2 │ react │ 0.65 │ keyword boost 0.50, overlap x3 │
│ 3 │ reflexion │ 0.05 │ overlap x1 │
└──────┴───────────┴───────┴──────────────────────────────────┘kairos looks at your task, queries its wiki of agent techniques, and tells you which pattern to run: RAG, ReAct, or Reflexion. Then it actually runs it.
You've read the LLM techniques. CoT, ReAct, Reflexion, ToT, HyDE, rerank — twenty patterns each with a paper, each with a use case, each easy to forget the morning you're three coffees into a real problem.
Most "LLM wikis" turn this into a static reading list. kairos turns it into a runtime decision. The wiki is the agent's playbook:
- Ingest raw sources (papers, transcripts, your own notes) → LLM-curated wiki pages.
- Query the wiki with natural language; answers cite real pages with
[[wikilinks]]. - Lint for contradictions, stale claims, and gaps. The wiki gets smarter with every run.
- Run any task — kairos picks the right technique by reading its own wiki, then executes it.
Three patterns ship with working runners (RAG, ReAct, Reflexion). The other 18 are documented and ready to be promoted from doc-only to runnable. You can extend it.
# 1. Bootstrap a project. Copies 21 seed concept pages.
kairos init my-wiki && cd my-wiki
# 2. Ingest a source.
kairos ingest research/karpathy-llm-wiki-gist.md
# 3. Ask a grounded question.
kairos query "When should I use ReAct over RAG?"
# 4. Lint the wiki.
kairos lint
# 5. Run a task — kairos auto-selects the technique.
kairos run "Search the docs for caching, then summarize"
# 6. Or pick the technique manually.
kairos run "Iteratively refine this paragraph" --technique reflexionEvery run logs to .kairos/kairos.db (SQLite). Every page lives in plain markdown. Every wikilink survives git diff.
flowchart LR
User([User]) --> CLI["typer CLI<br/>cli.py"]
CLI --> Cfg["config.load_config()<br/>env > .kairos/config.toml > defaults"]
Cfg --> RunCmd["cli run/query/lint/ingest"]
RunCmd -->|technique=auto| Selector["select_technique"]
Selector --> Idx["wiki_index<br/>(SQLite cache)"]
Idx -.cache miss.-> FS["wiki/ filesystem walk"]
Selector -->|optional --llm-rerank| Rerank["claude_send tie-break"]
Selector --> Rank["TechniqueChoice ranking"]
Rank --> Disp["runners.dispatch<br/>+ entry_points discovery"]
Disp --> ABC["Runner ABC"]
ABC --> RAG["RagRunner"]
ABC --> ReA["ReactRunner"]
ABC --> Refl["ReflexionRunner"]
ABC -.plugins.-> Plug["kairos-runner-*"]
RAG --> Rec["RunRecorder.finish<br/>selected_by + score"]
ReA --> Rec
Refl --> Rec
Rec --> DB[("kairos.db<br/>WAL + busy_timeout 5s")]
DB --> Runs["runs"]
DB --> FB["feedback (KAI-035)"]
DB --> WI["wiki_index"]
DB --> WR["wiki_relations"]
CLI --> Providers["LLM providers<br/>stub / ollama / openai / anthropic / compat"]
Providers --> LLM["model runtime"]
CLI --> MCPServer["kairos mcp serve<br/>stdio MCP tools"]
CLI --> Doc["kairos doctor<br/>provider probe"] --> Providers
CLI --> FBcmd["kairos feedback"] --> FB
Three layers, mirroring Karpathy's LLM Wiki gist:
raw/- your immutable inputs (papers, articles, transcripts). Source of truth.wiki/- LLM-generated, human-curated markdown pages. Lives in git.AGENTS.md- the schema. Tells future LLM passes how the structure works.
See docs/architecture.md for the full diagram.
| Command | What it does |
|---|---|
kairos init [path] |
Bootstrap AGENTS.md, raw/, wiki/, outputs/. Seeds 21 concept pages. |
kairos ingest <file> |
Read a source, propose new + updated wiki pages, log the diff. |
kairos query "<q>" |
Lexically retrieve pages, ask the LLM to synthesize, cite wikilinks. |
kairos lint |
Local: orphans, missing concepts, stale pages. LLM: contradictions, gaps. |
kairos run "<task>" |
Auto-select technique, dispatch runner, log the run. |
kairos run "<task>" --dry |
Show the top-3 candidate techniques without running. |
kairos history |
List recent runs from .kairos/kairos.db. |
kairos feedback-list |
List saved feedback rows. |
kairos mcp serve |
Serve kairos tools over stdio MCP. |
kairos doctor |
Print env diagnostics. |
kairos version |
Print version. |
| Count | Status | |
|---|---|---|
| Concept pages (seed wiki) | 21 | doc-only |
| Runner-backed techniques | 3 | RAG, ReAct, Reflexion |
| Tests | 115+ | unit, integration, regression |
| Storage backend | 1 | SQLite |
| LLM providers | 5 | stub, Ollama, OpenAI, Anthropic, OpenAI-compatible |
The 21 seed concept pages: rag, react, reflexion, chain-of-thought, tree-of-thoughts, self-consistency, self-refine, constitutional-ai, plan-and-execute, few-shot-prompting, zero-shot-prompting, function-calling, tool-use, prompt-injection, embedding-search, hybrid-search, hyde, rerank, router-agent, memory-buffer, llm-wiki.
| kairos | LLM-wiki gist | Notion AI | Obsidian + plugins | |
|---|---|---|---|---|
| Plain markdown source | yes | yes | no | yes |
| Diff-able in git | yes | yes | no | yes |
| Ingest sources via LLM | yes | yes | partial | with plugins |
| Lint for contradictions | yes | manual | no | no |
| Pick technique automatically | yes | no | no | no |
| Execute the technique | yes | no | no | no |
| Local-first model option | yes | no | no | partial |
| CLI-first | yes | no | no | no |
The wedge: executable wiki, not passive notes.
# Default: deterministic stub backend for offline tests
export KAIROS_LLM_BACKEND="stub"
# Local Ollama
export KAIROS_LLM_BACKEND="ollama"
export KAIROS_OLLAMA_MODEL="llama3.1"
# OpenAI / Anthropic
export KAIROS_LLM_BACKEND="openai"
export OPENAI_API_KEY="..."
export KAIROS_OPENAI_MODEL="gpt-4o-mini"
export KAIROS_LLM_BACKEND="anthropic"
export ANTHROPIC_API_KEY="..."
# LM Studio / vLLM / OpenRouter / Ollama OpenAI mode
export KAIROS_LLM_BACKEND="openai_compat"
export KAIROS_LLM_BASE_URL="http://localhost:1234/v1"
export KAIROS_LLM_MODEL="local-model"Per-project config lives in .kairos/config.toml. Run kairos doctor to see resolved values.
- v0.4 (now) — direct providers,
kairos mcp serve, required audit gates. - v0.5 — remove the compatibility
kairos.llm.mcp_clientimport shim and expand provider cassettes. - v1.0 — Plugin runners (
pip install kairos-runner-tot), web preview server.
See CHANGELOG.md for what landed in each release.
Found a wiki page that's wrong? Want a new technique runner? PRs welcome. Read CONTRIBUTING.md first.
git clone https://github.com/vinothhacks/kairos
cd kairos
uv pip install -e ".[dev]"
uv run pytestMIT © vinothhacks
The wiki pattern is straight out of Andrej Karpathy's LLM Wiki gist. The README structure follows jcode for the install-first / demo-first style. The technique catalog stands on the shoulders of every paper cited in the seed pages.