π¦
cargo install decapod && decapod init
Decapod
The governance kernel for AI coding agents.
Called on demand inside agent loops to turn intent into context, then context into explicit specifications before inference.
No daemon. No workflow tax. Just artifacts you can read, hash, and trust.
Your agent writes code. Nobody checks its homework.
Decapod is the layer that makes agents show their work. Before an agent mutates your repo, Decapod forces it to answer three questions: What did the human actually ask for? What boundaries apply? How will we prove it's done? The answers become artifacts β not comments, not logs, artifacts with hashes and validation gates β stored in .decapod/ where anyone on the team can see exactly what happened and why.
It ships with an embedded constitution: governance docs agents receive as just-in-time context so they query the rules on demand instead of guessing them.
Decapod doesn't replace your agent. It doesn't run in the background. It starts when called, writes its receipts, and exits. Two commands to adopt. Zero config to maintain.
"The spec was vibes." Your agent asks Decapod what the user actually meant. Decapod forces intent to crystallize β constraints, boundaries, acceptance criteria β before a single line is generated. The agent stops hallucinating requirements.
"Three agents, one repo, total chaos." Decapod coordinates shared state across parallel runs. No silent overwrites. No drift. Each agent gets an isolated workspace with a provenance trail.
"It passes CI but is it done?" Decapod gates completion on proof artifacts, not narrative claims. VERIFIED means every gate in the proof plan actually passed β not "the agent said it looks good."
Related: Evaluating AGENTS.md (ETH SRI, 2026) on context-file quality and agent cost/performance.
Buy us a coffee β
cargo install decapod
decapod initThat's it. Keep using Claude Code, Codex, Gemini CLI, Cursor β whatever you already use. Decapod gets called by your agent automatically when control-plane decisions are needed. Your workflow doesn't change; the agent just gets smarter about when to stop and think.
.decapod/
config.toml # project configuration
data/ # durable state (governance, memory, traces)
generated/
specs/ # intent, architecture, validation specs
artifacts/ # proof artifacts, internalizations, provenance
sessions/ # per-session provenance logs
AGENTS.md # universal agent contract
CLAUDE.md / CODEX.md / GEMINI.md # tool-specific entrypoints
Everything material is a file you can cat. No databases to query, no dashboards to check. The state is the directory.
- Ask your agent to make a real change. Watch
.decapod/generated/populate with new specs and proof artifacts. - Ask your agent to validate the work. It will report typed pass/fail gates, not "looks good to me."
- Ask the agent "what did Decapod change about your plan?" β it should cite spec and proof steps, not vibes.
Agent integration: AGENTS.md and tool-specific entrypoints (CLAUDE.md, CODEX.md, GEMINI.md) define the full operational contract your agent follows.
Override any constitution default with plain English in .decapod/OVERRIDE.md. Learn more about the embedded constitution.
AI coding agents are extraordinarily good at generating code. They are extraordinarily bad at knowing when to stop, what not to touch, and whether the thing they built is the thing you asked for.
The failure mode isn't "bad code." It's unaccountable code: no intent recorded, no boundaries enforced, no proof that completion criteria were met. You get a PR that compiles. You have no idea if it's right.
Decapod closes that gap. Agents call it mid-run to lock intent, enforce boundaries, and prove completion. It shapes what goes into inference without doing inference itself.
State is local and durable in .decapod/. Context, decisions, and traces persist across sessions and stay retrievable over time. Nothing hides. Nothing phones home.
Every Decapod operation returns one of three things:
| Signal | What it does | Think of it as |
|---|---|---|
| Advisory | Tightens intent, reduces wasted loops | Guardrails |
| Interlock | Hard policy boundary β blocks unsafe flow | Circuit breaker |
| Attestation | Structured proof that criteria actually passed | Receipt |
Human Intent
|
v
AI Agent(s) <----> Decapod <----> Repository + Policy
| | |
| | +-- Interlock (enforced boundaries)
| +----- Advisory (guided execution)
+-------- Attestation (verifiable outcomes)
- Daemonless. No background process. The binary starts, does its job, exits.
- Two-command install. Install and init. Done.
- Agent-agnostic. Works with Claude, Codex, Gemini, Cursor, and anything else that can shell out.
- Parallel-safe. Multiple agents, one repo, no collisions.
- Proof-gated completion.
VERIFIEDrequires passing proof-plan results, not narrative. - Fully auditable. Every decision, trace, and proof artifact lives in
.decapod/as plain files. - Context internalization. Turn long documents into mountable, verifiable context adapters so agents stop re-ingesting the same 50-page spec every session.
The deep surface area β interfaces, capsules, eval kernel, knowledge promotions, obligation graphs β lives in the embedded constitution. Ask your agent to explore it.
git clone https://github.com/DecapodLabs/decapod
cd decapod
cargo build
cargo test- CONTRIBUTING.md β development guide
- SECURITY.md β security policy
- CHANGELOG.md β release history
MIT. See LICENSE.