Codev is an operating system for structured human-AI collaboration. You write specs and plans that AI agents execute reliably.
Results: In head-to-head comparison, SPIDER scored 92-95 vs VIBE's 12-15 on the same task. See case study
Quick Links: FAQ | Tips | Cheatsheet | CLI Reference | Why Codev? | Discord
- Quick Start
- Learn About Codev
- What is Codev?
- The SPIDER Protocol
- Agent Farm
- Example Implementations
- Quick Start & Prerequisites
- Contributing
# 1. Install
npm install -g @cluesmith/codev
# 2. Initialize a project
mkdir my-project && cd my-project
codev init
# 3. Verify setup
codev doctor
# 4. Start the dashboard (optional)
af startThen tell your AI agent: "I want to build X using the SPIDER protocol"
CLI Commands:
codev- Main CLI (init, adopt, doctor, update)af- Agent Farm for parallel AI buildersconsult- Multi-model consultation
See CLI Reference for details.
Core (required):
| Dependency | Install | Purpose |
|---|---|---|
| Node.js 18+ | brew install node |
Runtime |
| git 2.5+ | (pre-installed) | Version control |
| AI CLIs | See below | All three recommended |
AI CLIs (install all three for multi-model consultation):
- Claude Code:
npm install -g @anthropic-ai/claude-code - Gemini CLI: github.com/google-gemini/gemini-cli
- Codex CLI:
npm install -g @openai/codex
Agent Farm (optional):
| Dependency | Install | Purpose |
|---|---|---|
| tmux 3.0+ | brew install tmux |
Terminal multiplexer |
| ttyd 1.7+ | brew install ttyd |
Web terminal |
| gh | brew install gh |
GitHub CLI |
See DEPENDENCIES.md for complete details.
Common questions about Codev: FAQ
Practical tips for getting the most out of Codev: Tips & Tricks
Quick reference for Codev's philosophies, concepts, and tools: Cheatsheet
Watch a brief overview of what Codev is and how it works.
Generated using NotebookLM - Visit the notebook to ask questions about Codev and learn more.
Join the conversation in GitHub Discussions or our Discord community! Share your specs, ask questions, and learn from the community.
Get notified of new discussions: Click the Watch button at the top of this repo → Custom → check Discussions.
A comprehensive walkthrough of the Codev methodology and its benefits.
Watch a real development session using Agent Farm - from spec to merged PR in 30 minutes. Demonstrates the Architect-Builder pattern with multi-model consultation.
See Codev in action! Follow along as we use the SPIDER protocol to build a conversational todo list manager from scratch:
This tour demonstrates:
- How to write specifications that capture all requirements
- How the planning phase breaks work into manageable chunks
- The IDE loop in action (Implement → Defend → Evaluate)
- Multi-agent consultation with GPT-5 and Gemini Pro
- How lessons learned improve future development
Codev is a development methodology that treats natural language context as code. Instead of writing code first and documenting later, you start with clear specifications that both humans and AI agents can understand and execute.
📖 Read the full story: Why We Created Codev: From Theory to Practice - Learn about our journey from theory to implementation and how we built a todo app without directly editing code.
- Context Drives Code - Context definitions flow from high-level specifications down to implementation details
- Human-AI Collaboration - Designed for seamless cooperation between developers and AI agents
- Evolving Methodology - The process itself evolves and improves with each project
Our flagship protocol for structured development:
- Specify - Define what to build in clear, unambiguous language
- Plan - Break specifications into executable phases
- For each phase: Implement → Defend → Evaluate
- Implement: Build the code to meet phase objectives
- Defend: Write comprehensive tests that protect your code—not just validation, but defensive fortifications against bugs and regressions
- Evaluate: Verify requirements are met, get user approval, then commit
- Review - Capture lessons and improve the methodology
After running codev init or codev adopt, your project has a minimal structure:
your-project/
├── codev/
│ ├── specs/ # Feature specifications
│ ├── plans/ # Implementation plans
│ ├── reviews/ # Review and lessons learned
│ └── projectlist.md # Project tracking
├── AGENTS.md # AI agent instructions (AGENTS.md standard)
├── CLAUDE.md # AI agent instructions (Claude Code)
└── [your code]
Codev is designed to be customized for your project's needs. The codev/ directory is yours to extend:
- Add project-specific protocols - For example, Codev itself has a
releaseprotocol specific to npm publishing - Customize existing protocols - Modify SPIDER phases to match your team's workflow
- Add new roles - Define specialized consultant or reviewer roles
The framework provides defaults, but your local files always take precedence.
In much the same way an operating system has a memory hierarchy, Codev repos have a context hierarchy. The codev/ directory holds the top 3 layers. This allows both humans and agents to think about problems at different levels of detail.
Key insight: We build from the top down, and we propagate information from the bottom up. We start with an entry in the project list, then spec and plan out the feature, generate the code, and then propagate what we learned through the reviews.
- Specifications and plans drive implementation
- All decisions captured in version control
- Clear traceability from idea to implementation
- Structured formats that AI agents understand
- Multi-agent consultation support (GPT-5, Gemini Pro, etc.)
- Reduces back-and-forth from dozens of messages to 3-4 document reviews
- Supports both AGENTS.md standard (Cursor, Copilot, etc.) and CLAUDE.md (Claude Code)
- Every project improves the methodology
- Lessons learned feed back into the process
- Templates evolve based on real experience
Both projects below were given the exact same prompt to build a Todo Manager application using Claude Code with Opus. The difference? The methodology used:
- Built using a VIBE-style prompt approach
- Shows rapid prototyping with conversational AI interaction
- Demonstrates how a simple prompt can drive development
- Results in working code through chat-based iteration
- Built using the SPIDER protocol with full document-driven development
- Same requirements, but structured through formal specifications and plans
- Demonstrates all phases: Specify → Plan → (IDE Loop) → Review
- Complete with specs, plans, and review documents
- Multi-agent consultation throughout the process
📊 Automated Multi-Agent Analysis (click to expand)
Note: This comparison was generated through automated analysis by 3 independent AI agents (Claude, GPT-5, and Gemini Pro), not human review.
| Aspect | VIBE | SPIDER |
|---|---|---|
| Overall Score | 12-15 | 92-95 |
| Functionality | 0 | 100 |
| Test Coverage | 0 | 85 |
| Documentation | 0 | 95 |
| Architecture | N/A | 90 |
| Production Readiness | 0 | 85 |
VIBE: 3 files (boilerplate only), 0% functionality, 0 tests, no database, no API
SPIDER: 32 source files, 100% functionality, 5 test suites, SQLite + Drizzle ORM, complete REST API, full component architecture, MCP integration, TypeScript + Zod validation
As GPT-5 noted: "SPIDER's methodology clearly outperformed... Plan-first approach with defined scope, iterative verification, and delivery mindset"
The verdict: Context-driven development ensures completeness, while conversational approaches can miss the mark entirely despite identical prompts and AI models.
Codev is self-hosted - we use Codev methodology to build Codev itself. This means:
- Our test infrastructure is specified in
codev/specs/0001-test-infrastructure.md - Our development process follows the SP(IDE)R protocol we advocate
- Our improvements come from lessons learned using our own methodology
This self-hosting approach ensures:
- The methodology is battle-tested on real development
- We experience the same workflow we recommend to users
- Any pain points are felt by us first and fixed quickly
- The framework evolves based on actual usage, not theory
You can see this in practice:
- Check
codev/specs/for our feature specifications - Review
codev/plans/for how we break down work - Learn from
codev/reviews/to see what we've discovered
This repository has a dual nature:
-
codev/- Our instance of Codev for developing Codev itself- Contains our specs, plans, reviews, and resources
- Example:
codev/specs/0001-test-infrastructure.mddocuments how we built our test suite
-
codev-skeleton/- The template that gets installed in other projects- Contains protocol definitions, templates, and agents
- What users get when they install Codev
- Does NOT contain specs/plans/reviews (those are created by users)
In short: codev/ is how we use Codev, codev-skeleton/ is what we provide to others.
Test Infrastructure (click to expand)
Our comprehensive test suite (64 tests) validates the Codev installation process:
- Framework: Shell-based testing with bats-core (zero dependencies)
- Coverage: SPIDER protocol, CLAUDE.md preservation, agent installation
- Isolation: XDG sandboxing ensures tests never touch real user directories
- CI/CD Ready: Tests run in seconds with clear TAP output
./scripts/run-tests.sh # Fast tests (< 30 seconds)
./scripts/run-all-tests.sh # All tests including Claude CLI
./scripts/install-hooks.sh # Install pre-commit hookSee tests/README.md for details.
See examples/todo-manager/ for a complete walkthrough showing:
- How specifications capture all requirements
- How plans break work into phases
- How the IDE loop ensures quality
- How lessons improve future development
Templates in codev/protocols/spider/templates/ can be modified to fit your team's needs:
spec.md- Specification structureplan.md- Planning formatlessons.md- Retrospective template
Agent Farm is an optional companion tool for Codev that provides a web-based dashboard for managing multiple AI agents working in parallel. You can use Codev without Agent Farm - all protocols (SPIDER, TICK, etc.) work perfectly in any AI coding assistant.
Why use Agent Farm?
- Web dashboard for monitoring multiple builders at once
- Protocol-aware - knows about specs, plans, and Codev conventions
- Git worktree management - isolates each builder's changes
- Automatic prompting - builders start with instructions to implement their assigned spec
Current limitations:
- Currently optimized for Claude Code (uses
-pflag,--append-system-prompt, etc.) - Requires ttyd and tmux for terminal embedding
- macOS-focused (should work on Linux but less tested)
For parallel AI-assisted development, Codev includes the Architect-Builder pattern:
- Architect (you + primary AI): Creates specs and plans, reviews work
- Builders (autonomous AI agents): Implement specs in isolated git worktrees
# Start the architect dashboard
af start
# Spawn a builder for a spec
af spawn --project 0003
# Check status
af status
# Stop everything
af stopThe af command is globally available after installing @cluesmith/codev.
Access your Agent Farm dashboard from another device (tablet, phone, or laptop):
# Enable remote access (binds to 0.0.0.0)
af start --allow-insecure-remoteThen open http://<your-machine-ip>:4200 from any device on your network.
Find your IP:
# macOS
ipconfig getifaddr en0
# Linux
hostname -I | awk '{print $1}'--allow-insecure-remote flag provides no authentication. Only use on trusted networks. For secure remote access, use SSH tunneling:
# From remote machine, create secure tunnel
ssh -L 4200:localhost:4200 user@dev-machine
# Then open http://localhost:4200See CLI Reference for full details.
Builders need permission-skipping flags to run autonomously without human approval prompts:
| CLI Tool | Flag | Purpose |
|---|---|---|
| Claude Code | --dangerously-skip-permissions |
Skip permission prompts for file/command operations |
| Gemini CLI | --yolo |
Enable autonomous mode without confirmations |
Configure in codev/config.json:
{
"shell": {
"architect": "claude --dangerously-skip-permissions",
"builder": "claude --dangerously-skip-permissions"
}
}Or for Gemini:
{
"shell": {
"architect": "gemini --yolo",
"builder": "gemini --yolo"
}
}Warning: These flags allow the AI to execute commands and modify files without asking. Only use in development environments where you trust the AI's actions.
See INSTALL.md for full documentation.
Codev has a release protocol (codev/protocols/release/) that automates the entire release process. To release a new version:
Let's release v1.4.0
The AI guides you through: pre-flight checks, maintenance cycle, E2E tests, version bump, release notes, GitHub release, and npm publish.
Releases are named after great examples of architecture from around the world. See Release Notes for version history.
We welcome contributions of any kind! Talk to us on Discord or open an issue.
We especially welcome contributions to Agent Farm - help us make it work with more AI CLIs and platforms.
MIT - See LICENSE file for details
Built with Codev - where context drives code




