THIS IS NOT AN AGENT! IT IS A README FILE ABOUT THE AGENT SYSTEM.
A comprehensive multi-agent system for automated software development using Claude AI. This system implements a complete development workflow from product planning to deployment, following Scrum methodologies and a structured 4-tier testing approach.
This repository contains a collection of specialized AI agents that work together to manage the entire software development lifecycle. Each agent operates independently but coordinates through a per-task storage model under .plan/tasks using index.json (task registry) and individual per-task files, plus an append-only events.log. The system follows strict Test-Driven Development (TDD) principles.
- Learning Capture: Agents submit learnings to the Learner for validation
- Human Validation: Dashboard Self-Improvement tab for learning review
- Instruction Updates: Agent-Improver converts validated learnings into agent instructions
- Change Tracking: Complete audit trail of all agent modifications with revert capability
- Selective Task Reading: Agents read
.plan/tasks/index.jsonto locate relevant tasks and then open only the needed per-task files (e.g.,.plan/tasks/<task_id>.json). If the per-task structure is missing, agents may temporarily fallback to legacytasks.json. - Instruction Area Management: Maximum 20 concise instructions per agent
- Prompt Optimization: Agent-Improver can propose full prompt integration with human approval
- Coder/Tester/Reviewer → Product-Manager → Product-Owner → Architecturer → Strategist
- Higher-level agents notify delegated agents after decisions
- Strict task-based communication (no direct agent-to-agent communication)
- Strategist: High-level strategy and architecture decisions
- Product-Owner: Requirements gathering and user story creation
- Architecturer: Technology research, architectural guidance, performance optimization analysis
- Product-Manager: Project coordination and timeline management
- Task-Coder: Feature implementation and bug fixes
- Code-Reviewer: Code quality and security review
- Tester: Testing strategy and quality assurance
- DevOps-Engineer: Deployment and infrastructure management
- UI-Test-Designer: UI test workflow design
- UI-Tester: Automated UI test execution
- Cleaner: Code cleanup and maintenance
- Human-Concierge: Human-agent bridge, request processing, escalation management
- Learner: Learning management, knowledge organization, system improvement tracking
- Agent-Improver: Agent instruction updates, prompt optimization, change management
- Compliance-Officer: Ensures regulatory compliance, data protection, and audit readiness
Our testing approach provides quality gates at each development stage:
- Unit Tests: Per-task validation (1-3 min)
- Smoke Tests: Sprint completion validation (5-10 min)
- Sanity Tests: Pre-staging deployment validation (15-20 min)
- Regression Tests: Pre-production comprehensive validation (30-60 min)
- Test-Driven Development: Comprehensive TDD workflow with automated test generation
- Scrum Integration: Epic and user story management with sprint planning
- Quality Assurance: Multi-level code review and testing processes
- UI Testing: Specialized UI test design and execution with browser automation
- Deployment Automation: Automated deployment workflows with quality gates
- Task Management: Per-task JSON tracking under
.plan/tasks/withindex.json, append-onlyevents.log, and task-scoped logs/artifacts
- Planning Phase: Product-Manager creates epics and user stories
- Architecture Phase: Architecturer provides technology guidance and performance analysis
- Development Phase: Task-Coder implements features following TDD
- Testing Phase: 4-tier testing validation with quality gates
- Review Phase: Code-Reviewer ensures quality standards
- Deployment Phase: DevOps-Engineer manages deployment processes
- Task Level: Unit tests must pass (100% pass rate)
- Sprint Level: Smoke tests validate core functionality
- Staging Level: Sanity tests ensure business-critical paths
- Production Level: Regression tests provide comprehensive validation
| Test Level | Trigger | Duration | Priority | Pass Rate | Deployment Gate |
|---|---|---|---|---|---|
| Unit | Task completion | 1-3 min | All | 100% | Task approval |
| Smoke | Sprint completion | 5-10 min | Critical | 100% | Staging ready |
| Sanity | Pre-staging | 15-20 min | Critical + High | 100% | Staging deploy |
| Regression | Pre-production | 30-60 min | All levels | 95%+ | Production deploy |
- Shift-Left Testing: Catch issues early in development
- Risk-Based Testing: Focus on business-critical functionality
- Automated Quality Gates: Prevent deployment of defective code
- Continuous Feedback: Rapid feedback to development teams
Refer to the individual agent files in the agents/ directory for specific implementation details and workflows. Each agent contains detailed instructions for setup and operation.
├── agents/ # Agent definitions and workflows
├── .plan/ # Project data (real mode)
│ ├── tasks/
│ │ ├── index.json # Task registry (lightweight)
│ │ ├── T-XXXX.json # Per-task files (one per task)
│ ├── events.log # Append-only audit log
│ ├── logs/<task_id>/... # Per-task logs
│ └── .artifacts/<task_id>/ # Per-task artifacts (large outputs)
├── .demo/.plan/ # Demo data (mirrors real structure)
├── .templates/ # Templates for initialization
│ ├── .plan/ # Planning templates (epics, roadmaps, etc.)
│ └── tests/ # Testing templates and 4-tier suites
└── README.md # This file
- Task-Coder: Commits only to
development. Never push/merge tostagingormain. - DevOps-Engineer: Sole owner of merges:
development → staging(staging deploy) andstaging → main(production deploy).
- All JSON and Markdown files managed by agents must reside under
.plan/ortests/in real mode, and under.demo/.plan/or.demo/tests/in demo mode. - Only
CLAUDE.md(andCLAUDE.md.templatein templates) should live at the project root.
The system implements comprehensive quality assurance through:
- Automated Testing: 4-tier testing approach with clear quality gates
- Code Review: Mandatory review process with standards compliance
- Deployment Gates: Quality checks at each deployment stage
- Continuous Monitoring: Performance and reliability tracking
- Failure Response: Structured protocols for handling test failures
To use these agents with Claude Code:
-
Copy the agent
.mdfiles from theagentsfolder in this repository into the.claude/agentsfolder in your Claude Code folder (or project folder). -
Add the prompt from the
.templates/CLAUDE.md.templatefile to the begining of theCLAUDE.mdfile in your Claude Code or project folder. -
Restart Claude Code.
The Claude Code Agents Dashboard provides a web-based interface for monitoring and managing your agents, tasks, and human requests.
# Navigate to dashboard directory
cd dashboard
# Start the dashboard (installs dependencies automatically)
./start-dashboard.sh
# Or start manually
npm install
npm startThe dashboard will be available at:
- Client: http://localhost:3001
- Server API: http://localhost:3002
The dashboard determines its mode dynamically:
- Demo Mode: Active when running with
NODE_ENV=demoor when no per-task structure is detected. Data is under.demo/.plan/with tasks in.demo/.plan/tasks/usingindex.json+ per-task files. This mode is writable and file watching is enabled. - Real Mode: Active when a per-task structure is present (e.g.,
.plan/tasks/index.jsonor any task JSON in.plan/tasks/). Data is under.plan/with tasks in.plan/tasks/. On first run, any missing files are initialized from.templates/.planand.templates/tests(templates copied as-is;.templatesuffix removed for.planfiles). If no per-task files are found, the dashboard may fallback to legacy.plan/tasks.jsonif present.
- Real-time Task Monitoring: View and manage tasks with drag-and-drop functionality
- Human Request Management: Handle and respond to human input requests
- Roadmap Visualization: View and edit project roadmaps
- Agent Status: Monitor agent activity and system health
- WebSocket Integration: Real-time updates without page refresh
The dashboard service is managed externally through starter scripts, not through the agent system. The dashboard orchestrates Claude Code instances rather than being managed by them.
To start the dashboard:
cd dashboard
npm startThe dashboard will then be available to spawn and manage Claude Code agent instances.
- Do NOT request features or report bugs directly to Claude Code.
- Instead, add them to
.plan/human-requests.mdin your project. - The Human-Concierge agent will process all new requests and coordinate with the Product-Manager for roadmap integration.
- Deduplication: Before adding a new item, scan existing entries in
.plan/human-requests.mdand open items via.plan/tasks/index.json(then inspect only the relevant per-task files). If the per-task structure is missing, check legacy.plan/tasks.json. If a pending item already addresses the same area or intent, update that existing entry instead of creating a new one.
- Regularly check
.plan/human-requests.mdfor pending requests requiring your attention. - If an agent is blocked (needs clarification or information), it will add a TODO here.
- Respond to the TODO in the same file to unblock the task.
Templates for files that will be used by the agents (like the main orchestrator prompt) are stored in the .templates directory and have a .template extension.
CLAUDE.md.template- Main orchestrator prompt. Used by the Orchestrator agent to coordinate all other agents.
- Task storage is initialized on first run for the per-task structure under
.plan/tasks/(registryindex.json+ per-task files). Some repositories may include a legacytasks.json.templatefor migration support. review-report.md.template- Code review findings. Written by Code-Reviewer, read by Task-Coder.human-requests.md.template- Unified file for all human-system interactions including feature requests, bug reports, agent clarifications, and strategic decisions. Written by agents and users, processed by Human-Concierge.
product_vision.md.template- High-level project goals and stages. Written by Strategist, read by Product-Owner.roadmap.md.template- Product roadmap with stages, milestones, epics, and sprints. Organized by Product-Manager, enhanced by Product-Owner.epic.md.template- Epic definitions with technical architecture and sprint breakdown. Written by Product-Owner.user_stories.md.template- Detailed requirements and user stories. Written by Product-Owner, read by Product-Manager.
test_plan.md.template- Test suite organization. Used by Tester to plan test cases.test_example.py.template- Example test file. Written/executed by Tester.test_log.md.template- Test execution results. Written by Tester, read by Tester/Product-Manager.ui_test_log.md.template- UI test execution logs. Written by UI-Tester with detailed browser automation results.
deployment_workflow.md.template- Deployment process with UI testing integration and failure handling.task_schema.json.template- Complete task schema including UI test task types and structure.
-
Agent Tools - Most agents have access to all tools by default. Change this in your agent .md file by adding a
toolssection for enhanced secutiry. Check Claude Code docs. -
Full Permissions - Claude Code instances triggered by our system have the --dangerouslyskippermissions flag set to true. This is needed for system to work, and permit Claude Code to access to all system resources and tools without restrictions, but introduces risks, that can lead to security breaches or unexpected bad behaviour, including data loss.
-
We recommend some good practices:
- Use Git cloud providers (Github/Gitlab/Bitbucket), and commit changes regularly. Out Task-Coder is trained to commit after every succeded task, but you need to initialize the repository first.
- Monitor Claude Code's behaviour and logs closely.
- Restrict access of tools to only those that are necessary for the agent to perform its tasks.
- Regularly review and update tool access permissions.