Woz is a user-facing orchestrator for background coding agents. It routes work to coding agents, runs multi-agent experiments, analyzes output diffs, and helps evolve the harness that powers those agents.
This repo contains the Woz runtime (src/) and the skills that define its orchestration workflows (skills/).
Woz sits between a human operator and one or more background coding agents.
- For product-repo work, Woz delegates implementation to coding-agent branches.
- For comparison work, Woz runs the same prompt across multiple agent variants/rollouts.
- For harness evolution, Woz can update the coding-agent harness setup itself.
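The three modes above can be pictured as a small routing decision. The sketch below is purely illustrative: `Mode`, `pick_mode`, and both parameters are hypothetical names, not taken from the Woz runtime, and the real routing logic in src/ may weigh other signals.

```python
from enum import Enum


class Mode(Enum):
    """Hypothetical labels for the three kinds of work Woz handles."""
    DELEGATE = "delegate"      # product-repo work sent to a coding-agent branch
    EXPERIMENT = "experiment"  # same prompt across several variants/rollouts
    HARNESS = "harness"        # changes to the coding-agent harness setup


def pick_mode(targets_harness: bool, variant_count: int) -> Mode:
    """Assumed routing rule, for illustration only."""
    if targets_harness:
        return Mode.HARNESS
    if variant_count > 1:
        return Mode.EXPERIMENT
    return Mode.DELEGATE
```

A request aimed at the product repo with a single variant would fall through to `Mode.DELEGATE` under this assumed rule.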
We started with a mostly manual workflow:
- Long planning loops to avoid missing details.
- Frequent implementation defects even with good plans: duplication, subtle bugs, missing paths, semantic mismatches.
Adding critique loops and specialized agents increased quality significantly. We found that:
- There is no one-size-fits-all setup.
- Teams care about different failure modes.
- The right set of agents only becomes obvious while doing real work.
Our current loop is:
- Human-owned planning (often with OpenSpec).
- Woz routes implementation to one or more coding agents.
- Woz runs side-by-side experiments when needed.
- Woz analyzes diffs across resulting filesystems.
- Humans review, pick direction, and iterate.
Woz forwards a prompt to one coding-agent branch and returns task references for follow-up.
Woz creates a multi-agent experiment, tracks per-variant tasks, and reports status.
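One way to think about per-variant tracking is a small record per task plus a status roll-up. This is a minimal sketch under assumptions: the class names, fields, and status strings are hypothetical and do not describe Woz's actual data model.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class VariantTask:
    variant: str              # hypothetical variant label
    task_id: str              # hypothetical task reference
    status: str = "pending"   # assumed states: pending, running, done, failed


@dataclass
class Experiment:
    prompt: str
    tasks: list[VariantTask] = field(default_factory=list)

    def add_variant(self, variant: str, task_id: str) -> None:
        """Register one task for a variant running the shared prompt."""
        self.tasks.append(VariantTask(variant, task_id))

    def set_status(self, variant: str, status: str) -> None:
        """Update the first task that matches the variant label."""
        for task in self.tasks:
            if task.variant == variant:
                task.status = status
                return
        raise KeyError(variant)

    def summary(self) -> dict[str, int]:
        """Count tasks per status, e.g. for a status report."""
        return dict(Counter(task.status for task in self.tasks))

    def is_finished(self) -> bool:
        """True once every registered task is done or failed."""
        return bool(self.tasks) and all(
            task.status in {"done", "failed"} for task in self.tasks
        )
```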
Woz pulls task filesystems, computes diffs, and summarizes implementation differences.
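As a rough illustration of diffing two pulled filesystems, the sketch below compares two local directory trees using only the Python standard library. It assumes each task's filesystem has already been downloaded to a local directory; the function names are hypothetical and this is not Woz's own diff implementation.

```python
import difflib
from pathlib import Path


def list_files(root: Path) -> dict[str, Path]:
    """Map each file's path relative to root onto its full path."""
    return {
        p.relative_to(root).as_posix(): p
        for p in root.rglob("*")
        if p.is_file()
    }


def diff_trees(left: Path, right: Path) -> dict[str, list[str]]:
    """Classify files as present on one side only or changed on both."""
    a, b = list_files(left), list_files(right)
    changed = [
        rel
        for rel in sorted(a.keys() & b.keys())
        if a[rel].read_bytes() != b[rel].read_bytes()
    ]
    return {
        "only_left": sorted(a.keys() - b.keys()),
        "only_right": sorted(b.keys() - a.keys()),
        "changed": changed,
    }


def unified(left: Path, right: Path, rel: str) -> str:
    """Return a unified diff for one file that exists in both trees."""
    old = (left / rel).read_text(errors="replace").splitlines(keepends=True)
    new = (right / rel).read_text(errors="replace").splitlines(keepends=True)
    return "".join(difflib.unified_diff(old, new, f"a/{rel}", f"b/{rel}"))
```

The `changed` list is a natural input for a summarization step: only files that actually differ between variants need a closer look.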
Woz can implement changes in the harness repo and support branch-based deploy workflows.
- Python 3.12+
- uv
- An API key from Terminal Use
uv sync

Before deploying, update the agent namespace in config.yaml:
agent:
  name: woz-ns/woz

Replace woz-ns with your Terminal Use namespace slug (for example, acme/woz).
tu deploy --config config.yaml

- TERMINALUSE_API_KEY
- ANTHROPIC_API_KEY
- SLACK_BOT_TOKEN (for Slack-thread flows)
- src/: Woz runtime and routing logic.
- skills/: orchestration skills and helper scripts.
- tests/: unit tests for routing, delegation, hooks, and skills.
Note: This repo contains the Woz agent only. The companion platform — environment variable management, coding agent template installation, Slack bridge, and the web UI — will be open-sourced separately soon.
We also operate a hosted version of Woz. We're prioritizing users who want to build evals for their codebase and create auto-improving background coding agents.
