Woz: How We Harness Background Coding Agents

Woz is a user-facing orchestrator for background coding agents. It routes work to coding agents, runs multi-agent experiments, analyzes output diffs, and helps evolve the harness that powers those agents.

This repo contains the Woz runtime (src/) and the skills that define its orchestration workflows (skills/).

What Woz Is

Woz sits between a human operator and one or more background coding agents.

For product-repo work, Woz delegates implementation to coding-agent branches.
For comparison work, Woz runs the same prompt across multiple agent variants/rollouts.
For harness evolution, Woz can update the coding-agent harness setup itself.

Why We Built It

We started with a mostly manual workflow:

Long planning loops to avoid missing details.
Frequent implementation defects even with good plans: duplication, subtle bugs, missing paths, semantic mismatches.

Adding critique loops and specialized agents increased quality significantly. We found that:

There is no one-size-fits-all setup.
Teams care about different failure modes.
The right set of agents only becomes obvious while doing real work.

How We Use Woz

Our current loop is:

Human-owned planning (often with OpenSpec).
Woz routes implementation to one or more coding agents.
Woz runs side-by-side experiments when needed.
Woz analyzes diffs across resulting filesystems.
Humans review, pick direction, and iterate.

Core Flows

1. Run one coding agent

Woz forwards a prompt to one coding-agent branch and returns task references for follow-up.

2. Run multiple coding agents / rollouts

Woz creates a multi-agent experiment, tracks per-variant tasks, and reports status.

3. Analyze outputs

Woz pulls task filesystems, computes diffs, and summarizes implementation differences.

4. Update coding-agent harness

Woz can implement changes in the harness repo and support branch-based deploy workflows.

Quickstart

Prerequisites

Python 3.12+
uv
An API key from Terminal Use

Install dependencies

uv sync

Deploy Woz

Before deploying, update the agent namespace in config.yaml:

agent:
  name: woz-ns/woz

Replace woz-ns with your Terminal Use namespace slug (for example, acme/woz).

tu deploy --config config.yaml

Env Vars

TERMINALUSE_API_KEY
ANTHROPIC_API_KEY
SLACK_BOT_TOKEN (for Slack-thread flows)

Repo Layout

src/: Woz runtime and routing logic.
skills/: orchestration skills and helper scripts.
tests/: unit tests for routing, delegation, hooks, and skills.

Open Source and Hosted Access

Note: This repo contains the Woz agent only. The companion platform — environment variable management, coding agent template installation, Slack bridge, and the web UI — will be open-sourced separately soon.

We also operate a hosted version of Woz here. We're prioritizing users who would like to build evals for their codebase and create auto-improving background coding agents.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
skills		skills
src		src
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Woz: How We Harness Background Coding Agents

What Woz Is

Why We Built It

How We Use Woz

Core Flows

1. Run one coding agent

2. Run multiple coding agents / rollouts

3. Analyze outputs

4. Update coding-agent harness

Quickstart

Prerequisites

Install dependencies

Deploy Woz

Env Vars

Repo Layout

Open Source and Hosted Access

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Woz: How We Harness Background Coding Agents

What Woz Is

Why We Built It

How We Use Woz

Core Flows

1. Run one coding agent

2. Run multiple coding agents / rollouts

3. Analyze outputs

4. Update coding-agent harness

Quickstart

Prerequisites

Install dependencies

Deploy Woz

Env Vars

Repo Layout

Open Source and Hosted Access

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages