Skip to content

Tags: caiopizzol/comment-bench

Tags

v0.3.0

Toggle v0.3.0's commit message
feat: add claude.md and agents.md as documentation

v0.2.0

Toggle v0.2.0's commit message
feat: add comment-audit skill for agent-driven comment audits

v0.1.0

Toggle v0.1.0's commit message
feat: cut first release

v0.0.0

Toggle v0.0.0's commit message
feat: initial public release

A benchmark for whether inline comments steer AI coding agents on small
surgical edits. Four scenarios in the gift-card refund domain test where
the protected invariant (24h cap) lives: branch, accumulator, helper, or
comment-only. Seven comment treatments per scenario.

Deliverable is comment-policy.md. RESULTS.md describes the result patterns
the benchmark probes for.