Tags · caiopizzol/comment-bench

v0.3.0

feat: add claude.md and agents.md as documentation

May 4, 2026
98865f2
zip
tar.gz
Notes

v0.2.0

feat: add comment-audit skill for agent-driven comment audits

May 4, 2026
ae508d8
zip
tar.gz
Notes

v0.1.0

feat: cut first release

May 3, 2026
d9ed513
zip
tar.gz
Notes

v0.0.0

feat: initial public release

A benchmark for whether inline comments steer AI coding agents on small
surgical edits. Four scenarios in the gift-card refund domain test where
the protected invariant (24h cap) lives: branch, accumulator, helper, or
comment-only. Seven comment treatments per scenario.

Deliverable is comment-policy.md. RESULTS.md describes the result patterns
the benchmark probes for.

May 3, 2026
8deeb0d
zip
tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.3.0

v0.2.0

v0.1.0

v0.0.0

Tags: caiopizzol/comment-bench

v0.3.0

v0.2.0

v0.1.0

v0.0.0