Contract System v2: core contracts, enforcement, and CI gates by robertguss · Pull Request #1 · robertguss/programming_tiger_style

robertguss · 2026-02-08T03:03:06Z

Objective

Problem solved: establish an executable Contract System v2 for AI-assisted coding.
Intended outcome: move from policy-only docs to enforceable contracts, templates, checklists,
and CI gates.

Risk Tier

Tier: 1
Rationale: repository-level process/documentation and automation changes; no runtime product code.

Scope

Files/components changed: core contracts, templates, checklists, enforcement scripts, PR template,
CI workflow, and integration updates to existing language contracts/README.
Explicit exclusions: no application runtime implementation, no external service integrations.

Red

Failing test(s): initial validation requirement for TDD sequence and evidence sections.
Command(s): scripts/validate_tdd_cycle.sh --base main and
scripts/validate_evidence_packet.sh --pr-body .github/pull_request_template.md.
Expected failure summary: before implementation, v2 files and enforcement paths were absent.

Green

Minimal implementation summary: added required v2 contracts/templates/checklists and enforcement
scripts/workflow to satisfy validation goals.
Command(s): scripts/validate_tdd_cycle.sh --base main and
scripts/validate_evidence_packet.sh --pr-body .github/pull_request_template.md.
Passing summary: both validations pass on this branch.

Refactor

Structural improvements: integrated v2 into root README and language contracts; added
CONTRACT_SYSTEM_V2.md to clarify system intent and adoption.
Why behavior is unchanged: changes are governance/process artifacts, not product runtime logic.
Confirmation commands: bash -n scripts/validate_tdd_cycle.sh and
bash -n scripts/validate_evidence_packet.sh.

Invariants

Red -> Green -> Refactor sequencing is required and now machine-checked.
Evidence packet section completeness is required and now machine-checked.
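
A minimal sketch of what "machine-checked" sequencing could look like, assuming commit subjects carry the `RED:`, `GREEN:`, and `REFACTOR:` prefixes used by the commits in this PR. The function name `has_rgr_sequence` is hypothetical, and this is not the contents of `scripts/validate_tdd_cycle.sh`, whose actual logic is not shown here.

```bash
#!/usr/bin/env bash
# Hypothetical sketch; not the real scripts/validate_tdd_cycle.sh.
# Reads commit subjects (oldest first) on stdin and succeeds only if a
# RED: commit is later followed by a GREEN: commit and then a REFACTOR: commit.
# Other prefixes (DOCS:, CHORE:, ...) are ignored rather than rejected.
has_rgr_sequence() {
  local stage=0 subject
  while IFS= read -r subject; do
    case "$stage:$subject" in
      0:RED:*)      stage=1 ;;  # first failing-test commit seen
      1:GREEN:*)    stage=2 ;;  # implementation commit after RED
      2:REFACTOR:*) stage=3 ;;  # cleanup commit after GREEN
    esac
  done
  [ "$stage" -eq 3 ]
}
```

One way to feed it would be `git log --reverse --format=%s main..HEAD | has_rgr_sequence`, which lists subjects oldest first for the commits not yet on `main`.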

Security Impact

Threats considered: supply-chain/process drift and unverified AI-generated changes.
Mitigations: explicit dependency policy, security contract, adversarial review checklist,
mandatory evidence sections.
Residual risk: enforcement quality depends on PR body quality and reviewer discipline.

Performance Impact

Baseline measurement: no existing automation for contract enforcement.
Post-change measurement: adds lightweight shell/markdown checks to CI.
Delta explanation: negligible runtime impact; modest CI runtime increase.

Assumptions

PRs will use the provided template or include equivalent required headings.
Team will keep commit prefix discipline for non-doc changes.

Open Questions

Should the TDD validator exempt additional file types or paths?
Should risk-tier declaration be auto-validated in CI in a later iteration?

Rollback Plan

Trigger conditions: unacceptable workflow friction or false positives in contract gates.
Rollback steps: revert this PR or selectively disable workflow/script gates while keeping docs.

Validation Commands

bash -n scripts/validate_tdd_cycle.sh
bash -n scripts/validate_evidence_packet.sh
scripts/validate_tdd_cycle.sh --base main
scripts/validate_evidence_packet.sh --pr-body .github/pull_request_template.md

Summary by CodeRabbit

Documentation
- Introduced Contract System v2 framework and many new governance contracts (architecture, security, performance, TDD, review, risk tiers, evidence requirements)
- Added templates and checklists for task packets, evidence packets, ADRs, test plans, session handoffs, and adversarial/PR reviews
- Expanded README and resources with language-specific contract guidance
Chores
- Added project ignore patterns and a structured pull-request template
Tests
- Added CI validation and local scripts to enforce TDD commit sequencing and evidence packet validation

coderabbitai · 2026-02-08T03:03:20Z

Caution

Review failed

The pull request is closed.

📝 Walkthrough

Walkthrough

Adds Contract System v2: a governance suite of core contracts, language integration notes, templates, checklists, CI workflow, and validation scripts to enforce TDD (RED→GREEN→REFACTOR) and evidence-packet requirements across PRs and pushes.

Changes

Cohort / File(s)	Summary
Core Contracts & Framework `contracts/core/AI_AGENT_CORE_CONTRACT.md`, `contracts/core/ARCHITECTURE_CONTRACT.md`, `contracts/core/DEPENDENCY_POLICY.md`, `contracts/core/EVIDENCE_REQUIREMENTS.md`, `contracts/core/INTERACTION_CONTRACT_FOR_CODEX.md`, `contracts/core/PERFORMANCE_CONTRACT.md`, `contracts/core/REVIEW_CONTRACT.md`, `contracts/core/RISK_TIER_POLICY.md`, `contracts/core/SECURITY_CONTRACT.md`, `contracts/core/TDD_ENFORCEMENT_CONTRACT.md`, `contracts/core/README.md`	Adds 10 core contract documents and an overview README defining mandatory agent behaviors, architecture rules, dependency and evidence policies, risk tiers, security/performance contracts, review model, and TDD enforcement.
Governance & System Docs `CONTRACT_SYSTEM_V2.md`, `README.md`	Introduces Contract System v2 concept, four-layer model, adoption guidance, and updates README structure and file mappings.
Checklists `checklists/ADVERSARIAL_REVIEW_CHECKLIST.md`, `checklists/PR_CONTRACT_CHECKLIST.md`	Adds adversarial review and PR contract checklists to guide reviewers and enforce evidence/validation requirements.
Delivery Templates `templates/TASK_PACKET_TEMPLATE.md`, `templates/EVIDENCE_PACKET_TEMPLATE.md`, `templates/TEST_PLAN_TEMPLATE.md`, `templates/ADR_TEMPLATE.md`, `templates/SESSION_HANDOFF_TEMPLATE.md`	Adds templates for task packets, evidence packets, test plans, ADRs, and session handoffs to standardize specification and validation artifacts.
Validation Scripts `scripts/validate_tdd_cycle.sh`, `scripts/validate_evidence_packet.sh`	Adds bash scripts: one enforces TDD commit-prefix sequence across a commit range; the other validates evidence packet presence and required headings (supports PR body or file).
GitHub Integration `.github/pull_request_template.md`, `.github/workflows/contract-gates.yml`	Adds PR template and a workflow that runs on PRs and pushes to validate TDD cycles and evidence packets, and to lint/prepare scripts.
Language Contracts Integration `contracts/languages/PYTHON_CODING_CONTRACT.md`, `contracts/languages/RUST_CODING_CONTRACT.md`, `contracts/languages/TYPESCRIPT_CODING_CONTRACT.md`	Adds "Contract Integration" subsections requiring core contracts to apply and specifying stricter-rule resolution with documented rationale.
Evidence & Supporting Files `.evidence/README.md`, `templates/EVIDENCE_PACKET_TEMPLATE.md`, `.gitignore`, `resources/RESOURCES.md`	Adds evidence directory README, evidence packet template, basic .gitignore entries, and curated resource list.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant Author as PR Author
  participant GitHub as GitHub Actions
  participant Repo as Repository (scripts, templates, contracts)
  participant Validator as Validation Scripts
  participant Reviewer as Human Reviewer

  Author->>GitHub: Open PR / Push to main
  GitHub->>Repo: Checkout PR head (or branch)
  GitHub->>Repo: Ensure scripts executable & lint
  GitHub->>Validator: run validate_tdd_cycle.sh --base BASE_SHA
  Validator-->>GitHub: TDD validation result
  alt event is pull_request
    GitHub->>Validator: run validate_evidence_packet.sh --pr-body PR_BODY
  else push
    GitHub->>Validator: if .evidence/EVIDENCE_PACKET.md present run validate_evidence_packet.sh --file path
  end
  Validator-->>GitHub: evidence validation result
  GitHub->>Author: report CI status (pass/fail)
  Author->>Reviewer: request review (if CI pass)
  Reviewer->>Repo: review contracts/templates and evidence

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

Contract System v2: core contracts, enforcement, and CI gates #1 — Overlapping changes that implement the same Contract System v2 artifacts (workflow, validation scripts, .evidence README, evidence/template files, and core contracts).

Poem

🐰 I nibbled on rules and stitched them with care,
Red, Green, Refactor — carrots lined in a row,
Evidence packets bundled, tidy and fair,
CI watches closely as the governance grows,
Hooray for tidy commits and reviewers in tow! 🥕✨

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main changes: introducing Contract System v2, its core contracts, enforcement mechanisms (scripts), and CI automation (contract-gates workflow).


chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8c2383d4da

.github/workflows/contract-gates.yml

coderabbitai

Actionable comments posted: 4

🤖 Fix all issues with AI agents

In @.github/workflows/contract-gates.yml:
- Around line 30-42: Replace the HEAD~1 logic with the push event's before SHA
so the full pushed commit range is validated: when github.event_name is
"pull_request" keep BASE_SHA="$(jq -r '.pull_request.base.sha'
"$GITHUB_EVENT_PATH")", when it's "push" set BASE_SHA="$(jq -r '.before'
"$GITHUB_EVENT_PATH")" and if that value is empty or all-zero fallback to the
repository root (use git rev-list --max-parents=0 HEAD | tail -n 1) before
invoking scripts/validate_tdd_cycle.sh; keep the existing fallback used for
other events. Reference BASE_SHA, GITHUB_EVENT_PATH and
scripts/validate_tdd_cycle.sh when making the change.
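
The base-SHA selection described above can be sketched roughly as follows. The helper names `is_usable_sha` and `resolve_base_sha` are hypothetical, and falling back to the root commit for events other than `pull_request` and `push` is an assumption made for this sketch; the workflow's existing fallback for other events is not shown in this review.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of the suggested change; not the actual workflow step.

# True when the value looks like a real commit id, i.e. it is not empty,
# not jq's literal "null", and not the all-zero SHA.
is_usable_sha() {
  case "$1" in
    ''|null) return 1 ;;
    *[!0]*)  return 0 ;;  # contains at least one non-zero character
    *)       return 1 ;;  # only zeros
  esac
}

# resolve_base_sha EVENT_NAME EVENT_JSON_PATH
# Prints the commit to use as the start of the validated range.
resolve_base_sha() {
  local event_name="$1" event_path="$2" base=""
  case "$event_name" in
    pull_request) base="$(jq -r '.pull_request.base.sha' "$event_path")" ;;
    push)         base="$(jq -r '.before' "$event_path")" ;;
  esac
  if ! is_usable_sha "$base"; then
    # Assumption: fall back to the repository root commit.
    base="$(git rev-list --max-parents=0 HEAD | tail -n 1)"
  fi
  printf '%s\n' "$base"
}
```

In the workflow step this might be used as `BASE_SHA="$(resolve_base_sha "$GITHUB_EVENT_NAME" "$GITHUB_EVENT_PATH")"` followed by `scripts/validate_tdd_cycle.sh --base "$BASE_SHA"`.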

In `@contracts/core/RISK_TIER_POLICY.md`:
- Around line 18-29: Update the "Evidence packet" row in the RISK_TIER_POLICY.md
controls table so Tier 0 reads "Required" (not "Optional") to match the Auditing
section and CI validation; locate the "Evidence packet" table row (and any
duplicate instances around the other table at lines ~51-53) and change the Tier
0 cell to "Required" so policy and enforcement are consistent.

In `@contracts/core/TDD_ENFORCEMENT_CONTRACT.md`:
- Around line 84-86: The "## Exceptions" heading conflicts with the sentence "No
TDD exception is allowed without explicit approval and documented risk
controls"; update the section to remove ambiguity by either deleting the "##
Exceptions" heading and statement if exceptions are absolutely forbidden, or
expand the section to define the exception process: replace the single sentence
with clear criteria and steps for requesting an exception, define what "explicit
approval" means (e.g., required approvers by role such as Tech
Lead/Architect/Compliance, how to record sign-off), list required documented
risk controls and where to store them, and include any timeboxed review/renewal
conditions so the behavior of the functions/phrases "## Exceptions" and "No TDD
exception is allowed without explicit approval and documented risk controls" are
consistent.

In `@scripts/validate_evidence_packet.sh`:
- Around line 54-94: Normalize CRLF by creating a temporary LF-only copy of the
input and use that copy for all validations: read "$INPUT_PATH", strip CR
characters (e.g., via sed -e 's/\r$//' or tr -d '\r') into a temp file (e.g.,
normalized_input), run the REQUIRED_HEADINGS loop and the grep placeholder check
against the normalized file instead of "$INPUT_PATH", and ensure the temp file
is removed on exit (use a trap to cleanup); update references to INPUT_PATH in
the heading check and placeholder check to point to the normalized file while
keeping the existing variables REQUIRED_HEADINGS, missing, and the grep patterns
unchanged.
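
A minimal sketch of the CRLF normalization suggested above, assuming headings are matched as whole lines. The function name `check_required_headings` and its argument layout are hypothetical; the real script's `REQUIRED_HEADINGS` list and placeholder check are not reproduced here.

```bash
#!/usr/bin/env bash
# Hypothetical sketch; not the real scripts/validate_evidence_packet.sh.
# Usage: check_required_headings INPUT_PATH HEADING...
check_required_headings() {
  local input_path="$1"; shift
  local normalized missing=0 heading
  normalized="$(mktemp)" || return 2
  # Strip carriage returns so CRLF input compares equal to LF-only headings.
  tr -d '\r' < "$input_path" > "$normalized"
  for heading in "$@"; do
    # -F fixed string, -x whole-line match, -q quiet.
    if ! grep -Fxq -- "$heading" "$normalized"; then
      echo "missing heading: $heading" >&2
      missing=1
    fi
  done
  rm -f "$normalized"
  return "$missing"
}
```

The sketch removes the temporary file explicitly at the end of the function. A script-level `trap 'rm -f "$normalized"' EXIT`, as the comment above suggests, would also cover abnormal exits.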

🧹 Nitpick comments (12)

RESOURCES.md (1)
7-8: Clarify file locations for foundational references.

The references to TIGER_STYLE.md and P10.pdf lack location information. Consider adding:

Repository paths if these files exist in the repo

External URLs if they're hosted elsewhere

A note indicating where to obtain them if they're proprietary/restricted

This will help users quickly access these foundational documents.
📝 Example of clearer reference format

 ## Foundations
 
-1. `TIGER_STYLE.md`
-2. `P10.pdf` (NASA/JPL Power of Ten)
+1. `TIGER_STYLE.md` (located at `/docs/TIGER_STYLE.md`)
+2. `P10.pdf` (NASA/JPL Power of Ten) - [available at NASA website](https://example.com) or `/docs/P10.pdf`

templates/TASK_PACKET_TEMPLATE.md (1)

54-54: Optional: Consider hyphenation for compound modifier.

Static analysis suggests hyphenating "Data handling" to "Data-handling" when used as a compound modifier before "constraints." However, "data handling" as two words is also commonly accepted. This is a minor style preference.

Optional refinement

-- Data handling constraints.
+- Data-handling constraints.

contracts/core/EVIDENCE_REQUIREMENTS.md (2)

11-14: Consider clarifying when to use repository files vs PR body.

The guidance states evidence can be in either PR description or a repository file "for example .evidence/EVIDENCE_PACKET.md". Consider adding criteria for when to use each approach (e.g., "Use repository files for complex multi-commit work or when evidence exceeds GitHub's character limits").

18-32: Consider flexibility for Tier 0 changes.

The contract mandates 13 sections for "every evidence packet" without mentioning exemptions or simplified paths. If Tier 0 changes (docs-only, typo fixes) exist, they may not need full evidence packets. Consider adding a sentence acknowledging exemptions or referencing RISK_TIER_POLICY.md for scoping requirements by tier.

CONTRACT_SYSTEM_V2.md (2)

9-14: Add cross-references to core contracts for key concepts.

Lines 11, 12, 13, and 14 mention "Risk-tier policy", "Evidence packet requirements", "Adversarial review model", and "CI automation" without linking to the detailed contracts. Consider adding inline references like:

Risk-tier policy → contracts/core/RISK_TIER_POLICY.md

Evidence packet requirements → contracts/core/EVIDENCE_REQUIREMENTS.md

Adversarial review model → checklists/ADVERSARIAL_REVIEW_CHECKLIST.md

This improves discoverability for readers encountering these concepts for the first time.

23-28: Clarify adoption sequence timeline and current state.

The "Adoption Sequence" describes a phased rollout but doesn't indicate:

Whether these phases are sequential or can overlap

What the current adoption state is (e.g., "Phase 1 complete, Phase 2 in progress")

Who decides when to move between phases

Consider adding a status indicator or timeline, or explicitly stating this is a forward-looking roadmap.

contracts/core/TDD_ENFORCEMENT_CONTRACT.md (1)

25-32: Clarify how supporting commit prefixes interact with mandatory R-G-R sequence.

Lines 25-30 allow DOCS:, CHORE:, BUILD:, TEST: commits as "supporting commits", and line 32 requires "at least one complete Red -> Green -> Refactor sequence in each change set". Consider clarifying:

Can supporting commits appear before the first RED: commit?

Can supporting commits appear between R-G-R stages?

Do supporting commits reset the R-G-R sequence requirement?

Example guidance: "Supporting commits may precede or follow R-G-R sequences but must not interrupt an incomplete sequence."

contracts/core/SECURITY_CONTRACT.md (2)

9-9: Provide guidance on threat modeling format and depth.

Line 9 requires "Threat model required for Tier 2/Tier 3 changes" but doesn't specify format, depth, or tooling. Consider adding:

Reference to a threat modeling template or framework (STRIDE, PASTA, etc.)

Minimum required artifacts (attack trees, trust boundaries, threat list)

Whether informal description is acceptable or formal model required

This would help ensure consistent threat model quality across PRs.

21-26: Consider defining "sensitive data" or providing classification guidance.

Lines 21-26 require classifying and protecting "sensitive data" but don't define what qualifies. Consider adding:

Examples (PII, credentials, financial data, health records)

Reference to a data classification policy

Inline note like "See RISK_TIER_POLICY.md for data sensitivity criteria"

This reduces ambiguity in applying the data protection requirements.

PYTHON_CODING_CONTRACT.md (1)

14-18: Consider clarifying "stricter rule" conflict resolution.

Line 18 states "If rules conflict, apply the stricter rule and document rationale in evidence." This is pragmatic but potentially ambiguous when:

Both rules are equally strict but contradictory (e.g., "always use X" vs "never use X")

"Stricter" is subjective (performance vs safety tradeoffs)

Consider adding guidance like: "When strictness is unclear, escalate to reviewer/architect and document the resolution rationale."

.github/pull_request_template.md (1)

1-79: Consider adding inline guidance or examples for each section.

The template provides structural placeholders but minimal guidance on what to write in each section. For first-time users, consider:

Adding brief inline comments with examples (e.g., under ## Red: )

Linking to a filled example evidence packet in documentation

Adding a "How to use this template" comment block at the top

This would improve adoption and reduce PR review cycles for guidance clarification.

contracts/core/AI_AGENT_CORE_CONTRACT.md (1)

91-91: Consider more concise phrasing.

The phrase "all of the following" could be shortened to "all the following" for brevity.

✏️ Optional style improvement

-Exceptions require all of the following:
+Exceptions require all the following:

.github/workflows/contract-gates.yml

contracts/core/RISK_TIER_POLICY.md

contracts/core/TDD_ENFORCEMENT_CONTRACT.md

scripts/validate_evidence_packet.sh

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@scripts/validate_tdd_cycle.sh`:
- Around line 65-70: The case pattern list inside the while loop currently has
the broad "*.md" pattern before the specific "README.md" pattern, causing
README.md to never hit its explicit branch and triggering shellcheck
SC2221/SC2222; edit the case in scripts/validate_tdd_cycle.sh (the block that
checks variable DOC_ONLY and iterates with while IFS= read -r path; do ... case
"$path" in) and reorder the patterns so specific filenames like README.md appear
before the wildcard "*.md" (move "*.md" to the end of that pattern list) to
ensure README.md matches its explicit pattern and eliminate the ShellCheck
warnings.
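
The reordering can be illustrated with a small sketch, assuming the loop reads one changed path per line as the comment describes. The function name `is_doc_only_change` is hypothetical, and the pattern list is reduced to the two patterns named in the comment; the script's full list is not shown here.

```bash
#!/usr/bin/env bash
# Hypothetical sketch; not the real case block in scripts/validate_tdd_cycle.sh.
# Succeeds when every path read from stdin is a documentation file.
is_doc_only_change() {
  local path
  while IFS= read -r path; do
    case "$path" in
      README.md|*.md) ;;  # specific filename first, broad wildcard last
      *) return 1 ;;      # any other path means the change is not doc-only
    esac
  done
  return 0
}
```

Within a single `|` list both patterns lead to the same branch, so the reordering mainly records intent and silences the ShellCheck warnings rather than changing which paths match.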

🧹 Nitpick comments (1)

README.md (1)
75-78: Add the Contract System v2 overview to References for discoverability.

Since CONTRACT_SYSTEM_V2.md is the conceptual entry point, listing it alongside references will help readers find it quickly.
📝 Suggested addition
 ### References
 
 - `resources/RESOURCES.md`
+ - `CONTRACT_SYSTEM_V2.md`

scripts/validate_tdd_cycle.sh

robertguss · 2026-02-08T03:17:56Z

Addressed requested review items (recommended threads 2-6) in commit e835755:

1. Updated push-range TDD validation to use push event before SHA with root fallback and base-commit existence fallback.
2. Aligned risk-tier policy by making Tier 0 evidence packet required.
3. Clarified TDD exceptions with explicit emergency-only criteria and approval requirements.
4. Added CRLF normalization in evidence validator to prevent false missing-heading failures.

I resolved threads 2-6. Thread 1 was already addressed earlier by commit 71fdb4b (PR head checkout + merge-commit handling).

robertguss · 2026-02-08T03:18:50Z

Follow-up: addressed and pushed additional review fixes.

New commits:
- e835755: workflow base-SHA range fix, policy alignment, TDD exceptions clarification, CRLF normalization
- 82e848c: shellcheck pattern-order cleanup in validate_tdd_cycle.sh

Resolved all outstanding review threads on the PR.

robertguss added 3 commits February 7, 2026 22:02

RED: add core contract framework and templates

ebc5a49

GREEN: add automation and CI contract gates

7f735d7

REFACTOR: integrate v2 across language contracts and docs

8c2383d

chatgpt-codex-connector bot reviewed Feb 8, 2026

.github/workflows/contract-gates.yml (outdated)

.github/workflows/contract-gates.yml (outdated)

coderabbitai bot reviewed Feb 8, 2026

.github/workflows/contract-gates.yml

contracts/core/RISK_TIER_POLICY.md

contracts/core/TDD_ENFORCEMENT_CONTRACT.md (outdated)

scripts/validate_evidence_packet.sh

robertguss added 2 commits February 7, 2026 22:10

DOCS: update references for contracts/resources reorganization

4253fe2

CHORE: avoid merge-commit false positives in TDD gate

71fdb4b

coderabbitai bot reviewed Feb 8, 2026

scripts/validate_tdd_cycle.sh

CHORE: address PR feedback on gating and contract consistency

e835755

CHORE: apply remaining review fixes for contract gates

82e848c

robertguss merged commit 819997a into main Feb 8, 2026
1 of 2 checks passed

This was referenced Feb 9, 2026

Tiger Style v1.1 hardening: align policy, enforcement, and adoption truth #2

Merged

Add tiger-style CLI v1 with install/configure/doctor/bootstrap #3

Merged

Docs IA reorg + mdBook setup + changelog migration #4

Merged
