Agent Audit Kit

Exhibit-grade decision lineage for AI agent failures, designed for incident response, regulator review, insurer review, and board-level incident reporting.

AAK treats the trace as a legal exhibit, not a debugging tool. It produces evidence artifacts for post-mortem defensibility, not governance authority or compliance certification.

Overview

Agent Audit Kit (AAK) provides tamper-evident capture, replay verification, and exhibit-grade evidence-pack generation for AI agent failures. It captures what happened, what the system believed to be true at decision time, and what a reviewer needs to reconstruct the sequence later.

Lane B / Exploratory (Forensic)

Agent Audit Kit is intentionally non-authoritative. It captures what happened, not what is correct.

It is designed to support:

incident response
red team exercises
insurance and legal review
auditor inspection of agent behavior
board-level incident reporting

It does not determine truth, correctness, or admissibility.

For Whom / Not For

Agent Audit Kit produces evidence artifacts, not truth claims.

For:

Incident response and forensic review
Red team exercises and penetration testing
Underwriting evidence and insurance review
Legal discovery and audit trails
Board-level incident reporting on AI failures

Not for:

Compliance certification
Governance authority or trust assignment
Correctness proofs or formal verification
Automated decision-making

Non-Claims Discipline

IMPORTANT: This tool does NOT provide:

Compliance certification
Deterministic replay of hosted LLMs (they are non-deterministic)
Formal verification or proofs
Governance-grade authority (that's Lane A / MathLedger)

What it DOES provide:

Tamper-evident hash chains for event capture
Domain-separated SHA256 hashing (RFC 8785 canonical JSON)
OWASP Agentic (ASI) threat detection mappings
Portable evidence bundles with standalone verification
Decision-context capture for exhibit-grade lineage
Regulator-readable evidence-pack outputs with temporal narrative and counterfactual review prompts
Provenance labeling (captured vs synthetic)
Optional CPF-style psych context references (hash_ref default, evidence-only)
Optional external verifier evidence references (verifiers/, evidence-only)

Lane Boundary Rule (Hard)

Agent Audit Kit artifacts are evidence only.

They may be:

reviewed by humans
attached to audit reports
supplied to insurers, regulators, or courts

They may NOT be:

treated as verified claims
used to assign trust class
used to upgrade authority
substituted for governance-grade verification

Authority decisions belong to Lane A systems (e.g. MathLedger), not to this tool.

Quickstart

# Install dependencies
pip install -e ".[dev]"
# or with uv:
uv sync --dev

# Run tests
pytest

# Run a built-in banking decision-exposure workflow
python -m aak.cli capture run --config ./examples/banking_decision_exposure.yaml --out ./demo_output

# Verify the bundle
cd demo_output/bundle && python verify.py

# Generate deterministic evidence pack from a verified bundle
cd ../..
python -m aak.cli report generate --bundle ./demo_output/bundle --out ./demo_output/report

Architecture

┌─────────────────────────────────────────────────────────────────────┐
│                        AGENT RUNTIME                                │
│  ┌──────────────┐   ┌──────────────┐   ┌──────────────┐            │
│  │ LLM Client   │   │ Tool Router  │   │ RAG/Memory   │            │
│  └──────┬───────┘   └──────┬───────┘   └──────┬───────┘            │
│         └──────────────────┼──────────────────┘                     │
│                            ▼                                        │
│              ┌─────────────────────────────┐                        │
│              │   CONTEXT INTERCEPTOR SDK   │                        │
│              └─────────────┬───────────────┘                        │
└────────────────────────────┼────────────────────────────────────────┘
                             ▼
              ┌─────────────────────────────┐
              │      IMMUTABLE VAULT        │
              │  (append-only, hash-chained)│
              └─────────────┬───────────────┘
                            ▼
              ┌─────────────────────────────┐
              │     REPLAY / ANALYSIS       │
              │  (forensic or counterfactual)│
              └─────────────────────────────┘

DecisionContextSnapshot records perceived world state, accountability metadata, confidence signals, and reversibility at each decision boundary.

Note: Replay verifies capture integrity and event ordering. It does not imply deterministic re-execution of hosted LLMs.

Key Components

Canonicalization (`aak.canon`)

RFC 8785 compliant canonical JSON
Domain-separated SHA256 hashing
Identical code embedded in verify.py for consistency

Models (`aak.models`)

EventBase with source: captured | synthetic
Optional DecisionContextSnapshot on events for perceived world state, accountability, and reversibility
Optional psych_context on events for forensic cross-reference
IdentityContext for non-human actor tracking
ThreatFlag with OWASP ASI mappings (ASI01-ASI10)
Optional VerifierEvidence entries for external proof/verification artifacts

Vault (`aak.vault`)

VaultWriter: Append-only event storage with hash chaining
VaultReader: Verification and replay support
export_bundle: Portable bundles with embedded verify.py

Event Hashing Contract

The event_hash is computed as:

event_hash = domain_hash("event", canonical_event_payload)

Where:

canonical_event_payload is the RFC 8785 canonical JSON of the event
Envelope fields (seq, hash, prev_hash, payload_file) are NOT included
Domain prefix is aak:v0:event:

Threat Detection

Three threat classes mapped to OWASP Agentic Top 10:

Threat Class	OWASP ASI Tags
TOOL_MISUSE	ASI02, ASI03
RAG_POISONING	ASI01, ASI06
PRIVILEGE_MISUSE	ASI03

Development

# Lint
ruff check src/ tests/

# Type check
mypy src/

# Run specific test suites
pytest tests/unit -v
pytest tests/integration -v
pytest tests/acceptance -v

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
demo		demo
docs		docs
examples		examples
golden_runs		golden_runs
src/aak		src/aak
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
RUNBOOK.md		RUNBOOK.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Audit Kit

Overview

For Whom / Not For

Non-Claims Discipline

Lane Boundary Rule (Hard)

Quickstart

Architecture

Key Components

Canonicalization (`aak.canon`)

Models (`aak.models`)

Vault (`aak.vault`)

Event Hashing Contract

Threat Detection

Development

License

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Audit Kit

Overview

For Whom / Not For

Non-Claims Discipline

Lane Boundary Rule (Hard)

Quickstart

Architecture

Key Components

Canonicalization (aak.canon)

Models (aak.models)

Vault (aak.vault)

Event Hashing Contract

Threat Detection

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Canonicalization (`aak.canon`)

Models (`aak.models`)

Vault (`aak.vault`)

Packages