AATP — Auditable Agent Transaction Protocol

An open standard for recording and verifying AI agent decision-making.

AI agents are sending emails, scheduling meetings, negotiating contracts, managing subscriptions, making purchases, and moving money on behalf of their owners — but none of them record why they made a particular decision. When your AI agent declines a meeting, accepts a contract term, or spends $65 on API credits, the current infrastructure can verify what happened but not why — and not whether you would have made the same choice.

Any decision an AI agent makes that affects the outside world — not just financial transactions — deserves the same accountability that human decisions receive. AATP provides that infrastructure. Economic decisions are the natural starting point because they are the easiest to quantify and audit, but the protocol's design applies wherever an AI agent exercises judgment on behalf of a human principal.

The philosophy behind AATP: No set of rules can anticipate every situation an autonomous agent will face. If rules could be 100% complete, you wouldn't need an AI agent — a simple program would do. The whole point of deploying an agent is that it can exercise judgment where rules don't reach. Principals care about outcomes, not rule-following for its own sake. AATP is built on this recognition: give agents hard constraints where they matter, let them choose how to achieve reasonable outcomes in the gaps, and make every choice auditable.

The key insight: don't verify intent — verify consistency. Can we trust what an AI says about its own decisions? Not fully. But we don't need to. AATP applies the same standard human auditors have always used: you can't read a CEO's mind, but you can check whether their stated justification matches the financial data. AATP requires agents to produce a statement of record at each decision point, then enables independent auditors to check that record against what actually happened. When the record says one thing and reality shows another, the discrepancy itself is the finding.

Reasonableness is not binary — it emerges from independent review. Consistency checking catches discrepancies, but the harder question is: was the decision reasonable? A single auditor can only render a verdict — reasonable or not — based on its own judgment. But when multiple independent auditors each evaluate the same record, their individual binary verdicts aggregate into something more powerful: a probability. If 9 out of 10 independent audit AIs find a decision reasonable, that 0.9 is a meaningful approximation of the decision's true reasonableness — not because any single auditor is infallible, but because independent assessments converge by the law of large numbers. In practice, no one will run hundreds of auditors on a $50 decision. But even 2–3 independent reviews carry statistical weight — a wide confidence interval is still informative. The protocol doesn't need perfect measurement; it needs measurement that is better than nothing, which is what exists today.

Why Now

The infrastructure for autonomous AI agents is being deployed: communication protocols (Google A2A, Anthropic MCP), payment protocols (Coinbase x402, Google AP2), identity systems (W3C DIDs, Microsoft Entra Agent ID). What's missing is the accountability layer — a standard way to record and verify the decisions agents make on our behalf.

Recent independent research confirms the urgency. Google DeepMind's Intelligent AI Delegation framework (February 2026) argues for cryptographic accountability chains in agent delegation. Anthropic's engineering team documents the compounding error problem in multi-agent systems and the need for observability without content monitoring. Multiple teams are converging on the same conclusion: agent capability has outpaced agent accountability.

AATP provides a concrete, implementable protocol — not just a theoretical framework.

How It Works

None of this works without a standard. If every agent records decisions in its own format — or doesn't record them at all — there is no unified input for auditors to evaluate, and no way to compare findings across agents, models, or transactions. This is the same reason GAAP exists: not to tell companies how to run their business, but to ensure that when an auditor opens the books, they know what they're looking at. AATP is that standard for AI agent decisions — a common recording format that makes independent, scalable, comparable audit possible.

AATP adds an audit layer alongside existing agent infrastructure. It does not replace any communication, payment, or identity protocol.

Two layers:

Working Channel — where agents negotiate and transact (via A2A, MCP, etc.)
Audit Trail — where decisions are recorded, sealed, and made reviewable

Eight decision points define when records are generated: Opening, Offer, Counter-Offer, Agreement/Rejection, Payment Sent, Payment Confirmed, Problem/Dispute, and Closing.

Three operating modes enable gradual adoption:

Mode	Participants	Value
Solo	1 agent, no counterparty	Internal oversight of autonomous decisions
Unilateral	1 AATP agent + 1 non-AATP agent	One-sided transaction audit
Bilateral	2 AATP agents	Full cross-referenced accountability

Solo mode means AATP is useful from day one — even a single personal AI managing subscriptions, filtering emails, or scheduling meetings benefits from auditable decision records.

Core Design

Every audit record contains:

Narrative — natural language explanation of what the agent decided and why (enables human judgment)
Structured data — machine-readable fields for automated verification
Cryptographic seal — SHA-256 hash + Ed25519 signature, chained to the previous record
Authorization reference — link to the human principal's delegation scope

Three levels of review:

Integrity (automated) — hash chain valid? signatures valid? timestamps monotonic?
Compliance (semi-automated) — within authorization scope? narrative consistent with structured data?
Reasonableness (human/AI judgment) — did the agent pursue outcomes the principal would consider reasonable? Each independent auditor renders a verdict on each decision; multiple independent verdicts converge on the decision's true reasonableness. The auditor compares recorded reasoning against actual outcomes — narrative vs. structured data, stated terms vs. on-chain payments, claimed market conditions vs. reality. Discrepancies are the findings; the distribution of independent findings is the measure.

Level 3 is AATP's most distinctive contribution. It addresses the boundary gap problem: when AI agents operate in spaces where no predefined rule applies, their judgment must still be auditable — not by verifying intent, but by verifying consistency between what the agent said and what it did.

Seven Invariants

These define what AATP is. Changing any invariant constitutes a new protocol, not an amendment.

Dual-Layer Architecture — working channel and audit trail are always separate
Narrative + Structured Data Duality — every record contains both; neither alone is sufficient
Sealed Hash Chain — records are immutable after creation and sequentially linked
Decision-Point Model — records at defined decision moments, not continuous streams
Three-Level Review Separation — integrity, compliance, and reasonableness remain distinct
Agent and Auditor Independence — the entity that creates records cannot review them
Human Principal Sovereignty — every audit trail traces back to a human principal

Non-Goals

AATP is a recording and verification protocol. It is not a behavior control system. Specifically:

AATP does not guarantee good decisions. It makes decisions reviewable, not optimal.
AATP does not prevent malicious agents. It creates consequences for inconsistency, not barriers to action.
AATP does not verify AI truthfulness. It verifies consistency between stated reasoning and observable outcomes.
AATP does not replace regulatory compliance. It provides evidentiary infrastructure that regulators may find useful, but it is not a compliance framework for any jurisdiction.
AATP does not enforce outcomes. Consequences for audit findings are determined by the human principal, not by the protocol.

Architecture

┌─────────────────────────────────────────────────────┐
│                    PRINCIPAL (Human Owner)            │
│         Sets authorization · Reads audit reports     │
└──────────────┬──────────────────────┬───────────────┘
               │                      │
               ▼                      ▼
┌──────────────────────┐  ┌──────────────────────────┐
│   MODULE 1: RECORDER │  │   MODULE 2: REVIEWER     │
│   (Accountant)       │  │   (Auditor)              │
│                      │  │                          │
│   • startSession()   │  │   • verifyChain()    L1  │
│   • recordDecision() │  │   • checkCompliance() L2 │
│   • endSession()     │  │   • getSessionForReview()│
│                      │  │   • submitReview()    L3 │
└──────────┬───────────┘  └──────────┬───────────────┘
           │     writes               │  reads
           ▼                          ▼
┌─────────────────────────────────────────────────────┐
│                 SEALED AUDIT TRAIL                    │
│   Record₁ ──hash──▶ Record₂ ──hash──▶ Record₃ ...   │
└─────────────────────────────────────────────────────┘

Detailed architecture diagrams are in diagrams/.

Project Status

Phase: v0.x — Founder Stewardship

Component	Status
Conceptual Framework (v0.44)	✅ Frozen
Governance Addendum (v0.21)	✅ Frozen
Architecture Diagrams	✅ Complete
Reference Implementation (Python SDK)	✅ v0.1.0
Solo Mode Demo	✅ Complete
Bilateral Mode Demo	✅ Complete
Tamper Detection Demo	✅ Complete
Authorization Violation Demo	✅ Complete
CLI Audit Trail Viewer	✅ Complete
107 Tests (6 test files)	✅ All Passing
Real LLM Agent Integration	🔨 Stage 3 — In Progress
Technical Specification	📝 Planned

SDK v0.1.0 key metrics: 10 source modules across 3 packages, 2 external dependencies (pydantic, cryptography), zero LLM dependencies in the core SDK.

See ROADMAP.md for the full development timeline.

Quick Start

git clone https://github.com/mrooxx/aatp-protocol.git
cd aatp-protocol
pip install -e ".[dev]"

# Run the Solo Mode demo — a personal finance agent with full audit trail
python examples/demo_solo.py

# Run the Bilateral Mode demo — two agents negotiating API credits
python examples/demo_bilateral.py

# See tamper detection in action
python examples/demo_tamper.py

# View an audit trail in human-readable format
python -m tools.aatp_cli view examples/output/trail.json

# Run all tests
pytest

Documentation

Conceptual Framework v0.44 — full rationale, design principles, and protocol logic (start here)
Governance Addendum v0.21 — versioning, invariant protection, transition plan
Development Roadmap — phased execution plan

Contributing

AATP is an open standard under active development. Contributions are welcome via GitHub Issues and Discussions.

During Phase I (Founder Stewardship, v0.x), the founding maintainer reviews all proposals. Phase II will transition to Working Group governance. See the Governance Addendum for details.

License

Documentation: CC BY 4.0 — use, share, adapt with attribution
Code: MIT — use freely

Author

Changxiao Huang (Norland) — Accountant and protocol designer.

AATP grows from the conviction that AI decisions made on behalf of humans deserve accountability — whether those decisions involve money, communication, scheduling, or any other domain where an agent acts in the world. Economic transactions are the starting point because they are easiest to quantify; the principle extends to every decision an agent makes that its principal should be able to review.

GitHub: @mrooxx

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
aatp_agent		aatp_agent
aatp_auditor		aatp_auditor
aatp_core		aatp_core
aatp_recorder		aatp_recorder
aatp_reviewer		aatp_reviewer
diagrams		diagrams
docs		docs
examples		examples
test		test
tools		tools
.gitignore		.gitignore
DOCS-LICENSE		DOCS-LICENSE
LICENSE-CODE		LICENSE-CODE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AATP — Auditable Agent Transaction Protocol

Why Now

How It Works

Core Design

Seven Invariants

Non-Goals

Architecture

Project Status

Quick Start

Documentation

Contributing

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AATP — Auditable Agent Transaction Protocol

Why Now

How It Works

Core Design

Seven Invariants

Non-Goals

Architecture

Project Status

Quick Start

Documentation

Contributing

License

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages