Cupcake

Cupcake logo

Cupcake

Make AI agents follow the rules.

Policy enforcement layer for AI agents; yielding better performance, reliability, and security without consuming model context.

Deterministic rule‑following for your agents.
Boost performance by moving rules out of context and into guarantees.
Trigger alerts when agents repeatedly violate rules.

Cupcake intercepts agent tool calls and evaluates them against user-defined rules written in Open Policy Agent (OPA) Rego. Agent actions can be blocked, or auto-corrected. Additional benefits include reactive automation for tasks you dont need to rely on the agent to conduct (like linting after a file edit).

Cupcake is developed by EQTYLab, with agentic safety research support by Trail of Bits.

Supported Agent Harnesses

Cupcake provides native integrations for multiple AI coding agents:

Harness	Status	Integration Guide
Claude Code	✅ Fully Supported	Setup Guide
Cursor	✅ Fully Supported	Setup Guide
Gemini CLI	Coming soon	Awaiting PR

Cupcake provides native integrations for multiple web agent frameworks:

Harness	Status	Integration Guide
LangChain	Coming soon	Coming soon
Google ADK	Coming soon	Coming soon
NVIDIA NAT	Coming soon	Coming soon

Each harness uses native event formats—no normalization layer. Policies are physically separated by harness (policies/claude/, policies/cursor/) to ensure clarity and full access to harness-specific capabilities.

See also: Harness Comparison Matrix · Harness Architecture

Why Cupcake?

Modern agents are powerful but inconsistent at following operational and security rules, especially as context grows. Cupcake turns the rules you already maintain (e.g., CLAUDE.md, AGENT.md, .cursor/rules) into enforceable guardrails that run before actions execute.

Multi-harness support with first‑class integrations for Claude Code and Cursor.
Governance‑as‑code using OPA/Rego compiled to WebAssembly for fast, sandboxed evaluation.
Enterprise‑ready controls: allow/deny/review, audit trails, and proactive warnings.

How it Works

Cupcake integrates with AI coding agents like Claude Code and Cursor through lightweight hooks that monitor operations such as shell commands, file edits, and tool calls. Policies are compiled to WebAssembly (Wasm) for fast, sandboxed evaluation.

Cupcake sits in the agent hook path. When an agent proposes an action (e.g., run a shell command, edit a file, call a tool), the details are sent to Cupcake. Cupcake evaluates your policies and returns a decision in milliseconds:

Allow · Block · Warn (and optionally Require Review)

Agent → (proposed action) → Cupcake → (policy decision) → Agent runtime

Core Capabilities

Block specific tool calls Prevent use of particular tools or arguments based on policy.
Behavioral guidance Inject lightweight, contextful reminders back to the agent (e.g., "run tests before pushing").
MCP support Govern Model Context Protocol tools (e.g., mcp__memory__*, mcp__github__*).
LLM‑as‑Judge Chain in a review step by another model/agent when policies say a human‑style check is needed.
Guardrail libraries First‑class integrations with NeMo and Invariant for content and safety checks.
Signals (real‑time context) Pull facts from the environment (current Git branch, changed files, deployment target, etc.) and make policy decisions on them.

Influencing Agent Behavior

Cupcake policies can influence agents in two primary ways:

Feedback (when blocking): When a policy blocks an action, you can provide explanatory messages that help the agent understand what went wrong and how to fix it. For Cursor, policies can provide separate messages for users (reason) and agents (agent_context) to optimize both experiences.
Context injection (when allowing): Claude Code supports injecting additional context alongside allowed actions (e.g., "Remember: you're on the main branch"). This helps guide agent behavior without blocking. Note: Cursor does not support context injection.

See Writing Policies for details on using these capabilities.

Architecture

Policies: OPA/Rego compiled to WASM and executed in a sandbox.
Signals: Extensible providers (Git, CI, DB metadata, feature flags) available to policy via input.signals.
Decisions: allow | block | warn | require_review plus a human‑readable message.
Observability: Structured logs and optional evaluation traces for debugging.

// Example input passed to policy evaluation
{
  "kind": "shell", // action type (shell, fs_read, fs_write, mcp_call, ...)
  "command": "git push",
  "args": [],
  "signals": { "tests_passed": false, "git_branch": "feature/x" },
  "actor": { "id": "agent-1", "session": "abc123" }
}

See the Architecture Reference for a comprehensive technical overview.

Security Model

Sandboxed evaluation of untrusted inputs.
Allow‑by‑default or deny‑by‑default modes configurable per project.
No secret ingestion by default; policies can only read what signals expose.
Auditability through logs and optional review workflows.

See the full Security Model.

FAQ

Does Cupcake consume prompt/context tokens? No. Policies run outside the model and return structured decisions.

Is Cupcake tied to a specific model? No. Cupcake supports multiple AI coding agents with harness-specific integrations.

How fast is evaluation? Sub‑millisecond for cached policies in typical setups.

License

MIT

Citation

If you use Cupcake in your research or project, please cite it as follows:

@software{Cupcake2025,
  author = {Ramos, Michael and EQTYLab},
  title = {{Cupcake: Policy enforcement for AI agents}},
  year = {2025},
  publisher = {EQTYLab},
  url = {[https://github.com/eqtylab/cupcake](https://github.com/eqtylab/cupcake)}
}

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.cargo		.cargo
.github		.github
cupcake-cli		cupcake-cli
cupcake-core		cupcake-core
cupcake-py		cupcake-py
docs		docs
examples		examples
fixtures		fixtures
scripts		scripts
.gitignore		.gitignore
CITATION.cff		CITATION.cff
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
justfile		justfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cupcake

Supported Agent Harnesses

Why Cupcake?

How it Works

Core Capabilities

Influencing Agent Behavior

Architecture

Security Model

FAQ

License

Citation

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

License

eqtylab/cupcake

Folders and files

Latest commit

History

Repository files navigation

Cupcake

Supported Agent Harnesses

Why Cupcake?

How it Works

Core Capabilities

Influencing Agent Behavior

Architecture

Security Model

FAQ

License

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages