inspect_agents

Inspect‑AI–native agents with typed state, safe tools, and rich traces.

This library extends Inspect‑AI with higher‑level agent orchestration, typed state (todos and virtual files), and a small set of Inspect‑native tools. It is safe by default: optional standard tools (think, web_search, bash/python, web_browser, text_editor) are gated behind environment flags, and sandboxed filesystem operations are constrained.

Quickstart (self‑contained)

Uses local sources via PYTHONPATH and requires no provider setup. This validates the code path without contacting external services.

export PYTHONPATH=src:external/inspect_ai/src
python examples/inspect/quickstart_toy.py
# Expected: Completion: DONE

📖 Next: Examples — Start Here →

The examples directory contains comprehensive guides, CLI usage, and working demonstrations.

Docs Home →

Why Inspect Agents?

Setting up practical LLM agents is slow: you fight glue code, logging, state, and tool orchestration. Inspect Agents removes overhead with an Inspect‑AI‑native workflow: typed state (todos/files), built‑in tools, and rich transcripts/traces by default. You can run a toy agent offline in seconds (see Quickstart above).

Key Features

✅ Inspect-native tools: Todos + virtual filesystem
- Default: in-memory “store” (ephemeral; isolated per run)
- Optional: sandbox (routes through Inspect’s text_editor/bash_session; delete disabled)
- Sandbox quickstart: see docs/how-to/filesystem_sandbox_quickstart.md
- Advanced: set include_defaults=False on build_supervisor, build_iterative_agent, or YAML configs to opt out of auto-injected todos/files while keeping prompts aligned with your custom toolchain.
- Curated presets: call inspect_agents.tools.minimal_fs_preset() (Todos + FS) or inspect_agents.tools.full_safe_preset() (Todos + FS + env-gated standard tools) to rebuild safe bundles when you disable defaults.
✅ Optional standard tools (gated by env flags):
- INSPECT_ENABLE_THINK (default on), INSPECT_ENABLE_WEB_SEARCH (auto when provider keys set), INSPECT_ENABLE_EXEC, INSPECT_ENABLE_WEB_BROWSER, INSPECT_ENABLE_TEXT_EDITOR_TOOL (default off)
- The stateful bash_session tool is never exposed by this repo; it’s used internally for sandbox FS.
✅ Typed state: Pydantic models backed by Inspect Store for Todos and Files
✅ Sub‑agents: “handoff” (transfer to transfer_to_<name>) or “tool” (single‑shot)
✅ Traces & transcripts: Structured tool events and store change logs by default
✅ Safe by default: approval presets (dev/prod), quarantine filters, sandbox confinement/symlink denial
✅ Self‑contained toy example to verify setup without external model providers
✅ Advanced Examples: Including a complete vending machine business simulation (examples/vending_bench) with market dynamics, supplier relationships, and AI-driven decision making

Installation

Requirements

Python 3.11+ on macOS or Linux
Git (for development setup)

From PyPI (Recommended)

# Using pip
pip install inspect-agents

# Using uv (faster)
uv add inspect-agents

From Source (Development)

git clone https://github.com/cnm13ryan/inspect_agents.git
cd inspect_agents

# Using uv (recommended)
uv sync

# Using pip
python -m venv .venv && source .venv/bin/activate
pip install -e .

The self-contained quickstart above runs from source and requires no external dependencies.

Configure Environment Variables

If you use Inspect CLI or providers, see the Environment reference and the example configurator:

Environment reference: docs/reference/environment.md
Example configurator: env_templates/configure.py

For the self‑contained Quickstart above, no configuration is required.

Usage

Scaffold a new agent

Generate a minimal agent module in seconds.

# Creates src/<package>/<name>.py  (<package> is the dotted package path; <name> is normalized to snake_case)
python scripts/scaffold_agent.py my_helper --path . --package inspect_agents --no-test

Notes

Safe by default: refuses to overwrite existing files unless --force (or confirms y/N when interactive).
The template uses build_iterative_agent(code_only=True) so it runs without exec/search/browser tools.
Creates src/<package>/<name>.py (under src/<package>/; <name> is normalized to snake_case, e.g., "MyAgent" → my_agent.py).
By default a smoke test is also created at tests/<package>/test_<name>.py; in the example above we pass --no-test to skip it (omit --no-test to generate the test).

FS tools (store mode)

Short, deterministic example that writes, lists, and reads a file using the in‑memory store (no host filesystem writes).

import asyncio, os

# Ensure store mode (default), i.e., in‑memory virtual filesystem
os.environ["INSPECT_AGENTS_FS_MODE"] = "store"

from inspect_agents.tools import write_file, read_file, ls

async def main():
    w = write_file(); r = read_file(); l = ls()
    await w(file_path="demo.txt", content="Hello\nWorld")
    print(await l())                  # → ["demo.txt"]
    print(await r(file_path="demo.txt", offset=0, limit=10))
    # → numbered output:
    #     1\tHello
    #     2\tWorld

asyncio.run(main())

Note: In store mode these paths live in an in‑memory store tied to the current process context, not your disk. For sandbox mode, see the quickstart: docs/how-to/filesystem_sandbox_quickstart.md and the full guide: docs/how-to/filesystem.md.

CLI (Inspect)

When using Inspect CLI and tasks, see:

Evaluated examples: docs/cli/inspect_eval.md
Provider setup: docs/getting-started/inspect_agents_quickstart.md and docs/reference/environment.md

Policy note: Enabling INSPECT_ENABLE_EXEC=1 exposes only single‑shot bash() and python() tools. The stateful bash_session tool is never surfaced by this repo’s standard_tools(); it is reserved for internal filesystem‑sandbox operations (e.g., sed, ls, wc -c).

Viewing Logs (Inspect View)

Inspect provides a log viewer. See: docs/cli/inspect_view.md

Advanced Usage

Sub‑agents Configuration

Define sub‑agents in YAML and load programmatically. See: docs/guides/subagents.md

Architecture

flowchart LR
    SP[[System Prompt / Config]] --> S[Supervisor]
    MR[Model Resolver] --> S
    S --> L[Logs / Traces]
    S -->|tool call| AP[Approvals & Policies]
    AP --> ST[Stateless Tools]
    AP --> SS[Stateful Tools]
    ST -.-> S
    SS -.-> S

    subgraph "FS Path Modes (MODE=store|sandbox)"
      direction LR
      FST[FS Tools] -->|"store (default)"|VFS["(VFS)"]
      FST -->|sandbox| SBX[["Sandboxed Editor (no delete)"]]
      SBX --> HFS[(Host FS)]
    end
    AP --> FST
    VFS -.-> S
    SBX -.-> S
    HFS -.-> S

    S -->|handoff| CG[Context Gate]
    CG <-->|iterate| SA[Sub-Agents]
    SA -.-> S

Fallback: docs/diagrams/architecture_overview.png

Documentation

Getting Started: docs/getting-started/inspect_agents_quickstart.md
Tools Reference: docs/tools/README.md
Filesystem & Sandbox: docs/how-to/filesystem.md
Approvals: docs/how-to/approvals.md
Sub‑agent Patterns: docs/guides/subagents.md
Examples: examples/inspect/
Testing Guides (repo): tests/README.md

Docs (MkDocs)

To preview docs locally, see: docs/README.md

Project Status

Version: 0.0.2 (repo) / see PyPI badge for latest
Status: Beta
Python: 3.11+ (tested on 3.12)
Roadmap: Milestones | Projects

Recent Additions

✅ CI workflows (tests, lint, coverage) and release automation
✅ Published to PyPI with automated releases
✅ Advanced examples including vending_bench business simulation
✅ Comprehensive documentation with MkDocs

Coming Soon

Expanded examples for web_browser and sandboxed exec
Additional sub-agent templates (researcher, coder, editor)

Contributing

See CONTRIBUTING.md for guidelines.

Quick Setup for Contributors

See CONTRIBUTING.md for up‑to‑date dev environment instructions.

Support

Questions: GitHub Discussions
Bugs & Features: Open an Issue with repro steps

License & Acknowledgments

Licensed under MIT
Thanks to the Inspect-AI project and ecosystem
Inspired by CLI-first DX from projects like Bun and Supabase

Name		Name	Last commit message	Last commit date
Latest commit History 1,067 Commits
.artifacts		.artifacts
.github		.github
docs		docs
env_templates		env_templates
examples		examples
external		external
mirror-repo		mirror-repo
ops/providers		ops/providers
overrides		overrides
scripts		scripts
src/inspect_agents		src/inspect_agents
tests		tests
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
.pypirc.template		.pypirc.template
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dangerfile.js		Dangerfile.js
LICENSE		LICENSE
README.md		README.md
STATUS.md		STATUS.md
VENDING_BENCH_COMPLETION.md		VENDING_BENCH_COMPLETION.md
config_spec.yaml		config_spec.yaml
deep_agents.png		deep_agents.png
environment_var_flags.md		environment_var_flags.md
improving_mkdocs.md		improving_mkdocs.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
pyproject.toml.template		pyproject.toml.template
pytest-docs.ini		pytest-docs.ini
pytest.ini		pytest.ini
pytest_no_xdist.ini		pytest_no_xdist.ini
retrievers_framework_1.md		retrievers_framework_1.md
sitecustomize.py		sitecustomize.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

inspect_agents

Quickstart (self‑contained)

Why Inspect Agents?

Key Features

Table of Contents

Installation

Requirements

From PyPI (Recommended)

From Source (Development)

Configure Environment Variables

Usage

Scaffold a new agent

FS tools (store mode)

CLI (Inspect)

Viewing Logs (Inspect View)

Advanced Usage

Sub‑agents Configuration

Architecture

Documentation

Docs (MkDocs)

Project Status

Recent Additions

Coming Soon

Contributing

Quick Setup for Contributors

Support

License & Acknowledgments

About

Uh oh!

Releases 2

Packages

Contributors 19

Uh oh!

Languages

License

cnm13ryan/inspect_agents

Folders and files

Latest commit

History

Repository files navigation

inspect_agents

Quickstart (self‑contained)

Why Inspect Agents?

Key Features

Table of Contents

Installation

Requirements

From PyPI (Recommended)

From Source (Development)

Configure Environment Variables

Usage

Scaffold a new agent

FS tools (store mode)

CLI (Inspect)

Viewing Logs (Inspect View)

Advanced Usage

Sub‑agents Configuration

Architecture

Documentation

Docs (MkDocs)

Project Status

Recent Additions

Coming Soon

Contributing

Quick Setup for Contributors

Support

License & Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 19

Uh oh!

Languages

Packages