Agentic refactoring and full MCP support #1502

gustavo-grieco · 2025-12-18T10:50:00Z

This branch contains a rewrite of some core features of Echidna to transform workers into agents, which can receive commands and collaborate with each other. It also allows to easily add MCP commands to query and guide the fuzzing campaign.

Commands

This code is still a work in progress but it can already tested with the few command implemented:

All these commands are subject to change, and there will be an experimentation phase where we will challenge the agents to increase the coverage (or break properties) using different commands

There are some additional changes in the logs to convey useful information for the agent, in particular, when new coverage is found:

[2025-12-18 11:45:22.26] [Worker 2] New coverage: 15176 instr, 7 contracts, 19 seqs in corpus (mintAndApprove)

How to test this PR

Open your echidna-compatible fuzzing project in Visual Studio Code, make sure you have Github Copilot installed.
Compile this branch or download the echidna executable from our CI tests.
Start/resume an Echidna campaign for the agent to guide. Very recommended to use text mode (--format text) for this test. Append the --server 3000 to your command line to start the MCP server in the port 3000.
Create the .vscode/mcp.json file with the following content:

{
  "servers": {
    "Echidna fuzzing campaign": {
      "type": "http",
      "url": "http://localhost:3000/mcp"
    }
  }
}

Once you save the file, you will be able to click on the "Run" button next the server name to make sure the it is detected by Copilot. It should list the number of commands available (currently 7).
5. Everything should be ready to go, open the Chat window (Cmd + Shift + P and write /chat) and start prompting. My recommended option to quickly start:

An Echidna fuzzing campaign is currently running and has already achieved baseline coverage using the default strategy, meaning trivial execution paths are mostly explored.

Using the available MCP interface **only** (do not modify any code or functions):

1. **Identify the campaign context**
   - Determine the fuzzing target and current campaign status using the appropriate MCP commands.

2. **Analyze coverage**
   - Inspect coverage for the relevant contracts using `show_coverage`.
   - Identify execution paths or code regions with **low coverage** that are *theoretically reachable* from the target logic.

3. **Design targeted fuzzing sequences**
   - Once you understand the contract behavior, use `inject_fuzz_transactions` to prioritize one or more transaction sequences.
   - Separate multiple calls with `;` (e.g., `f(1,?,?) ; g(?,2,5)`).
   - Combine **concrete values and random parameters** strategically.
   - **Do not use `?` for all parameters**, as those cases are already well-covered by the existing campaign.
   - **Avoid making all parameters concrete when possible**. As a *recommendation* (not a strict requirement), leave at least one parameter as `?` per call to allow the fuzzer to continue exploring the input space rather than repeatedly replaying a single fixed execution. Fully concrete calls may still be used when they intentionally target a very specific path or invariant.
   - **Injection semantics:** the prioritized sequence is **not executed in isolation**. Echidna starts from an existing transaction sequence in the current corpus and **inserts the injected sequence at a random position within it**, not necessarily at the beginning. Design injected calls assuming relevant state may already exist.

4. **Evaluate results**
   - After injecting transactions, run `sleep 20` to allow fuzzing to progress.
   - Then call `status` to check whether additional coverage was discovered or any invariant (e.g., `assert`) failed.

5. **Reset priorities**
   - Clear prioritized calls using `clear_fuzz_priorities` to return the fuzzer to its default random sampling strategy.

Focus on crafting transaction sequences that are most likely to exercise uncovered logic or trigger invariant violations, while still allowing the fuzzer enough freedom to explore variations.

CLAassistant · 2025-12-18T10:50:09Z

All committers have signed the CLA.

The clean PR branch is based on upstream/dev-agents (PR crytic#1502) which uses the '--server' flag name. Documentation was written for the development branch which used '--mcp-port', causing inconsistency. Changes: - AGENT_TESTING_GUIDE.md: Update all command examples - test-mcp-client.py: Fix error message - examples/README.md: Update all 3 command examples - examples/simple_agent.py: Fix error message - examples/langgraph_agent.py: Fix error message - tests/mcp/conftest.py: Fix pytest fixture command - .gitignore: Add Python cache patterns This ensures documentation matches the actual upstream implementation.

- Update examples/README.md to list correct 7 tools - Fix mcp_client_wrapper.py to use upstream tool names: * inject_fuzz_transactions (not inject_transaction) * clear_fuzz_priorities (not clear_priorities) * status, target, show_coverage, reload_corpus, dump_lcov - Mark old test files as skipped (use old tool names): * test_corpus.py - get_corpus_size, inspect_corpus, find_transaction * test_injection.py - inject_transaction (old signature) * test_prioritization.py - prioritize_function * test_read_logs.py - read_logs (commented out in upstream) - Update docstring: 7 active tools (not 9) These tests are preserved for reference but skipped until updated to match upstream API from PR crytic#1502.

gustavo-grieco requested review from arcz and elopez as code owners December 18, 2025 10:50

This was referenced Dec 18, 2025

[RFC] MCP support during a fuzzing campaign #1423

Closed

Feat: SSE POST Events for lcov dump and #1498

Closed

gustavo-grieco added 25 commits December 19, 2025 16:28

first step to define agent interactions

25562c6

first functional MCP command

352ab79

removed redundant code

9005d53

verify -> verification

1f6f93f

priorize_function command

1bf5e2a

read_logs command

c73d7ea

better logs and coverage_report

582e6ad

hlint fixes

c0677a5

improve show_coverage

159a7d2

fixed flake to build correctly

33358ad

fixed test compilation

45b90fe

fixed flake to build correctly

d6cb38f

refactor MCP code

f82da87

refactor MCP code

b33baec

fix tests

b4de33a

new command

8685e54

new command

1c1bdde

fixes

67d103c

simplify get coverage tool

af90c03

new command

0eefe2d

make sure logs are available for the mcp

773b2a3

implemented status command

9b33fc7

fixes

5830818

more fixes

b52d729

more fixes

017b181

elopez force-pushed the dev-agents branch from 29158f2 to 017b181 Compare December 19, 2025 19:28

gustavo-grieco added 12 commits December 20, 2025 09:08

allow sequences to be prioritized

a73c072

clean-up

b2159a0

new command

718f11d

inject_fuzz_transactions validation

15a7eaf

added target mcp command

9a85bb2

upgraded haskell-mcp-server

2d4a4cd

upgraded haskell-mcp-server

bc5b652

added optimization values to the status command

1d277a9

allow to intercalate random transactions durign priorized sequence

b521a45

better handling of parsing injected transactions

9df353f

insert transactions in a random part of the ones in the corpus

84c9005

refactoring and probabily tweaking

fb40b77

gustavo-grieco mentioned this pull request Dec 28, 2025

Rewrite campaign handling to allow more flexible worker interactions #1490

Open

datradito mentioned this pull request Dec 29, 2025

feat: Add MCP server for AI agent integration with fuzzing controls #1508

Closed

datradito mentioned this pull request Dec 29, 2025

docs: Add comprehensive MCP agent integration guide and examples #1509

Open

wrap the coverage MCP command output in code tags

0559648

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Agentic refactoring and full MCP support #1502

Agentic refactoring and full MCP support #1502

Uh oh!

gustavo-grieco commented Dec 18, 2025 •

edited

Loading

Uh oh!

CLAassistant commented Dec 18, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Agentic refactoring and full MCP support #1502

Are you sure you want to change the base?

Agentic refactoring and full MCP support #1502

Uh oh!

Conversation

gustavo-grieco commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Commands

How to test this PR

Uh oh!

CLAassistant commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gustavo-grieco commented Dec 18, 2025 •

edited

Loading

CLAassistant commented Dec 18, 2025 •

edited

Loading