LLM-verify

Real verification for LLM-generated code via ECHIDNA integration.

The Problem

When Claude "validates" code, it’s doing pattern matching against training data:

| What It Looks Like | What’s Actually Happening | |-------------------|---------------------------| | "This code is correct" | High confidence from patterns | | "I notice this bug" | Pattern recognition | | "The types match" | Internalised grammar patterns | | "This compiles" | Syntax familiarity prediction |

Claude is not running: - Pratt parsers - SMT solvers - Model checkers - Theorem provers - Type inference algorithms

Claude makes educated guesses. It’s often right, but can and does make mistakes.

The Solution

claude-verify provides actual verification by integrating with [ECHIDNA](https://github.com/Hyperpolymath/echidna), a neurosymbolic theorem proving platform that provides access to 12 provers:

SMT Solvers: Z3, CVC5
Theorem Provers: Lean 4, Coq, Agda, Isabelle, HOL Light, Mizar
First-Order ATP: Vampire, E
Model Checkers: TLC (TLA+), Alloy Analyzer

Plus neurosymbolic integration via DeepProbLog and OpenCyc.

Quick Start

= Install (requires GHC 9.4+)

cabal install claude-verify

= Initialize project configuration

claude-verify init

= Verify files

claude-verify verify src/*.rs

= Record feedback when Claude is wrong

claude-verify feedback record \
    --output "for i in 0..=n" \
    --issue "Off-by-one: should be 0..n" \
    --correction "for i in 0..n"

= View known issues

claude-verify feedback list

Configuration

Create .claude-context.toml in your project root:

[project]
name = "my-project"
languages = ["rust"]
safety_level = "high"

[known_issues]
items = [
    "Off-by-one in pagination - use exclusive upper bounds",
    "This project uses no_std - don't assume alloc",
]

[verification]
required = ["compile", "clippy", "test"]
optional = ["miri", "audit"]

[formal_specs]
tla_plus = ["specs/consensus.tla"]
lean = ["specs/invariants.lean"]

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                                                                  │
│   Claude Output (probabilistic)                                  │
│         │                                                        │
│         ▼                                                        │
│   ┌─────────────────┐                                            │
│   │  claude-verify  │  Haskell orchestrator                      │
│   │                 │  - Multi-language parsing                  │
│   │                 │  - VC extraction                           │
│   │                 │  - Feedback capture                        │
│   └─────────────────┘                                            │
│         │                                                        │
│         ▼                                                        │
│   ┌─────────────────┐                                            │
│   │    ECHIDNA      │  Neurosymbolic platform                    │
│   │                 │  - 12 theorem provers                      │
│   │                 │  - DeepProbLog                             │
│   │                 │  - OpenCyc                                 │
│   └─────────────────┘                                            │
│         │                                                        │
│         ▼                                                        │
│   ACTUAL VERIFICATION                                            │
│         │                                                        │
│         ▼                                                        │
│   ┌─────────────────┐                                            │
│   │    Feedback     │  For future sessions                       │
│   │     Loop        │                                            │
│   └─────────────────┘                                            │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

Commands

claude-verify verify [FILES]     Verify with ECHIDNA provers
claude-verify check [FILES]      Quick checks (no formal verification)
claude-verify feedback record    Record a correction
claude-verify feedback list      Show recent feedback
claude-verify feedback export    Export for session injection
claude-verify init               Create .claude-context.toml
claude-verify stats              Show verification statistics

Pre-commit Hook

#!/bin/bash
= .git/hooks/pre-commit

claude-verify check --staged --fail-on-error

CI/CD Integration

= .github/workflows/verify.yml

- uses: hyperpolymath/claude-verify-action@v1
  with:
    echidna-version: 'latest'
    fail-on: ['counterexample', 'error']

Feedback Loop

The key innovation is the feedback loop: when verification fails or a user corrects Claude, the issue is captured with full provenance and injected into future sessions.

User corrects Claude → Feedback captured → Stored in DB
                                              ↓
Next Claude session ← Known issues injected ←┘

See [docs/feedback-loop-spec.md](docs/feedback-loop-spec.md) for details.

Building from Source

Requires: - GHC 9.4+ - cabal-install 3.8+ - Z3 (optional, for local SMT solving)

git clone https://github.com/Hyperpolymath/llm-verify
cd llm-verify
cabal build
cabal test
cabal install

Or with just:

just build
just test
just install

[ECHIDNA](https://github.com/Hyperpolymath/echidna) - The neurosymbolic theorem proving platform
[Echomesh](https://github.com/Hyperpolymath/echomesh) - Cross-session context persistence

License

MIT

What This Doesn’t Do

Make Claude actually understand code (still pattern matching)
Guarantee absence of all bugs (verification is incomplete)
Replace human review (humans understand requirements)
Verify arbitrary properties (needs specifications)

What This Does

Provide actual verification instead of guessing
Capture and persist correction feedback
Integrate real provers (Z3, CVC5, Lean, etc.)
Give confidence levels with evidence
Enable learning from mistakes

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
.claude		.claude
.github		.github
.machine_readable		.machine_readable
app		app
config		config
docs		docs
hooks		hooks
src		src
test		test
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.nojekyll		.nojekyll
ARCHITECTURE.md		ARCHITECTURE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.adoc		CONTRIBUTING.adoc
LICENSE		LICENSE
LICENSE.txt		LICENSE.txt
MAINTAINERS.adoc		MAINTAINERS.adoc
Mustfile		Mustfile
PALIMPSEST.adoc		PALIMPSEST.adoc
README.adoc		README.adoc
SECURITY.md		SECURITY.md
claude-verify.cabal		claude-verify.cabal
justfile		justfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Licenses found

Uh oh!

Repository files navigation

LLM-verify

The Problem

The Solution

Quick Start

Configuration

Architecture

Commands

Pre-commit Hook

CI/CD Integration

Feedback Loop

Building from Source

License

What This Doesn’t Do

What This Does

About

Licenses found

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Uh oh!

License

Licenses found

hyperpolymath/llm-verify

Folders and files

Latest commit

History

Repository files navigation

LLM-verify

The Problem

The Solution

Quick Start

Configuration

Architecture

Commands

Pre-commit Hook

CI/CD Integration

Feedback Loop

Building from Source

Related Projects

License

What This Doesn’t Do

What This Does

About

Topics

Resources

License

Licenses found

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages