A local-first cognitive architecture for AI agents.
This system models human-like memory consolidation by separating Episodic Memory (raw, timestamped events) from Semantic Memory (stable, consolidated facts). It includes defense-in-depth LLM sanitization, provenance tracking, and multilingual (CJK) optimization using Qwen 2.5 and BGE-M3.
```
git clone https://github.com/wheevu/episodic-memory-pipeline
cd episodic-memory-pipeline
pip install -e .

# Generate local artifacts deterministically (no committed binaries)
make demo
```

For a fast, dependency-light run (no models), use `make demo-mock`.
After installation, the console script is available:
```
episodic-memory doctor --dry
episodic-memory ingest "I started learning Korean today"
episodic-memory query "What am I learning?"
episodic-memory recall "korean" --topic
episodic-memory consolidate --all
episodic-memory stats
```

The legacy entrypoint still works:

```
python cli.py doctor --dry
python cli.py query "What am I learning?"
```

Evaluation runs live under `runs/eval/<run_id>/eval_run.json` and include:
- git commit hash (if available)
- config snapshot (provider/model, k, scenario)
- metrics + warnings
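An `eval_run.json` might look roughly like this; the field names and values below are illustrative assumptions, and only the three ingredients listed above are guaranteed:

```json
{
  "git_commit": "abc1234",
  "config": {"provider": "ollama", "model": "qwen2.5:7b-instruct", "k": 5, "scenario": "diary"},
  "metrics": {"recall_at_k": 0.82},
  "warnings": []
}
```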
```
episodic-memory eval-run --scenario diary
episodic-memory eval-list
episodic-memory eval-compare <runA> <runB>
```

Design principles:

- Episodic memory ≠ vector blobs: each memory is a structured event with context, time, and meaning.
- Time and provenance matter: Every fact and summary links back to source episodes. Hallucination prevention starts with lineage.
- Memory must be curated, not accumulated: Not everything is worth remembering. We filter aggressively via a "Memory Worthiness" gate.
- Retrieval should feel like recalling a journey: Narrative coherence over raw similarity scores.
- Defense-in-depth validation: Input sanitization (length limits, type checking), LLM output sanitization (topics/entities filtering), and automatic retry logic for API failures ensure production robustness.
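The validation layers above can be sketched as follows. This is a rough illustration, not the project's actual implementation: the function names, the 4000-character limit, the retry parameters, and the crude worthiness heuristic are all assumptions.

```python
import time

MAX_INPUT_CHARS = 4000  # assumed limit; the real gate may differ

def sanitize_input(text):
    """Input sanitization: type check + length limit before anything is stored."""
    if not isinstance(text, str):
        raise TypeError("memory input must be a string")
    return text.strip()[:MAX_INPUT_CHARS]

def sanitize_llm_topics(raw):
    """Output sanitization: keep only short, non-empty strings from the LLM."""
    if not isinstance(raw, list):
        return []
    return [t.strip().lower() for t in raw
            if isinstance(t, str) and 0 < len(t.strip()) <= 64]

def passes_worthiness_gate(text):
    """Crude stand-in for the "Memory Worthiness" filter (the real gate is richer)."""
    return len(text.split()) >= 3

def with_retries(call, attempts=3, backoff=0.1):
    """Automatic retry with exponential backoff for transient API failures."""
    for i in range(attempts):
        try:
            return call()
        except Exception:
            if i == attempts - 1:
                raise
            time.sleep(backoff * (2 ** i))
```

The key property is that LLM output is treated as untrusted input: anything that is not a well-formed topic string is silently dropped rather than stored.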
For the full architecture diagram, evaluation framework, and design rationale, see ARCHITECTURE.md.
Why SQLite over Postgres?
- Local-first, no server dependencies
- Single-file portability (backup = copy file)
- JSON1 extension for flexible metadata
- Zero configuration required
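As a minimal sketch of the SQLite side, the JSON1 functions let structured metadata live in a single table. The schema and field names here are illustrative, not the project's actual schema:

```python
import json
import sqlite3

# Single-file DB in real use (e.g. ./data/memory.db); in-memory for the sketch.
con = sqlite3.connect(":memory:")
con.execute("""
    CREATE TABLE episodes (
        id   INTEGER PRIMARY KEY,
        ts   TEXT NOT NULL,
        text TEXT NOT NULL,
        meta TEXT NOT NULL DEFAULT '{}'  -- flexible JSON metadata (JSON1)
    )
""")
con.execute(
    "INSERT INTO episodes (ts, text, meta) VALUES (?, ?, ?)",
    ("2024-01-09T15:00:00", "I started learning Korean today",
     json.dumps({"topics": ["korean", "learning"], "source": "cli"})),
)
# JSON1 lets us query inside the metadata blob without extra tables.
rows = con.execute(
    "SELECT text FROM episodes WHERE json_extract(meta, '$.topics[0]') = 'korean'"
).fetchall()
```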
Why FAISS for vectors?
- Mature, fast, local-only C++ library
- Supports multiple index types for scaling
- Works well alongside SQLite for hybrid retrieval
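FAISS itself is too heavy for a short sketch, but the hybrid pattern — vector similarity for ranking, SQLite metadata for filtering — can be illustrated with a plain-Python cosine search standing in for the FAISS index. All names and data here are illustrative:

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

# Vector side: id -> embedding (a FAISS index would hold these in real use).
vectors = {1: [0.9, 0.1], 2: [0.1, 0.9], 3: [0.8, 0.2]}
# SQLite side: id -> metadata row.
metadata = {1: {"topic": "korean"}, 2: {"topic": "cooking"}, 3: {"topic": "korean"}}

def hybrid_search(query_vec, topic, k=2):
    # Rank every id by similarity, then keep only rows whose metadata matches.
    ranked = sorted(vectors, key=lambda i: cosine(query_vec, vectors[i]), reverse=True)
    return [i for i in ranked if metadata[i]["topic"] == topic][:k]
```

The shared integer id is the join key between the two stores, which is what makes the single-file SQLite + local FAISS pairing work without a server.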
Episode: a timestamped event capturing what happened, when, and in what context.

“On Tuesday at 3pm, I told my assistant I'm learning Korean for a trip to Seoul in March.”

Fact: a distilled, stable piece of knowledge extracted from one or more episodes.

“User is learning Korean. User has a trip to Seoul planned for March 2024.”

Narrative: a topic-level summary that weaves multiple episodes into a coherent story.

“User's Korean language learning journey: Started in January 2024 motivated by upcoming Seoul trip...”
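Concretely, the three layers and their provenance links might be shaped like this; the field names are illustrative, not the project's actual schema:

```python
episode = {
    "id": 17,
    "ts": "2024-01-09T15:00:00",
    "text": "I told my assistant I'm learning Korean for a trip to Seoul in March.",
}
fact = {
    "text": "User is learning Korean; Seoul trip planned for March 2024.",
    "source_episode_ids": [17],  # provenance: every fact links back to episodes
}
narrative = {
    "topic": "korean",
    "summary": "User's Korean learning journey, motivated by an upcoming Seoul trip.",
    "source_episode_ids": [17],  # summaries carry lineage too
}
```

The `source_episode_ids` links are what "hallucination prevention starts with lineage" means in practice: any fact or summary can be traced to the raw events it was distilled from.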
Copy `env.example` to `.env` (or export the variables directly) and configure:

```
# Embeddings (default: local)
EMBEDDING_PROVIDER=local              # local|openai|ollama|mock
EMBEDDING_MODEL=BAAI/bge-m3           # local/openai model name
EMBEDDING_DEVICE=cpu                  # cpu|cuda|mps
OLLAMA_EMBED_MODEL=nomic-embed-text   # when EMBEDDING_PROVIDER=ollama

# LLM
LLM_PROVIDER=ollama                   # openai|ollama
LLM_MODEL=gpt-4o-mini                 # when LLM_PROVIDER=openai
OLLAMA_MODEL=qwen2.5:7b-instruct
OLLAMA_BASE_URL=http://localhost:11434
LLM_TEMPERATURE=0.2

# API keys
OPENAI_API_KEY=sk-your-key-here

# Storage
DATABASE_PATH=./data/memory.db
VECTOR_INDEX_PATH=./data/vectors.faiss
```

To run fully local with Ollama:

```
# Install Ollama
brew install ollama   # macOS
# or: curl -fsSL https://ollama.com/install.sh | sh   # Linux

ollama pull qwen2.5:7b-instruct
ollama serve

export LLM_PROVIDER=ollama
export OLLAMA_MODEL=qwen2.5:7b-instruct
export EMBEDDING_PROVIDER=local
```

The `demo_data/` directory contains synthetic data only, for demos and testing:
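A provider factory driven by these variables might look like the sketch below. `pick_embedding_provider`, its return values, and the OpenAI default model are hypothetical stand-ins, not the project's actual API:

```python
import os

def pick_embedding_provider(env=os.environ):
    """Mirror the EMBEDDING_PROVIDER switch from the config above."""
    provider = env.get("EMBEDDING_PROVIDER", "local")  # default: local
    if provider == "local":
        return ("sentence-transformers", env.get("EMBEDDING_MODEL", "BAAI/bge-m3"))
    if provider == "ollama":
        return ("ollama", env.get("OLLAMA_EMBED_MODEL", "nomic-embed-text"))
    if provider == "openai":
        return ("openai", env.get("EMBEDDING_MODEL"))
    if provider == "mock":
        return ("mock", None)  # dependency-free, for make demo-mock
    raise ValueError(f"unknown EMBEDDING_PROVIDER: {provider}")
```

Failing fast on an unknown provider (rather than silently falling back) keeps misconfigured environments easy to diagnose with `doctor --dry`.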
- ✅ Fictional diary entries and memories
- ✅ Example evaluation queries
- ❌ Never commit real user data
- ❌ No sensitive information (API keys, PII)
See demo_data/README.md for details.
Install dev dependencies with `make install-dev` (or `pip install -e ".[dev]"`), then use the Make targets:

```
make test
make test-slow
make lint
make format
make demo
make demo-clean
make demo-mock
```

Repository layout:

```
src/cli/      # CLI commands + rendering (Rich/Click)
src/services/ # Business logic (no Rich/Click; returns plain dataclasses/dicts)
scripts/      # Reproducible bootstrap utilities
demo_data/    # Synthetic fixtures (safe to commit)
runs/eval/    # Versioned eval run outputs (gitignored per run)
data/         # Generated local artifacts (gitignored)
```
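The `src/cli/` vs `src/services/` split can be illustrated with a hypothetical stats command: the service returns a plain dataclass and the CLI layer only renders it. All names here are invented for illustration (the real CLI renders with Rich):

```python
from dataclasses import dataclass

@dataclass
class StatsResult:
    """Plain result type a service returns; no presentation concerns."""
    episodes: int
    facts: int

def stats_service(episode_count, fact_count):
    # src/services/: pure logic, no Rich/Click imports.
    return StatsResult(episodes=episode_count, facts=fact_count)

def render_stats(result):
    # src/cli/: presentation only; swap this layer without touching the logic.
    return f"episodes: {result.episodes}\nfacts: {result.facts}"
```

Keeping services free of Rich/Click is what makes them reusable by library consumers (see the `src.bootstrap` import below) and trivially testable.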
On macOS, there is a known interaction issue between FAISS and SentenceTransformers during Python cleanup. This is handled automatically by the bootstrap module.
For library users, import from `src.bootstrap`:

```python
from src.bootstrap import get_components

components = get_components()
# components.database, components.embedding_provider, etc.
```

License: MIT