Claude Code with Persistent Memory

Cross-Session, Cross-Machine Memory for Claude Code

7,000+ memories | 6.8x faster retrieval | 5 custom agents | Works everywhere

Problem · Solution · Architecture · Benchmarks · Quick Start · Docs

The Problem

Claude Code forgets everything between sessions.

Every time you start Claude Code, it's like meeting someone with amnesia. You have to:

Re-explain your project architecture
Repeat lessons learned from debugging
Rediscover patterns that worked before
Lose all context when switching machines

Your knowledge compounds. Your AI should too.

The Solution

This repository implements persistent, intelligent memory for Claude Code using Hindsight by Vectorize.io, deployed as a production-grade cloud service:

Remembers across sessions — Lessons learned yesterday are available today
Syncs across machines — Laptop, desktop, and server share the same memory
Captures automatically — No manual saving; importance scoring filters noise
Retrieves intelligently — 6.8x faster reranking with FlashRank ONNX
Travels with you — Custom agents, commands, and configurations follow everywhere

Before: "Claude, remember we're using AWS SSO for authentication"
After:  Claude already knows — auto-captured from your last session

For the full deployment story, see HINDSIGHT-DEPLOYMENT-GUIDE.pdf.

Architecture

GCP Deployment

Two Docker containers on a single GCP Compute Engine VM, backed by Cloud SQL PostgreSQL:

graph TD
    subgraph VM["GCP Compute Engine (e2-medium: 4 vCPU, 5.3 GB RAM)"]
        direction LR
        subgraph C1["hindsight container"]
            MCP["MCP Server<br/><i>port 8888</i>"]
            UI["Web UI<br/><i>port 9999</i>"]
            FR["FlashRank ONNX<br/>Reranker"]
            EMB["Local Embeddings<br/>BAAI/bge-small-en-v1.5"]
        end
        subgraph C2["litellm-proxy container"]
            LIT["LiteLLM Proxy<br/><i>port 4000</i>"]
            BED["Routes to<br/>AWS Bedrock"]
            MOD["Claude Opus 4.5"]
        end
        NET["Docker bridge network"]
    end

    subgraph DB["Cloud SQL"]
        PG["PostgreSQL"]
        BAK["Managed Backups"]
        HA["High Availability"]
    end

    MCP -->|"OpenAI-format<br/>API calls"| LIT
    MCP --> PG

    style VM fill:#e3edf7,stroke:#2c5f8a,stroke-width:2px,color:#1a1a1a
    style C1 fill:#cde0f2,stroke:#2c5f8a,stroke-width:1px,color:#1a1a1a
    style C2 fill:#cde0f2,stroke:#2c5f8a,stroke-width:1px,color:#1a1a1a
    style DB fill:#fef3e0,stroke:#c07d0e,stroke-width:2px,color:#1a1a1a
    style NET fill:#f0f4f8,stroke:#999,stroke-width:1px,color:#555
    style MCP fill:#4A90D9,stroke:#2c5f8a,color:#fff
    style LIT fill:#4A90D9,stroke:#2c5f8a,color:#fff
    style PG fill:#F5A623,stroke:#c07d0e,color:#fff

Multi-Machine Sync

All Claude Code configuration syncs automatically via OneDrive symlinks:

graph LR
    subgraph OD["OneDrive (Source of Truth)"]
        CLAUDE_SRC["CLAUDE.md"]
        HOOKS_SRC["hooks/"]
        AGENTS_SRC["agents/"]
        CMDS_SRC["commands/"]
        SETTINGS_SRC["settings.json<br/><i>template</i>"]
    end

    subgraph LOCAL["~/.claude/ (Local Machine)"]
        CLAUDE_DST["CLAUDE.md"]
        HOOKS_DST["hooks/"]
        AGENTS_DST["agents/"]
        CMDS_DST["commands/"]
        SETTINGS_DST["settings.json"]
    end

    CLAUDE_SRC -.->|"symlink"| CLAUDE_DST
    HOOKS_SRC -.->|"symlink"| HOOKS_DST
    AGENTS_SRC -.->|"symlink"| AGENTS_DST
    CMDS_SRC -.->|"symlink"| CMDS_DST
    SETTINGS_SRC -->|"copied at install"| SETTINGS_DST

    style OD fill:#fff9e6,stroke:#f0b429,stroke-width:2px,color:#1a1a1a
    style LOCAL fill:#f0e6ff,stroke:#7B68EE,stroke-width:2px,color:#1a1a1a
    style CLAUDE_SRC fill:#ffeeb3,stroke:#f0b429,color:#1a1a1a
    style HOOKS_SRC fill:#ffeeb3,stroke:#f0b429,color:#1a1a1a
    style AGENTS_SRC fill:#ffeeb3,stroke:#f0b429,color:#1a1a1a
    style CMDS_SRC fill:#ffeeb3,stroke:#f0b429,color:#1a1a1a
    style CLAUDE_DST fill:#e0d4f5,stroke:#7B68EE,color:#1a1a1a
    style HOOKS_DST fill:#e0d4f5,stroke:#7B68EE,color:#1a1a1a
    style AGENTS_DST fill:#e0d4f5,stroke:#7B68EE,color:#1a1a1a
    style CMDS_DST fill:#e0d4f5,stroke:#7B68EE,color:#1a1a1a

OneDrive auto-detect libraries (PowerShell, Bash, Node.js) find the correct OneDrive path on both work and personal machines. Git serves as backup and version control; OneDrive handles real-time sync.

Customizations vs Stock Hindsight

Starting from stock Hindsight, this deployment replaces or upgrades every major component:

Category	Stock Hindsight	This Deployment	Result
Database	SQLite (local file)	Cloud SQL PostgreSQL	Durable, concurrent, survives container rebuilds
LLM Provider	Direct OpenAI	LiteLLM Proxy → AWS Bedrock	Claude Opus 4.5, SSO auth, org compliance
Reranker Engine	SentenceTransformers (PyTorch)	FlashRank (ONNX Runtime)	6.8x faster on CPU, 80% less RAM
Reranker Model	ms-marco-MiniLM-L-6-v2 (22 MB)	ms-marco-TinyBERT-L-2-v2 (4 MB)	Fastest available model
Embeddings	Configurable	Local BAAI/bge-small-en-v1.5	No external API calls
Reranker Loading	Eager (on startup)	Lazy (`LAZY_RERANKER=true`)	Faster container startup

Performance Benchmarks

Reranking Speed

Metric	Before (SentenceTransformers)	After (FlashRank)	Improvement
Reranking 300 candidates	23.9s	3.5s	6.8x faster
Total `recall()` latency	24.3s	5.7s	4.2x faster
`reflect()` (3 iterations)	~78s	~18s	4.3x faster
Cold start (first query)	N/A	3.5s	One-time cost

Operational Metrics

Metric	Value
Total memories stored	7,273+
Knowledge graph links	738,808
Named entities tracked	8,066
Average `recall()` (warm)	~5.7s
Average `retain()`	~200ms
Container memory usage	~1.2 GB (both containers)
Uptime	99.9%+ (GCP managed)

Auto-Capture System

A PostToolUse hook evaluates every Claude Code tool invocation and automatically stores valuable activities as memories. No manual intervention needed.

Filtering Pipeline

graph TD
    START(["Tool Invocation"]) --> SKIP{"Skip Tool?<br/><i>Read, Glob, Grep,<br/>TaskOutput, TaskList,<br/>TaskGet</i>"}

    SKIP -->|"Yes"| SKIP_OUT["SKIP<br/><i>too noisy</i>"]
    SKIP -->|"No"| DEDUP{"Deduplicate?<br/><i>Same bash command<br/>within 5-min window?</i>"}

    DEDUP -->|"Yes"| DEDUP_OUT["SKIP<br/><i>duplicate</i>"]
    DEDUP -->|"No"| LOW{"Score < 20?"}

    LOW -->|"Yes"| LOW_OUT["SKIP<br/><i>too low value</i>"]
    LOW -->|"No"| MED{"Score 20-49?"}

    MED -->|"Yes"| ASYNC["Async retain<br/><i>fire-and-forget</i><br/>Tags: expires:7d"]
    MED -->|"No"| HIGH{"Score 50-69?"}

    HIGH -->|"Yes"| STD["Standard retain<br/>Tags: expires:30d"]
    HIGH -->|"No"| PRIO["High-priority retain<br/>Tags: permanent"]

    style START fill:#4A90D9,stroke:#2c5f8a,color:#fff
    style SKIP fill:#f8f9fa,stroke:#555,color:#1a1a1a
    style DEDUP fill:#f8f9fa,stroke:#555,color:#1a1a1a
    style LOW fill:#f8f9fa,stroke:#555,color:#1a1a1a
    style MED fill:#f8f9fa,stroke:#555,color:#1a1a1a
    style HIGH fill:#f8f9fa,stroke:#555,color:#1a1a1a

    style SKIP_OUT fill:#ef5350,stroke:#c62828,color:#fff
    style DEDUP_OUT fill:#ef5350,stroke:#c62828,color:#fff
    style LOW_OUT fill:#ef5350,stroke:#c62828,color:#fff
    style ASYNC fill:#fff176,stroke:#f9a825,color:#1a1a1a
    style STD fill:#66bb6a,stroke:#2e7d32,color:#fff
    style PRIO fill:#4A90D9,stroke:#2c5f8a,color:#fff

Importance Scoring

Activity	Base Score	Modifiers
`git commit`	90	+15 if errors
`git push`	85	+15 if errors
File edit (.ts/.js/.py)	65	+10 if Write (new file)
`package.json` edit	80	Critical file boost
Command execution	50	-30 if trivial (ls, pwd, cd)
Task completion	60	+10 if subtasks
Error presence	+15	Applied on top of base

Memories are tagged with auto-generated metadata (auto-captured, tool:bash, priority:high, etc.) and expiry tags (expires:7d, expires:30d, permanent) for automatic lifecycle management. See HINDSIGHT-SETUP.md for full configuration details.

Data Privacy

All embedding and reranking operations run locally on the GCP VM. Only LLM-dependent operations (retain for fact extraction and reflect for synthesis) send data to AWS Bedrock.

Operation	Where It Runs	Data Leaves VM?
Embedding	Local (BAAI/bge-small-en-v1.5)	No
Reranking	Local (FlashRank ONNX)	No
Fact extraction (retain)	AWS Bedrock via LiteLLM	Yes (to AWS)
Synthesis (reflect)	AWS Bedrock via LiteLLM	Yes (to AWS)
Storage	Cloud SQL PostgreSQL (same GCP project)	No

AWS Bedrock does not use customer data for model training. No telemetry is sent to external services.

Custom Agents and Commands

5 Specialized Agents

Agent	Purpose	When It Activates
qa-test-engineer	Comprehensive testing (unit through E2E)	After code changes, before merges
requirements-guardian	User acceptance testing	Verify features match specs
devops-guardian	Git operations, code review	Before commits, PRs, pushes
elite-security-auditor	Vulnerability scanning	Security-critical code
elite-documentation-architect	Technical writing	READMEs, APIs, architecture docs

Slash Commands

Command	Description
`/test`	Run comprehensive testing across all levels
`/worktree`	Manage git worktrees for parallel Claude sessions

MCP Tools

Hindsight exposes 5 core MCP tools to Claude Code:

Tool	Purpose
`retain`	Store a fact, decision, or insight
`recall`	Search stored memories using semantic similarity
`reflect`	Synthesize insights from multiple memories using LLM reasoning
`list_banks`	View all memory banks
`create_bank`	Create isolated memory namespaces

Example — recalling a past debugging session:

# You: "How did we fix the AWS SSO issue last time?"
# Claude internally runs: reflect("AWS SSO debugging history")
# Returns: Detailed solution from 3 weeks ago, including code fixes

Quick Start

One-Click Installation

Windows:

:: Double-click from your OneDrive folder:
OneDrive\Claude Backup\claude-config\Install-Claude-Code.bat

Mac/Linux:

git clone https://github.com/PakAbhishek/claude-code-config.git
cd claude-code-config/_scripts
bash install-claude-complete.sh

What Gets Installed

Claude Code CLI (latest version)
Hindsight MCP server connection
AWS Bedrock via SSO (opens browser for auth)
5 custom agents + 2 slash commands
Auto-capture hook (PostToolUse)
SDLC enforcement hooks (security, protocols)
Auto-sync symlinks (agents, commands, hooks, CLAUDE.md)
AWS credential auto-push to GCP (SessionStart hook + Scheduled Task)

Verify

claude --version                    # CLI installed
recall("test connection")           # Memory bank connected
ls ~/.claude/agents/                # 5 agent .md files

See INSTALLER-README.md for detailed installation guide.

Documentation

Document	Description
HINDSIGHT-DEPLOYMENT-GUIDE.pdf	Full deployment architecture, benchmarks, and optimization details
HINDSIGHT-SETUP.md	Hindsight integration guide (hooks, scoring, usage examples)
ARCHITECTURE.md	System design and technical architecture
SECURITY.md	Security model and compliance
INSTALLER-README.md	Installer technical documentation
TROUBLESHOOTING.md	Common issues and solutions
CHANGELOG.md	Version history

Repository Structure

claude-code-config/
  agents/                  # 5 custom agent definitions (.md)
  commands/                # Slash commands (/test, /worktree)
  hooks/                   # Auto-capture + SDLC enforcement hooks
  hindsight-setup/         # AWS credential pipeline scripts
  hindsight-mcp-server/    # Custom stdio MCP server (alternative)
  diagrams/                # Mermaid source files
  _scripts/                # Installers + cross-platform utilities
  HINDSIGHT-DEPLOYMENT-GUIDE.pdf
  README.md

Author

Abhishek Chauhan · GitHub

Resources: Hindsight · MCP Specification · Claude Code

_{7,273 memories · 738,808 connections · Growing every session}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Claude Code with Persistent Memory

Cross-Session, Cross-Machine Memory for Claude Code

The Problem

The Solution

Architecture

GCP Deployment

Multi-Machine Sync

Customizations vs Stock Hindsight

Performance Benchmarks

Reranking Speed

Operational Metrics

Auto-Capture System

Filtering Pipeline

Importance Scoring

Data Privacy

Custom Agents and Commands

5 Specialized Agents

Slash Commands

MCP Tools

Quick Start

One-Click Installation

What Gets Installed

Verify

Documentation

Repository Structure

Author

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
_scripts		_scripts
agents		agents
commands		commands
diagrams		diagrams
hindsight-mcp-server		hindsight-mcp-server
hindsight-setup		hindsight-setup
hooks		hooks
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
Build-PDF.bat		Build-PDF.bat
Build-PDF.ps1		Build-PDF.ps1
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CTO-REVIEW-SOC2-HOOK.md		CTO-REVIEW-SOC2-HOOK.md
DEVELOPER-QUICK-REFERENCE.md		DEVELOPER-QUICK-REFERENCE.md
HINDSIGHT-DEPLOYMENT-GUIDE.md		HINDSIGHT-DEPLOYMENT-GUIDE.md
HINDSIGHT-SETUP.md		HINDSIGHT-SETUP.md
INSTALLER-CHANGES-SOC2.md		INSTALLER-CHANGES-SOC2.md
INSTALLER-README.md		INSTALLER-README.md
Install-Claude-Code-Team.bat		Install-Claude-Code-Team.bat
Install-Claude-Code.bat		Install-Claude-Code.bat
README.md		README.md
SECURITY.md		SECURITY.md
aws-config-template		aws-config-template
install-linux.sh		install-linux.sh
settings.json		settings.json
soc2-validator.py		soc2-validator.py

PakAbhishek/claude-code-config

Folders and files

Latest commit

History

Repository files navigation

Claude Code with Persistent Memory

Cross-Session, Cross-Machine Memory for Claude Code

The Problem

The Solution

Architecture

GCP Deployment

Multi-Machine Sync

Customizations vs Stock Hindsight

Performance Benchmarks

Reranking Speed

Operational Metrics

Auto-Capture System

Filtering Pipeline

Importance Scoring

Data Privacy

Custom Agents and Commands

5 Specialized Agents

Slash Commands

MCP Tools

Quick Start

One-Click Installation

What Gets Installed

Verify

Documentation

Repository Structure

Author

About

Resources

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages