Henry edited this page Aug 23, 2025 · 1 revision

Frequently Asked Questions

Common questions and answers about MCP Memory Service usage, configuration, and troubleshooting.

General Questions

What is MCP Memory Service?

MCP Memory Service is a Model Context Protocol server that provides semantic memory and persistent storage capabilities for AI assistants like Claude. It allows you to store, search, and retrieve information across conversations and sessions.

What storage backends are supported?

  • SQLite-vec (recommended): Fast, single-file database perfect for individual use
  • ChromaDB: Great for multi-client scenarios and team collaboration
  • Cloudflare: Production-ready distributed storage for enterprise use

How does semantic search work?

MCP Memory Service uses sentence transformers to convert your content into vector embeddings. When you search, it finds semantically similar content, not just exact text matches. This means searching for "authentication issues" will also find memories about "login problems" or "JWT errors."
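The idea can be sketched in a few lines: each memory and each query becomes a vector, and results are ranked by vector similarity. The toy 3-dimensional vectors and the use of cosine similarity below are illustrative assumptions — real sentence-transformer embeddings have hundreds of dimensions, and the service's exact distance metric may differ:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|), in [-1, 1].
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings" standing in for real model output.
auth_issues = [0.9, 0.1, 0.2]   # "authentication issues"
login_probs = [0.8, 0.2, 0.3]   # "login problems" -- semantically close
grocery_list = [0.1, 0.9, 0.1]  # unrelated content

print(cosine_similarity(auth_issues, login_probs))   # high, close to 1
print(cosine_similarity(auth_issues, grocery_list))  # low
```

This is why "authentication issues" can surface "login problems": the two phrases map to nearby vectors even though they share no words.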

Can I use this with other AI tools besides Claude?

Yes! MCP Memory Service implements the standard Model Context Protocol, so it works with any MCP-compatible client. It also provides REST APIs for custom integrations.
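For a custom integration over REST, a request might be assembled like this. The `/api/memories` route, payload shape, and Bearer-token header are assumptions for illustration — check the service's API documentation for the actual routes and fields:

```python
import json
import urllib.request

BASE_URL = "https://localhost:8443/api"  # hypothetical local instance

def build_store_request(content, tags):
    """Build a POST request that stores one memory (assumed endpoint)."""
    payload = json.dumps({"content": content, "tags": tags}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/memories",
        data=payload,
        headers={"Content-Type": "application/json",
                 "Authorization": "Bearer YOUR_API_KEY"},
        method="POST",
    )

req = build_store_request("Fixed the JWT refresh bug", ["auth", "bugfix"])
# urllib.request.urlopen(req) would send it; for a self-signed certificate,
# pass an SSL context that skips verification (the equivalent of curl's -k).
```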

Is my data secure?

Yes. Data is stored locally by default (SQLite-vec backend), or in your chosen infrastructure (ChromaDB, Cloudflare). The service supports HTTPS, API key authentication, and you have full control over your data.

Installation & Setup

What are the system requirements?

  • Python: 3.10+ (with sentence-transformers support)
  • Memory: 2GB+ recommended for embedding models
  • Disk Space: Varies by usage (~1GB for 100K memories)
  • OS: Windows, macOS, Linux supported

Do I need a GPU for good performance?

No, but it helps. MCP Memory Service automatically detects and uses:

  • CUDA (NVIDIA GPUs)
  • MPS (Apple Silicon M1/M2)
  • DirectML (Windows GPU acceleration)
  • CPU fallback for systems without GPU support
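The fallback order above can be approximated with PyTorch's detection APIs. This is an illustrative sketch, not the service's actual logic (and it omits the DirectML check, which needs the separate `torch-directml` package):

```python
def pick_device():
    """Best-effort accelerator detection with CPU fallback."""
    try:
        import torch
        if torch.cuda.is_available():
            return "cuda"   # NVIDIA GPU
        mps = getattr(torch.backends, "mps", None)
        if mps is not None and mps.is_available():
            return "mps"    # Apple Silicon
    except ImportError:
        pass                # no torch installed -- fall through
    return "cpu"            # universal fallback

print(pick_device())
```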

How do I migrate from one backend to another?

# Export from current backend
python scripts/export_memories.py --format json > backup.json

# Switch backend in config
export MCP_MEMORY_STORAGE_BACKEND=sqlite_vec

# Import to new backend
python scripts/import_memories.py --file backup.json

Can I run this on a server for team access?

Yes! Enable the HTTP server and configure for multi-client access:

export MCP_HTTP_ENABLED=true
export MCP_HTTPS_ENABLED=true
export MCP_HTTP_PORT=8443
export MCP_MEMORY_STORAGE_BACKEND=chroma  # Better for multi-client

Performance & Scaling

How fast are search queries?

Performance varies by backend:

  • SQLite-vec: ~5ms average
  • ChromaDB: ~15ms average
  • Cloudflare: Network dependent

How much memory does it use?

  • Base service: ~500MB
  • Embedding models: ~1GB
  • Memory cache: ~200MB per 1000 memories
  • Total: 2GB typical for active development use

Can I limit resource usage?

Yes, configure resource limits:

export MCP_MAX_MEMORY_MB=2048
export MCP_MAX_CONCURRENT_OPERATIONS=10
export MCP_EMBEDDING_BATCH_SIZE=50

How do I handle large numbers of memories?

  • Use appropriate result limits in searches
  • Implement memory cleanup routines
  • Consider database partitioning for >1M memories
  • Monitor performance with built-in metrics
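A cleanup routine typically selects memories past a retention window while protecting anything explicitly tagged to keep. The memory record shape and tag names below are assumptions, not the service's actual interface:

```python
from datetime import datetime, timedelta

def select_expired(memories, max_age_days=30, protected_tags=("important",)):
    """Return memories older than max_age_days with no protected tag."""
    cutoff = datetime.now() - timedelta(days=max_age_days)
    return [m for m in memories
            if m["created_at"] < cutoff
            and not any(t in protected_tags for t in m["tags"])]

memories = [
    {"id": 1, "created_at": datetime.now() - timedelta(days=90), "tags": ["temporary"]},
    {"id": 2, "created_at": datetime.now() - timedelta(days=90), "tags": ["important"]},
    {"id": 3, "created_at": datetime.now() - timedelta(days=1), "tags": []},
]
print([m["id"] for m in select_expired(memories)])  # [1]
```

Only memory 1 is selected: memory 2 is old but protected, and memory 3 is still within the window.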

Claude Code Integration

How do I install Claude Code hooks?

  1. Download the hook system from the MCP Memory Service repository
  2. Configure claude-hooks/config.json with your settings
  3. Test with node test-hooks.js
  4. Start using Claude Code normally

Why isn't my context loading automatically?

Check these common issues:

  • Memory service not running: curl -k https://localhost:8443/api/health
  • Hook configuration error: Check claude-hooks/config.json
  • Project not detected: Verify git repository and package files
  • No memories found: Check if you have relevant memories stored

How does the memory scoring algorithm work?

The algorithm uses four factors:

  • Time decay (25%): Recent memories weighted higher
  • Tag relevance (35%): Project and technology tag matches
  • Content relevance (15%): Semantic similarity to current context
  • Conversation relevance (25%): Related to ongoing conversation
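The four factors combine as a weighted sum. The weights below match the percentages above; the exponential time-decay curve and its 30-day half-life are illustrative assumptions, as the hooks' actual decay function may differ:

```python
WEIGHTS = {
    "timeDecay": 0.25,
    "tagRelevance": 0.35,
    "contentRelevance": 0.15,
    "conversationRelevance": 0.25,
}

def score_memory(age_days, tag_rel, content_rel, convo_rel, half_life_days=30.0):
    # Assumed decay: a memory loses half its recency weight every half-life.
    time_decay = 0.5 ** (age_days / half_life_days)
    return (WEIGHTS["timeDecay"] * time_decay
            + WEIGHTS["tagRelevance"] * tag_rel
            + WEIGHTS["contentRelevance"] * content_rel
            + WEIGHTS["conversationRelevance"] * convo_rel)

# A fresh, well-tagged memory outranks a stale, loosely related one.
print(score_memory(age_days=1, tag_rel=0.9, content_rel=0.6, convo_rel=0.7))
print(score_memory(age_days=180, tag_rel=0.2, content_rel=0.3, convo_rel=0.1))
```

Because the weights sum to 1.0, scores stay in [0, 1] when each factor does, which makes a cutoff like `minRelevanceScore` straightforward to apply.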

Can I customize which memories are loaded?

Yes! Configure in claude-hooks/config.json:

{
  "memoryService": {
    "maxMemoriesPerSession": 8,
    "minRelevanceScore": 0.3
  },
  "memoryScoring": {
    "weights": {
      "timeDecay": 0.25,
      "tagRelevance": 0.35,
      "contentRelevance": 0.15,
      "conversationRelevance": 0.25
    }
  }
}

Troubleshooting

The service won't start

Common causes:

  1. Port conflict: Check if port 8443 is in use
    netstat -tlnp | grep 8443
  2. Missing dependencies: Reinstall with python install.py
  3. Python version: Requires Python 3.10+
    python --version
  4. Disk space: Ensure sufficient space for models and data

Search returns no results

Troubleshooting steps:

  1. Check if memories exist:
    curl -k "https://localhost:8443/api/memories/count"
  2. Try exact text search: Use quotes for exact matching
  3. Check tag search: search_by_tags(["your-tag"])
  4. Verify embedding model: Check if sentence transformers loaded correctly

"Database is locked" error

This typically happens with SQLite-vec under high load:

# Enable WAL mode for better concurrent access
sqlite3 your_db.db "PRAGMA journal_mode=WAL;"

# Check for hanging connections
lsof | grep your_db.db

# Restart service if needed
systemctl restart mcp-memory-service

Memory usage keeps growing

Solutions:

  1. Reduce cache sizes:
    export MCP_EMBEDDING_CACHE_SIZE=500
    export MCP_QUERY_CACHE_SIZE=50
  2. Trigger Python garbage collection as part of periodic maintenance:
    import gc
    gc.collect()  # add to a periodic maintenance task
  3. Implement memory cleanup:
    # Remove old temporary memories
    python scripts/cleanup_memories.py --older-than 30 --tag temporary

Claude Code hooks not working

Debug checklist:

  1. Verify hook installation:
    node claude-hooks/test-hooks.js
  2. Check Claude Code CLI version:
    claude --version  # Should be recent version
  3. Test memory service connection:
    curl -k https://localhost:8443/api/health
  4. Enable debug logging:
    {
      "logging": {
        "level": "debug",
        "enableDebug": true,
        "logToFile": true
      }
    }

Performance is slow

Optimization steps:

  1. Check backend choice: SQLite-vec is fastest for single users
  2. Optimize search queries: Use specific terms, not vague searches
  3. Limit result sizes: Don't retrieve more than you need
  4. Enable hardware acceleration: GPU/MPS if available
  5. Monitor resource usage:
    htop  # Check CPU and memory usage
    iotop  # Check disk I/O

Can't connect to remote instance

Network troubleshooting:

  1. Check firewall: Ensure port 8443 is open
  2. Verify HTTPS certificates: Use -k flag for self-signed
  3. Test connectivity:
    telnet your-server 8443
  4. Check DNS resolution:
    nslookup your-server

Import/Export issues

Common solutions:

  1. JSON format validation:
    python -m json.tool < export.json
  2. Check file permissions: Ensure read/write access
  3. Memory limits: Large imports may need increased memory
  4. Backup before import: Always backup existing data

Getting help

If you're still having issues:

  1. Check the Troubleshooting Guide
  2. Search existing GitHub issues
  3. Create a new issue with:
    • Operating system and Python version
    • Configuration files (remove sensitive data)
    • Error logs and stack traces
    • Steps to reproduce the problem

Best practices to avoid issues

  1. Regular maintenance: Weekly cleanup and optimization
  2. Monitor resources: Keep an eye on memory and disk usage
  3. Backup data: Regular exports of important memories
  4. Update regularly: Keep service and dependencies updated
  5. Use appropriate backends: Choose based on your use case