On-Device LLM + Model Context Protocol for Android
An open-source Android application that combines on-device LLM inference with MCP (Model Context Protocol) support for extensible AI tools.
Build a private, offline-capable AI assistant for Android that:
- Runs LLMs entirely on-device using llama.cpp
- Supports MCP protocol for extensible tools
- Includes built-in productivity tools (notes, calendar, etc.)
- Allows third-party MCP servers via Android IPC
The Pitch: "The only AI that knows your deepest thoughts, but tells no one."
- The Problem: People want AI insight into their mental health or personal life but are terrified of training the next GPT model with their private journals.
- The Use Case: You document anxiety, relationship struggles, or unfiltered opinions.
- The Query: "Have I been feeling more anxious lately?"
- The Magic: The app uses vector search to scan months of entries and synthesizes an answer: "You tend to express higher anxiety on Sunday nights, specifically regarding work deadlines, a pattern visible since October."
- Why it wins: Zero data exposure risk. It's a "Safe Space" in your pocket.
The Pitch: "Stop organizing. Just dump it here."
- The Problem: Notes apps are where ideas go to die because organizing them is friction.
- The Use Case: You dump raw, unstructured input (voice memos, screenshots, half-baked ideas, random URLs) into the app. No folders, no tags.
- The Query: "What was that idea I had about a coffee shop app?"
- The Magic: Vector search finds the semantic match in a voice note from 3 months ago, a screenshot from last week, and a text note from today, synthesizing them into a coherent project brief.
- Why it wins: The AI is the organization layer.
The Pitch: "Bring your own AI to workโwithout getting fired by IT."
- The Problem: Employees want AI help but are banned from pasting internal docs or proprietary code into ChatGPT.
- The Use Case: You load confidential internal PDFs, strategy docs, and proprietary code snippets into the app.
- The Query: "Summarize the Q3 risks from these 5 confidential reports."
- Why it wins: It bridges the gap: AI power, zero data leak. It's "Shadow IT" that is actually secure.
See the specification documents for detailed architecture:
| Document | Description |
|---|---|
| 00_PROJECT_OVERVIEW.md | Project overview and architecture |
| 01_UI_LAYER_SPEC.md | Jetpack Compose UI layer |
| 02_AGENT_ORCHESTRATION_SPEC.md | Agent/orchestration layer |
| 03_MCP_HOST_CLIENT_SPEC.md | MCP protocol implementation |
| 04_ON_DEVICE_LLM_SPEC.md | llama.cpp integration |
| 05_BUILTIN_MCP_SERVERS_SPEC.md | Notes and other built-in tools |
```
┌─────────────────────────────────────────────────┐
│               UI Layer (Compose)                │
├─────────────────────────────────────────────────┤
│            Agent/Orchestration Layer            │
├────────────────────┬────────────────────────────┤
│  MCP Host/Client   │ On-Device LLM (llama.cpp)  │
├────────────────────┴────────────────────────────┤
│       Built-in MCP Servers (Notes, etc.)        │
└─────────────────────────────────────────────────┘
```
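Conceptually, the agent/orchestration layer mediates between the Compose UI and the two boxes beneath it. A minimal Kotlin sketch of how the layers might compose, assuming hypothetical interface and class names (the real contracts live in the spec documents below):

```kotlin
// Illustrative layer contracts; actual definitions are in the spec docs.
interface LlmEngine {                        // wraps llama.cpp behind JNI
    suspend fun complete(prompt: String): String
}

data class ToolDescriptor(val name: String, val description: String)

interface McpHost {                          // fronts built-in and external MCP servers
    suspend fun listTools(): List<ToolDescriptor>
    suspend fun callTool(name: String, args: Map<String, Any?>): String
}

// The orchestration layer: asks the LLM what to do, routes tool calls
// through the MCP host, and returns the final answer to the UI.
class AgentOrchestrator(private val llm: LlmEngine, private val mcp: McpHost) {
    suspend fun answer(userQuery: String): String {
        val toolList = mcp.listTools()
            .joinToString("\n") { "${it.name}: ${it.description}" }
        // A real agent loop would parse tool calls from the response and
        // iterate; this sketch stops after a single completion.
        return llm.complete("Available tools:\n$toolList\n\nUser: $userQuery")
    }
}
```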
Different MCP servers use different transport mechanisms based on trust level and crash isolation requirements:
```
┌─────────────────────────────────────────────────────────────┐
│                       AndroidMCP App                        │
│ ┌─────────────────────────────────────────────────────────┐ │
│ │                    MCP Host / Client                    │ │
│ │ ┌─────────────────┐ ┌─────────────────────────────────┐ │ │
│ │ │   In-Process    │ │         stdio Transport         │ │ │
│ │ │   Transport     │ │ ┌───────────────────────────┐   │ │ │
│ │ │ ┌─────────────┐ │ │ │ Browser MCP Server        │   │ │ │
│ │ │ │ • pkb_*     │ │ │ │ (WebView wrapper)         │   │ │ │
│ │ │ │ • calendar  │ │ │ │ → separate process        │   │ │ │
│ │ │ │ • contacts  │ │ │ └───────────────────────────┘   │ │ │
│ │ │ │ (built-in,  │ │ │ ┌───────────────────────────┐   │ │ │
│ │ │ │  trusted)   │ │ │ │ 3rd-party servers         │   │ │ │
│ │ │ └─────────────┘ │ │ │ → separate process        │   │ │ │
│ │ │                 │ │ └───────────────────────────┘   │ │ │
│ │ └─────────────────┘ └─────────────────────────────────┘ │ │
│ └─────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────┘
```
| Server Type | Transport | Rationale |
|---|---|---|
| Built-in (PKB, Calendar) | In-Process | Trusted code, performance critical |
| Browser Automation | stdio | Crash isolation, memory isolation, WebView in separate process |
| 3rd-party MCP Servers | stdio / Unix Sockets | Untrusted code, MUST be isolated |
Why stdio for Browser/External servers? (A transport-selection sketch follows this list.)
- 🛡️ Crash Isolation - Browser/tool crashes don't kill the app
- 🧠 Memory Isolation - Heavy operations get a separate memory budget
- 🔌 Standard Protocol - Compatible with existing MCP servers
- 📱 Android Constraint - WebView already runs in a separate process
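In code, the mapping from the trust table might look like the following Kotlin sketch. The type and function names here are hypothetical; the actual transport implementations come from the MCP Kotlin SDK:

```kotlin
// Hypothetical transport selection mirroring the trust table above.
sealed interface McpTransport {
    /** Direct calls inside the app process: trusted built-ins, zero IPC cost. */
    data object InProcess : McpTransport

    /** Child process speaking MCP over stdin/stdout: crash and memory isolated. */
    data class Stdio(val command: List<String>) : McpTransport
}

enum class ServerTrust { BUILT_IN, ISOLATED }

fun transportFor(trust: ServerTrust, command: List<String> = emptyList()): McpTransport =
    when (trust) {
        // pkb_*, calendar, contacts: trusted code, performance critical.
        ServerTrust.BUILT_IN -> McpTransport.InProcess
        // Browser automation and 3rd-party servers MUST be isolated.
        ServerTrust.ISOLATED -> McpTransport.Stdio(command)
    }
```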
- Android Studio Hedgehog (2023.1.1) or later
- JDK 17 or later (bundled with Android Studio)
- Android SDK with API level 34
- Android NDK 26.1.10909125 (will be downloaded automatically)
- CMake 3.22.1 (will be downloaded automatically)
For the best development experience on macOS (especially Apple Silicon M1/M2/M3), run builds directly using Android Studio's bundled JDK:
```bash
# Set JAVA_HOME to Android Studio's bundled JDK (add to ~/.zshrc for persistence)
export JAVA_HOME="/Applications/Android Studio.app/Contents/jbr/Contents/Home"

# Verify SDK is found
echo $ANDROID_HOME  # Should show ~/Library/Android/sdk

# Build debug APK
./gradlew assembleDebug

# Run unit tests (~2-3 minutes on Apple Silicon)
./gradlew testDebugUnitTest

# Build release APK
./gradlew assembleRelease
```

Why local over Docker? On Apple Silicon (M1/M2/M3), Docker runs x86_64 images via QEMU emulation, making builds 5-10x slower. Native builds are significantly faster.
1. Open Project:

   ```bash
   # Open via terminal
   open -a "Android Studio" /path/to/android_llm_mcp

   # Or: File → Open → select the project folder
   ```

2. Wait for Gradle Sync:
   - Android Studio will automatically sync Gradle dependencies (2-3 min the first time)
   - NDK and CMake will be downloaded automatically if missing

3. Build the App:
   - Menu: Build → Make Project (or `Cmd+F9`)
   - The first build takes ~7 minutes (it includes the native llama.cpp compilation)
   - Subsequent builds are much faster (~30 seconds)

4. Run on Device/Emulator:
   - Select a device from the toolbar dropdown
   - Click the green ▶️ Run button (or `Ctrl+R`)
   - For LLM testing, use a physical device (recommended) or an ARM64 emulator

5. Run Unit Tests:
   - Right-click `app/src/test` → Run 'Tests in app'
   - Or: Menu Run → Run 'All Tests'
   - Expected: 740 tests passing
| Issue | Solution |
|---|---|
| "SDK not found" | File โ Project Structure โ SDK Location โ Set to ~/Library/Android/sdk |
| "NDK not found" | Wait for auto-download, or: SDK Manager โ SDK Tools โ NDK |
| "CMake not found" | Wait for auto-download, or: SDK Manager โ SDK Tools โ CMake |
| Submodule errors | Run git submodule update --init --recursive in terminal |
| Gradle sync failed | File โ Invalidate Caches โ Restart |
If you prefer Docker or need a consistent CI environment:
```bash
# Build the Docker image
docker build -t android-mcp-builder .

# Run tests via Docker (slower on Apple Silicon due to emulation)
docker run --rm -v "$PWD/app:/app/app" android-mcp-builder ./gradlew testDebugUnitTest --no-daemon

# Build APK via Docker
docker run --rm -v "$PWD/app:/app/app" android-mcp-builder ./gradlew assembleDebug --no-daemon
```

This project uses llama.cpp (pinned to release b4380) as a Git submodule for on-device LLM inference.
```bash
# Clone with submodules
git clone --recursive https://github.com/sureshsankaran/android_llm_mcp.git

# Or, if already cloned, initialize submodules
git submodule update --init --recursive

# Verify llama.cpp is at the correct version
cd app/src/main/cpp/llama.cpp
git describe --tags  # Should show b4380
```

Note: The build will fail with a clear error message if the submodule is not initialized.
```bash
# Build debug APK
./gradlew assembleDebug

# Build release APK
./gradlew assembleRelease

# Run unit tests
./gradlew test

# Run instrumented tests (requires a device/emulator)
./gradlew connectedAndroidTest
```

The project builds llama.cpp natively for:
- `arm64-v8a` (ARM64, primary target)
- `armeabi-v7a` (ARM32, fallback)
Key build features (see the Gradle sketch after this list):
- NEON SIMD optimizations enabled for ARM
- CPU-only inference (GPU backends disabled for broader compatibility)
- `c++_shared` STL for better compatibility
- Memory-mapped models for efficient loading
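In Gradle Kotlin DSL terms, the native build configuration is roughly the following. This is a sketch only: the exact CMake flag names depend on the pinned llama.cpp revision, and NEON is enabled by default for ARM64 targets.

```kotlin
// app/build.gradle.kts (sketch; flag names are indicative, not verbatim)
android {
    defaultConfig {
        ndk {
            // ARM64 primary target plus ARM32 fallback, as listed above.
            abiFilters += listOf("arm64-v8a", "armeabi-v7a")
        }
        externalNativeBuild {
            cmake {
                arguments += listOf(
                    "-DANDROID_STL=c++_shared", // shared STL for better compatibility
                    "-DGGML_VULKAN=OFF"         // CPU-only inference (assumed flag name)
                )
            }
        }
    }
    externalNativeBuild {
        cmake {
            path = file("src/main/cpp/CMakeLists.txt")
            version = "3.22.1"
        }
    }
}
```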
For running integration tests, place a GGUF model file in one of these locations:
# Internal storage
adb push your_model.gguf /data/data/com.androidmcp.debug/files/models/test_model.gguf
# External storage (if accessible)
adb push your_model.gguf /storage/emulated/0/Android/data/com.androidmcp.debug/files/test_model.ggufRecommended test models (Q4_K_M quantization):
- TinyLlama 1.1B (~700MB)
- Qwen2.5-0.5B (~350MB)
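An instrumented test can then probe both push locations, mirroring the adb commands above. A minimal sketch, with an illustrative function name:

```kotlin
import android.content.Context
import java.io.File

// Returns the first test model found in either push location, or null.
fun findTestModel(context: Context): File? =
    listOfNotNull(
        File(context.filesDir, "models/test_model.gguf"),
        context.getExternalFilesDir(null)?.let { File(it, "test_model.gguf") },
    ).firstOrNull { it.exists() }
```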
- UI: Jetpack Compose + Material 3
- LLM: llama.cpp via JNI
- MCP: MCP Kotlin SDK + AIDL for cross-app
- Storage: Room Database
- DI: Hilt
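For the "llama.cpp via JNI" piece, the Kotlin side of the bridge plausibly looks like the sketch below. The class, method, and library names are assumptions; the real binding is specified in 04_ON_DEVICE_LLM_SPEC.md.

```kotlin
// Hypothetical JNI surface over llama.cpp; the native implementations
// are compiled by the CMake build described above.
object LlamaBridge {
    init {
        System.loadLibrary("llama-android") // assumed native library name
    }

    /** Loads a GGUF model (memory-mapped) and returns an opaque native handle. */
    external fun loadModel(path: String, contextSize: Int): Long

    /** Runs generation against a loaded model handle. */
    external fun generate(handle: Long, prompt: String, maxTokens: Int): String

    /** Releases the native model and its memory mapping. */
    external fun freeModel(handle: Long)
}
```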
| Model | Size | RAM Needed |
|---|---|---|
| Llama 3.2 1B | ~1GB | ~2GB |
| Llama 3.2 3B | ~2GB | ~4GB |
| Phi-3 Mini | ~2.5GB | ~4GB |
| Qwen2.5 3B | ~2GB | ~4GB |
Phase: Specification/Design
- Architecture design
- Layer specifications
- Project scaffolding
- Core implementation
- MVP release
| Area | Investigation | Rationale |
|---|---|---|
| Vector DB | Keep `androidx.sqlite` + `sqlite-vec` | Current approach is solid for on-device vector search |
| LLM Runtime | Prototype MediaPipe LLM Inference | May improve NPU utilization and battery life vs. llama.cpp |
| AICore / Gemini Nano | Add a startup check for AICore availability | If Gemini Nano is available, use it; this instantly solves the "Model Delivery" and "Battery" risks (no PAD download, optimized for the device) |
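The AICore row could start as a simple package probe at startup. A minimal sketch; the AICore package name below is an assumption to verify against current Android documentation:

```kotlin
import android.content.Context
import android.content.pm.PackageManager

// Assumed package name for the on-device AICore service.
private const val AICORE_PACKAGE = "com.google.android.aicore"

/** Returns true if AICore is installed; otherwise fall back to llama.cpp. */
fun isAiCoreInstalled(context: Context): Boolean =
    try {
        context.packageManager.getPackageInfo(AICORE_PACKAGE, 0)
        true
    } catch (e: PackageManager.NameNotFoundException) {
        false
    }
```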
MIT License
- RikkaHub - Android MCP chat client
- android-mcp-sdk - Android MCP SDK
- llama.cpp - LLM inference
- MCP Kotlin SDK - Official MCP SDK