Android QA Agent

A record-and-replay QA tool for Android. Describe a test scenario in natural language, and Claude Code executes it on an Android device via ADB — transparently recording every command. Recordings can be replayed at configurable speed, with screenshot verification powered by Claude.

_{Open the contacts app, add a new contact with name John Doe and phone number +001 1234 5678}

Prerequisites

Python 3.8+
adb on your PATH (or set ANDROID_HOME)
A running Android emulator or connected device
Claude Code

Getting Started

Clone the repository:

git clone git@github.com:tobrun/android-qa-agent.git

Open Claude Code from the project directory:

cd android-qa-agent && claude

Prompt Claude Code with a QA scenario. The more detail you provide, the better the test output:

open the settings app and toggle dark mode

Claude will:

Start a recording session, named after the scenario
Take UI dumps to find accurate tap coordinates
Execute each step via android-qa, an ADB gateway that logs every command
Claude returns execution and saves the recording to the recordings/ directory

Claude is generally good at reasoning about the UI and deciding next steps. Providing project-specific instructions in CLAUDE.md helps guide it more efficiently.

Replay and Verification

./android-qa-replay my-session --speed 5 --verify

Replay preserves relative timing between commands, compressed by the speed factor, and aborts on the first command failure.

The --verify flag runs Claude in headless mode (claude -p) to compare the final screenshot from replay against the one captured during the original recording. This approach tolerates minor visual differences — such as the system clock or transient UI elements — that are unrelated to the test case.

Project Structure

android-qa-agent/
├── android-qa              # ADB wrapper that records commands (Python)
├── android-qa-replay       # Replay tool with optional verification (Python)
├── start-recording         # Session start script
├── stop-recording          # Session stop script
├── recordings/             # Finalized recordings, per session (committable)
│   └── <session>/
│       ├── recording.json  # Commands and metadata (incl. original prompt)
│       └── golden.png      # Last screenshot from the recording
├── artifacts/              # Screenshots and UI dumps (per session)
├── CLAUDE.md               # Claude Code project instructions
├── .claude/
│   ├── settings.json       # Stop hook for auto-finalizing sessions
│   └── skills/             # Claude Code skills
└── .android-qa/            # Runtime state
    └── active-session.json # Lock file (exists only during recording)

Limitations

Streaming ADB commands (logcat, shell top, etc.) are not supported.
Single-threaded: one android-qa invocation at a time.
No stdout/stderr capture — only the command and its metadata are recorded.

Roadmap

Multi-device support (current MVP works only on a single ADB device)
Multi-step verification — verify screenshots at every step, not just the final result
App state cleanup — start each recording and replay from a known fresh state
UI dump verification alongside screenshot comparison
Complex touch gestures — rotate, pinch, double-tap, and other multi-touch input
3D content testing — GLSurfaceView/TextureView content is excluded from UI dumps, navigating purely by screenshot has limitations (incorrectly calculating touch points)
Support for capturing performance metrics through adb dumpsys

Contributions are welcome — feel free to open a PR!

License

This project is licensed under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.claude		.claude
example		example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
android-qa		android-qa
android-qa-replay		android-qa-replay
start-recording		start-recording
stop-recording		stop-recording

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Android QA Agent

Prerequisites

Getting Started

Replay and Verification

Project Structure

Limitations

Roadmap

License

About

Uh oh!

Releases 1

Packages

Languages

License

tobrun/android-qa-agent

Folders and files

Latest commit

History

Repository files navigation

Android QA Agent

Prerequisites

Getting Started

Replay and Verification

Project Structure

Limitations

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages