Ever been in a meeting where you wanted to focus on your work but still catch the important bits? That's exactly why I created Recap.
I found myself constantly torn between paying attention to meetings and getting actual work done. Sometimes I'd miss crucial decisions while coding, or I'd lose my flow state trying to take notes. I needed something that could listen for me and give me the highlights afterward.
But here's the thing - I didn't want my private conversations floating around on some company's servers. When you're discussing sensitive business matters, product roadmaps, or personal topics, that data should stay on YOUR machine. That's why Recap processes everything locally on your Mac using Apple's own technologies.
Right now, Recap is broken. But it is a project I keep working on in my free time so that it meets my needs. I hope it can help you too.
> [!IMPORTANT]
> Recap is not complete yet, parts of it are broken, and it is not recommended for production usage.

> [!TIP]
> I don't suggest relying on Recap for daily use yet, but help in shaping it is greatly appreciated.
Recap is an open-source, privacy-focused, macOS-native project that helps you summarize your meetings. You can summarize the audio of any app, not just meetings.
There is no linter set up yet, but I plan to adopt SwiftLint/SwiftFormat in the future (PRs greatly welcomed).
Built using native macOS technologies - no drivers or kernel extensions required.
- Core Audio: Native Core Audio taps, AVAudioEngine, driver-free system audio capture
- ML: WhisperKit (local transcription), Ollama/OpenRouter (summarization)
- Platform: Swift + SwiftUI, Apple Silicon optimized, sandboxed execution
Working on the following features now:
- Meeting Detection (Teams, Zoom, Google Meet, etc.)
- Custom Prompt Via Settings
- Background Audio Processing
- Auto Recording Stop
- Better Error Handling
- 85% or more test coverage
Right now, Recap is more of a proof of concept of what I am trying to build. It records system audio (via Core Audio taps), optionally together with your microphone, feeds the audio to Whisper for transcription, and then uses Ollama for summarization; a minimal sketch of this pipeline follows the provider options below.
LLM Provider Options:
- Ollama (recommended): Complete privacy - everything stays on your device
- OpenRouter: Cloud-based option if you lack local compute capacity, but data leaves your device
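To make the pipeline concrete, here is a minimal sketch of the transcribe-then-summarize step. It assumes WhisperKit's `transcribe(audioPath:)` API (whose return type varies across versions) and Ollama's local `/api/generate` HTTP endpoint; the model names and prompt are placeholders, not Recap's actual implementation.

```swift
import Foundation
import WhisperKit

/// Illustrative sketch only: transcribe locally with WhisperKit, then summarize
/// through Ollama's local HTTP API. Model names and the prompt are placeholders.
func summarizeRecording(at audioURL: URL) async throws -> String {
    // 1. Local transcription; WhisperKit loads (and downloads, if needed) the model.
    let whisper = try await WhisperKit(model: "large-v3")
    let results = try await whisper.transcribe(audioPath: audioURL.path)
    let transcript = results.map(\.text).joined(separator: " ")

    // 2. Local summarization via Ollama; nothing leaves the machine.
    var request = URLRequest(url: URL(string: "http://localhost:11434/api/generate")!)
    request.httpMethod = "POST"
    request.setValue("application/json", forHTTPHeaderField: "Content-Type")
    request.httpBody = try JSONSerialization.data(withJSONObject: [
        "model": "llama3.1",
        "prompt": "Summarize this meeting transcript:\n\(transcript)",
        "stream": false
    ] as [String: Any])

    let (data, _) = try await URLSession.shared.data(for: request)
    let json = try JSONSerialization.jsonObject(with: data) as? [String: Any]
    return json?["response"] as? String ?? ""
}
```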
Local summarization (Ollama):

| Component | Minimum | Recommended |
|---|---|---|
| macOS | 15.0 or later | 15.0 or later |
| Processor | Apple M1 | Apple M2 Pro or newer |
| RAM | 16 GB | 32 GB or more |
| Storage | 10 GB free space | 50 GB free space |
Cloud summarization (OpenRouter):

| Component | Minimum | Recommended |
|---|---|---|
| macOS | 15.0 or later | 15.0 or later |
| Processor | Apple M1 | Apple M2 or newer |
| RAM | 8 GB | 16 GB or more |
| Storage | 2 GB free space | 5 GB free space |
Note: Intel Macs are not supported. Apple Silicon required for WhisperKit transcription.
DATA FLOW

```
┌────────────────────────┐
│ 1. App Start           │
└───────────┬────────────┘
            ▼
┌────────────────────────┐
│ 2. App Select          │
└───────────┬────────────┘
            ▼
┌────────────────────────┐
│ 3. Start Recording     │
│   • System Audio       │
│   • Microphone (opt)   │
└───────────┬────────────┘
            ▼
┌────────────────────────┐
│ 4. Stop Recording      │
│   • Save Audio Files   │
│   • Create DB Record   │
└───────────┬────────────┘
            ▼
┌────────────────────────┐
│ 5. Background Process  │
│   • Transcribe Audio   │
│   • Generate Summary   │
└───────────┬────────────┘
            ▼
┌────────────────────────┐
│ 6. Complete & Store    │
│   • Update DB Record   │
│   • Show Results       │
└────────────────────────┘
```
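The numbered stages above map naturally onto a small state model. The following is purely illustrative (the type and case names are hypothetical, not Recap's actual code), just to show how the flow can be represented:

```swift
import Foundation

/// Hypothetical state model mirroring the data flow above; not Recap's actual types.
enum RecordingSessionState {
    case idle                          // 1-2. app started, target app selected
    case recording(startedAt: Date)    // 3. capturing system audio (+ optional mic)
    case processing                    // 5. transcribing and summarizing in the background
    case completed(summary: String)    // 6. results stored and shown
    case failed(Error)
}

struct RecordingSession {
    var state: RecordingSessionState = .idle

    mutating func stopRecording() {
        // 4. stop capture, persist the audio files and a DB record,
        //    then hand off to the background processing stage.
        state = .processing
    }
}
```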
Currently, Recap can only be built from source. Pre-built releases will be available once core features are stabilized.
- Prerequisites:
  - Ensure your system meets the requirements above
  - Install Xcode 15.0 or later from the Mac App Store
- Clone and Build:
  ```bash
  git clone https://github.com/your-username/recap.git
  cd recap
  open Recap.xcodeproj
  ```
- Build in Xcode:
  - Select your development team in project settings
  - Build and run (⌘+R)
Note: Distribution via Mac App Store and direct download will be available in future releases once the app reaches production readiness.
Before using Recap, you need to set up the following environment variables:
- `HF_TOKEN` (Required): Hugging Face token for downloading Whisper models
  ```bash
  export HF_TOKEN="your_huggingface_token_here"
  ```
- `OPENROUTER_API_KEY` (Optional): Only required if using OpenRouter for summarization
  ```bash
  export OPENROUTER_API_KEY="your_openrouter_api_key_here"
  ```
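Inside the app, these values can be read with `ProcessInfo`. Below is a minimal sketch (the error handling is illustrative); note that when launching via Xcode, the variables must also be added under Scheme → Run → Environment Variables:

```swift
import Foundation

/// Minimal sketch: read the tokens from the process environment.
/// When launching via Xcode, set them in the scheme's Environment Variables.
func loadTokens() -> (hfToken: String, openRouterKey: String?) {
    let env = ProcessInfo.processInfo.environment
    guard let hfToken = env["HF_TOKEN"], !hfToken.isEmpty else {
        fatalError("HF_TOKEN is required to download Whisper models")
    }
    return (hfToken, env["OPENROUTER_API_KEY"])  // OpenRouter key is optional
}
```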
- Download Whisper Model:
  - Open Recap and go to Settings → Whisper Models
  - Download a Whisper model (recommended: Large v3 for best accuracy)
  - Wait for the download to complete before proceeding
- Configure LLM Provider:
  - Go to Settings → LLM Models
  - Choose your preferred provider (Ollama or OpenRouter)
  - If using Ollama, ensure it's installed and running locally
- Start Recording:
  - Select an audio application from the dropdown
  - Click the record button to start capturing
  - Optionally enable microphone for dual-audio recording
  - Click stop when finished - processing will begin automatically
Recap is built using native macOS technologies, avoiding third-party drivers and kernel extensions for maximum system stability and security.
- Native Core Audio: Direct integration with macOS audio subsystem
- Audio Unit Taps: System-level audio interception without drivers (see the sketch after this list)
- AVAudioEngine: Modern Swift audio processing pipeline
- No Kernel Extensions: Driver-free audio capture using Apple's official APIs
- Process-Specific Capture: Target individual applications without affecting system audio
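For the curious, this is roughly what driver-free, process-specific capture looks like with the macOS 14.4+ Core Audio tap APIs (`CATapDescription`, `AudioHardwareCreateProcessTap`). This is a reduced sketch, not Recap's actual implementation, and it omits the aggregate-device and IO-proc plumbing needed to actually read audio from the tap:

```swift
import CoreAudio
import Foundation

/// Reduced sketch of creating a process-specific audio tap (macOS 14.4+ APIs).
/// Reading audio from the tap (aggregate device + IO proc) is omitted.
func makeProcessTap(for pid: pid_t) -> AudioObjectID? {
    // 1. Translate the target app's PID into a Core Audio process object.
    var processObject = AudioObjectID(kAudioObjectUnknown)
    var pidQualifier = pid
    var address = AudioObjectPropertyAddress(
        mSelector: kAudioHardwarePropertyTranslatePIDToProcessObject,
        mScope: kAudioObjectPropertyScopeGlobal,
        mElement: kAudioObjectPropertyElementMain
    )
    var size = UInt32(MemoryLayout<AudioObjectID>.size)
    guard AudioObjectGetPropertyData(
        AudioObjectID(kAudioObjectSystemObject), &address,
        UInt32(MemoryLayout<pid_t>.size), &pidQualifier,
        &size, &processObject
    ) == noErr else { return nil }

    // 2. Describe a tap that mixes that process's output down to stereo,
    //    leaving the user's playback audible.
    let description = CATapDescription(stereoMixdownOfProcesses: [NSNumber(value: processObject)])
    description.muteBehavior = .unmuted

    // 3. Create the tap; the returned AudioObjectID is later wrapped in an
    //    aggregate device so an IO proc can pull buffers from it.
    var tapID = AudioObjectID(kAudioObjectUnknown)
    let status = AudioHardwareCreateProcessTap(description, &tapID)
    return status == noErr ? tapID : nil
}
```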
- Language: Swift 5.9+
- UI Framework: SwiftUI + AppKit (menu bar integration)
- Audio Processing: Core Audio, AVAudioEngine, Audio Toolbox
- Machine Learning:
- WhisperKit (local speech-to-text)
- MLX/CoreML optimization for Apple Silicon
- LLM Integration:
- Ollama (local inference)
- OpenRouter (cloud API)
- Data Storage: Core Data + local file system
- Concurrency: Swift async/await, structured concurrency
- Architecture: MVVM, Coordinator pattern, Dependency injection
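To illustrate the MVVM + dependency-injection shape listed above (the protocol and class names below are hypothetical, not Recap's actual types):

```swift
import Foundation
import SwiftUI

// Hypothetical types, shown only to illustrate the MVVM + dependency-injection shape.
protocol TranscriptionService {
    func transcribe(_ audioURL: URL) async throws -> String
}

@MainActor
final class RecordingViewModel: ObservableObject {
    @Published var transcript = ""

    private let transcription: TranscriptionService

    // The service is injected, so tests can supply a stub implementation.
    init(transcription: TranscriptionService) {
        self.transcription = transcription
    }

    func finishRecording(audioURL: URL) async {
        transcript = (try? await transcription.transcribe(audioURL)) ?? ""
    }
}
```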
- Sandboxed Execution: Runs within macOS security boundaries
- No System Modifications: Zero kernel-level changes required
- Permission-Based: Uses standard macOS audio permissions (see the sketch after this list)
- Local Processing: WhisperKit keeps transcription on-device
- Secure Audio Taps: Apple's blessed method for audio interception
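As one concrete example, the optional microphone capture goes through the standard AVFoundation permission flow (this is stock Apple API usage, not Recap-specific code):

```swift
import AVFoundation

/// Standard macOS microphone permission check; the app's Info.plist must
/// include an NSMicrophoneUsageDescription entry for the prompt to appear.
func ensureMicrophoneAccess() async -> Bool {
    switch AVCaptureDevice.authorizationStatus(for: .audio) {
    case .authorized:
        return true
    case .notDetermined:
        return await AVCaptureDevice.requestAccess(for: .audio)
    default:
        return false  // denied or restricted
    }
}
```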
I really need help finishing Recap! Any contribution is greatly welcomed.
Critical needs:
- See Roadmap above for current focus areas
Also needed:
- Meeting app detection improvements
- Custom prompt templates via settings
- Performance optimizations
- Documentation improvements
- Bug fixes and stability improvements
- Fork the repository and create a feature branch
- Check existing issues or create new ones for bugs/features
- Follow the coding patterns established in the codebase
- Test your changes thoroughly on Apple Silicon Macs
- Submit a pull request with clear description of changes
All skill levels welcome - from fixing typos to architecting new features. Let's build something great together. I really mean it!
This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.
What this means:
- ✅ Free personal use - Use, modify, and share for personal projects
- ✅ Open source contributions - Fork, improve, and contribute back
- ✅ Educational use - Perfect for learning and teaching
- ❌ Commercial redistribution - Cannot resell or distribute commercially without permission
- ❌ App Store uploads - Cannot publish to app stores for profit
If you want to use Recap commercially or have questions about licensing, please reach out to discuss options.
See the LICENSE file for full details.