A comprehensive video transcription tool with multiple LLM provider support and automatic journal integration.
- Video Transcription: Converts videos to text using parakeet-mlx
- Triple LLM Support: Choose between local LM Studio, Anthropic Claude Sonnet, or OpenRouter (Kimi K2)
- Instruction-Based Diff Enhancement: LLM provides specific edit instructions instead of full rewrites
- Model Testing: Test your LLM connection before processing videos
- Brief Titles: Auto-generated descriptive titles (8 words max) for each transcript
- Auto-Journal: Automatically saves enhanced transcripts to daily journals
- Batch Processing: Handle multiple videos at once
- Chronological Processing: Automatically sorts videos by recording time
pip install gradio anthropic requestsCreate a secrets.json file in the same directory as the script:
{
"anthropic_api_key": "your-anthropic-api-key-here",
"openrouter_api_key": "your-openrouter-api-key-here",
"comment": "Add your API keys here. This file should be kept private and not shared."
}- Sign up at Anthropic
- Create an API key
- Add it to
secrets.jsonasanthropic_api_key
- Sign up at OpenRouter
- Create an API key
- Add it to
secrets.jsonasopenrouter_api_key
- Install LM Studio
- Load a model
- Start the local server (usually http://localhost:1234)
python GradioTranscribeToJournal.pyThe web interface will open at http://localhost:7868
- Choose LLM Provider: Select between local, anthropic, or openrouter
- Test Connection: Use the "Test Model Connection" button to verify setup
- Upload Videos: Either drag & drop files or paste file paths
- Process: Click "Process Videos" to transcribe and enhance
- Anthropic: claude-sonnet-4-20250514
- OpenRouter: moonshot-v1-8k (Kimi K2)
- Local: Your choice of model in LM Studio
- API keys are stored in
secrets.json(not committed to git) - The
.gitignorefile prevents accidental commits of secrets - Never share your
secrets.jsonfile
AudioTools/
├── GradioTranscribeToJournal.py # Main application
├── secrets.json # API keys (create this)
├── .gitignore # Prevents committing secrets
└── README.md # This file