A ready-to-use, minimal app that converts any speech into text.
Updated Jul 5, 2024 - JavaScript
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
Online, front-end frequency analysis for music transcription.
VOXD is speech-to-text, voice-typing, and dictation software for Linux distributions. It is open-source, free-of-charge, user-friendly software intended for as many Linux distros as possible.
OpenAI/ChatGPT library for Java - Requires JDK 11 at minimum.
A practical guide for non-technical users on how to use OpenAI's Whisper for transcription and translation.
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI Whisper. CPU-only, no GPU required, privacy-focused with local processing.
Explores automatic music transcription (AMT) from the perspective of timbre.
Automatically generate accurate, per-word video captions with timestamps using Whisper ASR and FFmpeg, perfect for YouTube, social media, and accessibility.
🚀📜 Customized For Agentic AI: Enhanced the Whisper Assistant extension with improved setup scripts and documentation, ensuring seamless integration and functionality on Linux platforms.
Offline, privacy-first screen recorder with local AI transcription and smart summaries. Built with Electron, React, and TypeScript—capture desktop video, auto-generate transcripts, and get instant AI-powered meeting and lesson insights, all cross-platform and fully customizable.
A Python script that automatically generates subtitles for YouTube videos.
One-command audio transcription from any video platform. Transform video URLs into text transcripts instantly with automatic audio download, AI transcription, and clipboard integration. Perfect for content creators, researchers, students, and anyone who needs quick video-to-text conversion.
🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language support, batch processing, GPU acceleration. Generate SRT/WebVTT subtitles instantly!
Flick is a powerful AI-driven SaaS platform for real-time video sharing and collaboration, crafted for both web and desktop environments. Designed for seamless video recording, streaming, and sharing without third-party dependencies, Flick offers teams and individuals an integrated workspace to create, manage, and share video content in real-time.
An offline video and audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps, saving hours of time and boosting productivity.
AI MOM is an AI-powered meeting intelligence platform that delivers real-time transcription, speaker recognition, and multi-LLM summaries using FastAPI, Whisper, Groq, and OpenRouter for intelligent meeting insights.
AI-powered assistant that automatically joins Google Meet sessions, transcribes conversations in real-time, and cleans transcripts using OpenAI, Google Gemini, and Forefront APIs.
🤖 Automatic transcription of historical Italian documents with AI. Uses Google Gemini to digitize death certificates from the 1800s, extracting structured data and exporting it to Excel for genealogical and historical research.
A Python application that converts MP4 videos to MP3 audio and transcribes the audio to text using OpenAI's Whisper API. Features a modern, user-friendly interface built with ttkbootstrap.