Real-time multilingual speech-to-text and translation system with GPT-4o, featuring floating subtitles and keyword-enhanced transcription.
Updated Jun 10, 2025 - Python
Uses gpt-4o-transcribe combined with gpt-4o for real-time multilingual subtitle translation.
This tool downloads OpenAI's open-source Whisper model and uses it to transcribe audio files. It writes the results as timestamped text files or standard subtitle files that other software can read.
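As a rough illustration of the subtitle-file output described above, here is a minimal sketch that turns transcription segments into SubRip (SRT) text. It is not taken from the repository: the function names `format_timestamp` and `segments_to_srt` are hypothetical, and the input is assumed to be a list of dicts with `start`, `end` (seconds) and `text` keys, which is the shape of the `segments` list returned by the open-source `whisper` package's `transcribe` call.

```python
from typing import Iterable, Mapping


def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Build SRT text from segments.

    Assumption: each segment is a mapping with 'start', 'end' (seconds)
    and 'text' keys, as in Whisper's transcription result.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        # Each SRT cue: sequence number, time range, then the text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello, world."},
        {"start": 2.5, "end": 5.0, "text": " This is a test."},
    ]
    print(segments_to_srt(demo))
```

The resulting string can be written to a `.srt` file with UTF-8 encoding; a timestamped plain-text export could reuse `format_timestamp` and drop the sequence numbers.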