Whisper.Playground.mp4
- Stream audio using web sockets over the current approach of incrementally sending audio chunks
- Implement diarization (speaker identification) using
pyannote-audio
(example)
- Whisper requires the command-line tool
ffmpeg
andportaudio
to be installed on your system, which is available from most package managers:
# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg
sudo apt install portaudio19-dev
# on Arch Linux
sudo pacman -S ffmpeg
sudo pacman -S portaudio
# on MacOS using Homebrew (https://brew.sh/)
brew install ffmpeg
brew install portaudio
# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg
# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg
- Clone or fork this repository
- Install the backend and frontend environmet
sh install_playground.sh
- Run the backend
cd backend && source venv/bin/activate && flask run --port 8000
- In a different terminal, run the React frontend
cd interface && yarn start
This repository and the code and model weights of Whisper are released under the MIT License.