Clip transcriber that that converts audio or video clips into written text. It is commonly used for tasks such as transcribing interviews, lectures, or recordings.
This project uses pip for package management.
To run this project, you will need to install the following dependencies:
- transformers
- torch
- datasets
- accelerate
- librosa
- pytube
- moviepy
- pyannote.audio
- speechbox==0.2.1
- jupyter
You can install these dependencies by running the following command:
pip install -r requirements.txt
jupyter notebook
Contributions are welcome! If you have any suggestions, bug reports, or feature requests, please open an issue on the GitHub repository.
This project is licensed under the MIT License. See the LICENSE file for more information.