Video Whisper

An experimental set of Google Colab notebooks for semantic segmentation of videos.

The notebooks implement the following steps:

text extraction (transcription) from a YouTube video with the 'openai/whisper-large-v3' ASR model
topic modeling of the transcribed text with BERTopic and HDBSCAN clustering
video scene segmentation with PySceneDetect
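
The three steps above can be sketched roughly as follows. This is a minimal illustration and not the notebooks' actual code: the function names, the 30-second chunk length, and the `min_cluster_size` default are assumptions, and the audio/video is assumed to already be available as a local file (downloading from YouTube is not shown). The heavy libraries are imported inside the functions so the helpers can be read and reused independently.

```python
from typing import Any, Dict, List, Tuple

Chunk = Dict[str, Any]  # {"timestamp": (start_s, end_s), "text": str}


def transcribe(audio_path: str) -> List[Chunk]:
    """Transcribe a local audio file with Whisper and return timestamped chunks."""
    import torch
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model="openai/whisper-large-v3",
        chunk_length_s=30,  # assumed window size for long-form audio
        device=0 if torch.cuda.is_available() else -1,
    )
    # With return_timestamps=True the result contains a "chunks" list.
    return asr(audio_path, return_timestamps=True)["chunks"]


def chunks_to_docs(chunks: List[Chunk]) -> List[str]:
    """Hypothetical helper: keep the non-empty chunk texts as documents."""
    return [c["text"].strip() for c in chunks if c["text"].strip()]


def cluster_topics(docs: List[str], min_cluster_size: int = 5) -> List[int]:
    """Assign a topic id to each document; -1 marks HDBSCAN outliers."""
    from bertopic import BERTopic
    from hdbscan import HDBSCAN

    hdbscan_model = HDBSCAN(
        min_cluster_size=min_cluster_size,  # assumed value, tune per video
        metric="euclidean",
        cluster_selection_method="eom",
        prediction_data=True,
    )
    topic_model = BERTopic(hdbscan_model=hdbscan_model)
    topics, _ = topic_model.fit_transform(docs)
    return list(topics)


def detect_scenes(video_path: str) -> List[Tuple[float, float]]:
    """Return (start_s, end_s) scene boundaries found by PySceneDetect."""
    from scenedetect import ContentDetector, detect

    scenes = detect(video_path, ContentDetector())
    return [(start.get_seconds(), end.get_seconds()) for start, end in scenes]
```

A typical run would call `transcribe`, pass the result through `chunks_to_docs` into `cluster_topics`, and call `detect_scenes` on the video file separately. Very short transcripts may not contain enough documents for BERTopic to form clusters.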

Run the notebooks in Google Colab with a GPU runtime.

Video timeline visualization with website text:
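
One possible way to prepare data for such a timeline is to attach a topic and a scene index to every transcript span. The sketch below is an assumption about how this could be done, not the notebooks' implementation: `build_timeline` and its helpers are hypothetical names, spans and scenes are plain `(start, end)` tuples in seconds, and each span is assigned to the scene it overlaps most.

```python
from typing import Dict, List, Optional, Tuple

Span = Tuple[float, float]  # (start_s, end_s)


def overlap(a: Span, b: Span) -> float:
    """Length in seconds of the intersection of two spans (0.0 if disjoint)."""
    return max(0.0, min(a[1], b[1]) - max(a[0], b[0]))


def scene_for_span(span: Span, scenes: List[Span]) -> Optional[int]:
    """Index of the scene with the largest overlap, or None if none overlaps."""
    best, best_overlap = None, 0.0
    for i, scene in enumerate(scenes):
        current = overlap(span, scene)
        if current > best_overlap:
            best, best_overlap = i, current
    return best


def build_timeline(
    spans: List[Span], topics: List[int], scenes: List[Span]
) -> List[Dict[str, object]]:
    """Combine transcript spans, topic ids and scene boundaries into rows."""
    return [
        {
            "start": span[0],
            "end": span[1],
            "topic": topic,
            "scene": scene_for_span(span, scenes),
        }
        for span, topic in zip(spans, topics)
    ]


# Example with made-up values.
rows = build_timeline(
    spans=[(1.0, 4.0), (8.0, 14.0), (30.0, 31.0)],
    topics=[0, 1, -1],
    scenes=[(0.0, 10.0), (10.0, 25.0)],
)
```

The resulting rows can be fed to any plotting library. The sketch assumes every span has both a start and an end time, so spans with a missing end timestamp would need to be filled in or dropped first.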
