optimise-my-whisper

A very quick repo to showcase how you can experiment with Whisper (via Transformers) and speed it up for your own use cases!

All the experiments were run on a free Google Colab T4! 🔥

Results and Colab notebooks below:

Method	Time to Transcribe
fp16	62s
fp16 + SDPA	60s
fp16 + SDPA + Speculative Decoding	37.9s
fp16 + SDPA + Chunking + Speculative Decoding	43.8s
Distil-whisper + fp16 + SDPA + Chunking	17.2s
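
For orientation, here is a rough sketch of how the rows above could map onto Transformers `pipeline` settings. It is not copied from the notebooks. The checkpoint names, chunk lengths, and batch sizes are assumptions, and `build_config`, `load_pipeline`, and `timed` are hypothetical helpers made up for this example.

```python
import time

# Assumed checkpoints: the results table does not name the exact models.
BASE_MODEL = "openai/whisper-large-v2"
DISTIL_MODEL = "distil-whisper/distil-large-v2"


def build_config(sdpa=False, chunking=False, speculative=False, distil=False):
    """Return plain-dict settings for one row of the table (hypothetical helper)."""
    config = {
        "model": DISTIL_MODEL if distil else BASE_MODEL,
        "dtype": "float16",  # every row in the table uses fp16
        "model_kwargs": {"attn_implementation": "sdpa"} if sdpa else {},
        "call_kwargs": {},
        "assistant": DISTIL_MODEL if speculative else None,
    }
    if chunking:
        # Assumed values. Assisted generation has been limited to batch size 1,
        # so chunks cannot be batched when speculative decoding is on.
        config["call_kwargs"] = {
            "chunk_length_s": 15 if distil else 30,
            "batch_size": 1 if speculative else 8,
        }
    return config


def load_pipeline(config, device="cuda:0"):
    """Build the ASR pipeline; needs torch, transformers, and a GPU."""
    import torch
    from transformers import AutoModelForCausalLM, pipeline

    dtype = getattr(torch, config["dtype"])
    kwargs = {
        "model": config["model"],
        "torch_dtype": dtype,
        "device": device,
        "model_kwargs": config["model_kwargs"],
    }
    if config["assistant"]:
        # Speculative decoding: a small draft model proposes tokens and the
        # main model verifies them.
        assistant = AutoModelForCausalLM.from_pretrained(
            config["assistant"], torch_dtype=dtype
        )
        assistant.to(device)
        kwargs["generate_kwargs"] = {"assistant_model": assistant}
    return pipeline("automatic-speech-recognition", **kwargs)


def timed(fn, *args, **kwargs):
    """Call fn and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start
```

To try a row, something like `cfg = build_config(sdpa=True, chunking=True, distil=True)`, then `pipe = load_pipeline(cfg)` and `out, secs = timed(pipe, "audio.mp3", **cfg["call_kwargs"])` should work, where `audio.mp3` is a placeholder for your own file. Expect timings to differ from the table depending on the audio and the GPU.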

LICENSE
README.md
insanely-fast-whisper-slides.pdf
insanely_fast_whisper_distil_fp16_sdpa_chunking.ipynb
insanely_fast_whisper_fp16.ipynb
insanely_fast_whisper_fp16_sdpa.ipynb
insanely_fast_whisper_fp16_sdpa_spec_dec.ipynb
insanely_fast_whisper_fp16_sdpa_spec_dec_chunking.ipynb
