Skip to content

Resulting .srt file loses synch with audio file used for transcription (Google Colab notebook) #288

Open
@BlohoJo

Description

Hi! 👋😀

I'm having a problem where the audio I uploaded for Whisper-Diarization loses sync with the srt subtitle file that it generates.

Here's the audio file I'm working with. It's a 16 bit WAV file.

https://drive.google.com/file/d/1aUvTx7dZcfQy7ZdlRTULPEQrl8WHmXyO/

I have left all options in the Google Colab notebook at their defaults (other than the audio file name).

The resulting SRT file loses synch for me at line 31, and continues to get worse from there. You can observe this in Subtitle Edit. (Open the generated .srt file and drag the audio file onto the waveform.)

(Not sure if this is the right place to report this?)

Just for reference, my methodology is:

from google.colab import drive
drive.mount('/content/drive')
  • Change Runtime Type, Select High-RAM, Save

  • Connect

  • Run "Installing Dependencies". When prompted to restart, hit "Cancel" and wait for running tasks to complete before selecting "Runtime" -> "Restart Session".

  • Run "Installing Dependencies" again after runtime has restarted.

  • Run "Helper Functions"

  • Run inserted cell in "Options" to connect to Google Drive.

  • Drag the uploaded audio file from "MyDrive" to "sample_data" then drag again to the content folder (up one level from "sample_data"). (Note: this resolves issues with dragging directly from MyDrive to the content folder.)

  • Change "audio_path =" under "# Name of the audio file" to WAV audio file name

  • Run "Options" cell

  • Run "Processing"

Note: The Google Drive upload is to work around issues with upload speed, trunication, and corruption when using the "Upload" function in the Google Colab notebook.

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions