Support explicit start/end timestamps of speech activity detection (VAD) given by the user + add vad segments in the output #175

Merged

Jeronymous merged 4 commits into master from features/explicit_vad

Mar 3, 2024

Member

Jeronymous commented Mar 1, 2024

The vad option can now be a list of explicit (start, end) timestamps in seconds (e.g. [(0.0, 3.50), (32.43, 36.43)]).
Transcription results include a new key "speech_activity" that holds the start and end timestamps of the VAD segments:

  "speech_activity": [
    {
      "start": 0.0,
      "end": 3.76
    },
    {
      "start": 32.37,
      "end": 36.43
    }
  ]
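A minimal sketch of how these two changes might be used together. The helpers `normalize_segments` and `total_speech_duration` are hypothetical names introduced only for illustration and are not part of the library. The commented-out call assumes that `whisper_timestamped.transcribe` accepts the list through its `vad` argument, as described above, and `"audio.wav"` is a placeholder path.

```python
def normalize_segments(segments):
    """Sort (start, end) pairs in seconds, reject invalid ones, merge overlaps.

    Hypothetical helper: sanity-checks timestamps before passing them as `vad`.
    """
    cleaned = []
    for start, end in sorted((float(s), float(e)) for s, e in segments):
        if start < 0 or end <= start:
            raise ValueError(f"invalid segment: ({start}, {end})")
        if cleaned and start <= cleaned[-1][1]:
            # Overlaps (or touches) the previous segment: extend it.
            cleaned[-1] = (cleaned[-1][0], max(cleaned[-1][1], end))
        else:
            cleaned.append((start, end))
    return cleaned


def total_speech_duration(result):
    """Sum the durations listed under the "speech_activity" key of a result."""
    return sum(seg["end"] - seg["start"] for seg in result.get("speech_activity", []))


explicit_vad = normalize_segments([(32.43, 36.43), (0.0, 3.50)])

# Assumed usage, commented out because it needs the library, a model and audio:
# import whisper_timestamped as whisper
# model = whisper.load_model("tiny")
# result = whisper.transcribe(model, "audio.wav", vad=explicit_vad)

# Stand-in result that reuses the "speech_activity" example shown above.
result = {
    "speech_activity": [
        {"start": 0.0, "end": 3.76},
        {"start": 32.37, "end": 36.43},
    ]
}
print(f"{total_speech_duration(result):.2f} s of speech")
```

Normalizing before the call is a design choice of this sketch, not something the pull request states is required.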

Jeronymous added 4 commits

March 1, 2024 09:31

Support explicit VAD timestamps

Add speech activity segments in the output
f7e6fff

clarify cache folder
0609d7c

update non-regression test outputs
f2f17bd

Jeronymous merged commit bdee5d3 into master

1 check passed

Jeronymous deleted the features/explicit_vad branch

March 3, 2024 20:07
