Is there any way to generate transcription with time label #3572

Closed
@ChanganVR

Is it possible for the Google Speech API to transcribe a long audio file and return the transcription with a time label for each word or each sentence?
If not, any ideas on how to do that? For example, I have thought about slicing the long audio into fixed-length short segments with overlap, transcribing those short segments, and then eliminating the overlapped text. Another possible way is to break the long audio into meaningful sentences using some segmentation technique and then combine them. By the way, the audio I want to transcribe is a 110-minute sports video commentary. Transcribing the whole audio into one long, messy paragraph is not what I want. Thanks to anyone willing to help.
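To make the overlap-slicing idea concrete, here is a minimal sketch in plain Python. It only computes the segment boundaries and stitches word lists back together; it does not call any speech API. The function names `overlapping_windows` and `merge_transcripts`, and the 55-second window with 5 seconds of overlap, are my own assumptions for illustration. The merge step also assumes the overlapped words come back identical in both segments, which a real recognizer may not guarantee.

```python
from typing import List, Tuple


def overlapping_windows(
    total_s: float, window_s: float, overlap_s: float
) -> List[Tuple[float, float]]:
    """Return (start, end) offsets in seconds that cover [0, total_s].

    Consecutive windows share `overlap_s` seconds of audio.
    """
    if not 0 <= overlap_s < window_s:
        raise ValueError("overlap_s must be non-negative and smaller than window_s")
    step = window_s - overlap_s
    windows = []
    start = 0.0
    while start < total_s:
        end = min(start + window_s, total_s)
        windows.append((start, end))
        if end >= total_s:
            break  # the last window already reaches the end of the audio
        start += step
    return windows


def merge_transcripts(prev_words: List[str], next_words: List[str]) -> List[str]:
    """Join two word lists, dropping the duplicated overlap.

    Finds the longest suffix of `prev_words` that equals a prefix of
    `next_words` and keeps it only once. Exact matching is an assumption;
    noisy transcripts would need fuzzy alignment instead.
    """
    max_k = min(len(prev_words), len(next_words))
    for k in range(max_k, 0, -1):
        if prev_words[-k:] == next_words[:k]:
            return prev_words + next_words[k:]
    return prev_words + next_words


if __name__ == "__main__":
    # 110 minutes of audio, hypothetical 55 s windows with 5 s overlap.
    segments = overlapping_windows(110 * 60, 55, 5)
    print(len(segments), segments[0], segments[-1])
    print(merge_transcripts(
        ["he", "shoots", "and", "scores"],
        ["and", "scores", "again"],
    ))
```

Each window's start offset doubles as a coarse time label for the text transcribed from it, so even without per-word timing the output would be time-stamped at segment granularity.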

Labels: api: speech, priority: p2, type: question
