-
Notifications
You must be signed in to change notification settings - Fork 158
Planned Features
ThioJoe edited this page Jan 11, 2024
·
17 revisions
- N/A
- Azure Translator support
- Support for Amazon Polly
- Find a way to do 'forced alignment' of a good transcript with the video to get word-level timestamps, which should make it easier to start and stop speech clips at more natural breakpoints. And also not necessarily require pre-made subtitle files
- It seems Azure Translator may be able to provide "Word Alignment" information, which could be used to provide better translations. So you could translate the whole transcript at once instead of individual fragments which sometimes sentences in half resulting in incorrect translations. Then you could use the 'word indexes' to match up where the translated words should go based on where the pre-translated words are. I suspect this is how YouTube does the auto syncing of translated transcripts behind the scenes.