Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

语音转文字时,最大支持多长时间 #3659

Open
xxch opened this issue Jan 4, 2024 · 1 comment
Open

语音转文字时,最大支持多长时间 #3659

xxch opened this issue Jan 4, 2024 · 1 comment
Assignees
Labels

Comments

@xxch
Copy link

xxch commented Jan 4, 2024

当语音时长为1分47秒时程序报错,并且直接当掉了。
Token indices sequence length is longer than the specified maximum sequence length for this model (515 > 513). Running this sequence through the model will
result in indexing errors已放弃(吐核)
问题1、
如何修改配置可以改变时长?
问题2、
程序报错的时候不应该直接当掉,如何捕获异常?

@zxcd
Copy link
Collaborator

zxcd commented Jan 16, 2024

语音过长的话是否考虑使用vad进行语音切分?目前大部分模型都有音频的时长限制,基本上太长了就会OOM。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants