New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

无法生成和参考音色相近的结果 #8

Open

woodenwatcher opened this issue Oct 14, 2024 · 1 comment

woodenwatcher commented Oct 14, 2024 •

edited

Loading

使用huggingface上的应用，上传了一个女声音频作为参考音色，但是生成的结果却是男声。使用应用里的默认参考音色的音频，生成的结果听起来正常。是操作的姿势不对？

上传的女生参考音色的音频文件下载自该网址：http://www.pthxx.com/e_download/01_langdu/26.html
输入的文本是：“在深山中送走了好友，夕阳西坠把柴门关闭。待到明年春草又绿的时候，朋友啊你能不能回还？”

请大佬帮忙看一下，感谢！！！

Owner

HKoon commented Nov 7, 2024

有两个位置可能会影响合成效果：
1.参考音频的质量和长度，如果包含内容太少可能音色提取那一步会拿不到合适的特征；
2.ChatTTS有一个随机音色Seed，会导致合成的音色不稳定，可以尽量选固定的种子生成，如果效果不好也可以换一下试试（这个部分我有时间就优化一下）；

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment