Incorrect Number Pronunciation in Piper VITS with eSpeak-NG #756
Unanswered
Muzaffar-x asked this question in Q&A
Replies: 0 comments
When I train a model with Piper VITS, some eSpeak-NG text processing does not behave correctly. Specifically, numbers 10 and above are pronounced digit by digit (e.g., "10" is read as "one zero" instead of "ten").
Interestingly, when I train a comparable model with Coqui's VITS implementation on the same dataset, the issue does not occur.
Expected Behavior:
Numbers should be pronounced correctly rather than as separate digits.
Steps to Reproduce:
Possible Causes:
Question:
How can this issue be resolved? Are there any ways to explicitly configure eSpeak-NG in Piper VITS to ensure correct number pronunciation?
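One workaround I am considering, independent of any eSpeak-NG setting, is to spell numbers out as words before the text reaches the phonemizer, for both the training transcripts and the inference input. Below is a minimal sketch of that idea, not a confirmed fix. It assumes English text and non-negative integers below one million; `int_to_words` and `expand_numbers` are hypothetical helper names and are not part of Piper or eSpeak-NG.

```python
import re

# Lookup tables for English number words (assumption: the dataset is English).
ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]


def int_to_words(n: int) -> str:
    """Spell out an integer in the range 0 <= n < 1_000_000."""
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, rest = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[rest] if rest else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        tail = " " + int_to_words(rest) if rest else ""
        return ONES[hundreds] + " hundred" + tail
    thousands, rest = divmod(n, 1000)
    tail = " " + int_to_words(rest) if rest else ""
    return int_to_words(thousands) + " thousand" + tail


def expand_numbers(text: str) -> str:
    """Replace each run of digits in `text` with its spelled-out form."""
    def replace(match: re.Match) -> str:
        digits = match.group(0)
        # Runs longer than six digits are outside the supported range,
        # so they are left unchanged rather than guessed at.
        if len(digits) > 6:
            return digits
        return int_to_words(int(digits))

    return re.sub(r"\d+", replace, text)


if __name__ == "__main__":
    print(expand_numbers("I have 10 apples and 21 pears."))
```

This only hides the symptom: decimals, ordinals, dates, and other languages would need their own rules, and it does not explain why eSpeak-NG reads the digits separately in the Piper pipeline.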