-
Notifications
You must be signed in to change notification settings - Fork 4.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding neural HMM TTS Model #2272
Conversation
I am testing it locally first, then I will mark it ready for review! |
@erogol All tts-test are raising a |
I think it is about your system setup. Something is broken. We don't see it on the CI tests. You should also rebase your code with v0.10.2 to make the tests pass the above. |
@erogol |
That is not PR relevant. It is just a network issue. I'll restart the test. |
@shivammehta25 Thx again for the PR. I merge this. You might also want to add docs for your models for better visibility as in https://github.com/coqui-ai/TTS/tree/dev/docs/source/models PS. This time I can't train a model due to time constraints. |
#2271 Updated to a new model to avoid problems with model zoo and other test cases.
Neural HMM TTS (2021) is a parameter-efficient version and predecessor to OverFlow (28.6M vs 15.3M), it was the probabilistic spectrogram generation model which introduced neural HMMs into a TTS architecture in "Neural HMMs are all you need (for high-quality attention-free TTS)". Because of half the number of parameters the synthesis output quality is suboptimal (but comparable to Tacotron2 without Postnet) but it learns to speak with a lesser amount of data and is significantly faster than other attention-based methods.