Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Good pre-trained weights anyone? #19

Closed
duvtedudug opened this issue Feb 17, 2018 · 15 comments
Closed

Good pre-trained weights anyone? #19

duvtedudug opened this issue Feb 17, 2018 · 15 comments

Comments

@duvtedudug
Copy link

First, thank you very much @r9y9 and everyone for the great work!

Does anyone want to share pre-trained weights that sound good?

Particularly for LJSpeech if possible. My training is to be converging to a very high loss value. I would love to experiment with some sounds, and maybe figure out where I am going wrong in training.

Thanks in advance,
Duvte.

@azraelkuan
Copy link
Contributor

@duvtedudug i have trained ljspeech for 340k, this is the link ljspeech_340k_pth

@r9y9
Copy link
Owner

r9y9 commented Feb 18, 2018

https://www.dropbox.com/s/8qgcbd1mm2xsqgq/20180127_mixture_lj_checkpoint_step000410000_ema.pth?dl=0

Weights used to generate speech for https://r9y9.github.io/wavenet_vocoder/

@duvtedudug
Copy link
Author

@azraelkuan @r9y9
Thank you both very much!

@pbaljeka
Copy link

@r9y9 Thanks for the checkpoint! Would it be possible to share the multispeaker checkpoint as well?
Thanks!

@r9y9
Copy link
Owner

r9y9 commented Feb 20, 2018

@mfkfge
Copy link

mfkfge commented Mar 5, 2018

@r9y9 Could you show us the parameters/configurations for this checkpoint(20180212_mixture_multispeaker_cmu_arctic_checkpoint_step000740000_ema.pth). I tried to generate voices with this one, while the results were not as good as what you published.

My command line is as following:
python synthesis.py checkpoint.pth --hparams="input_type=raw,gin_channels=16" --speaker-id=5

One of my results is as following:

generated.zip

@r9y9
Copy link
Owner

r9y9 commented Mar 5, 2018

@mfkfge Sounds like there's mismatch between mel-spectrogram and speaker ID. Did you use mel-spectrogram of speaker ID 5?

@mfkfge
Copy link

mfkfge commented Mar 5, 2018

@r9y9 yes. i did try with mel-spectrogram of speaker id 5 as well as that of speaker id 6.

@r9y9
Copy link
Owner

r9y9 commented Mar 5, 2018

Oh, I see the problem. Can you try with --hparams="input_type=raw,gin_channels=16.sample_rate=16000"? sample_rate is 16kHz for CMU ARCTIC.

@mfkfge
Copy link

mfkfge commented Mar 6, 2018

@r9y9 Thanks! It turns good with "sample_rate=16000".

@skyw
Copy link

skyw commented Aug 24, 2018

I'm also having trouble generating good sound, even with sample_rate=16000.

My cmd line is:
python synthesis.py 20180212_mixture_multispeaker_cmu_arctic_checkpoint_step000740000_ema.pth ./ --conditional=cmu_arctic-mel-00001.npy --preset=20180212_multispeaker_cmu_arctic_mixture.json --symmetric-mels --speaker-id 5 --hparams="sample_rate=16000"

Am I doing the right thing?

@zctang
Copy link

zctang commented May 8, 2019

@mfkfge Sounds like there's mismatch between mel-spectrogram and speaker ID. Did you use mel-spectrogram of speaker ID 5?

@r9y9 could you explain how to match mel-spectrogram with speaker ID ? each speaker ID have its specific mel-spectrogram?

@r9y9
Copy link
Owner

r9y9 commented May 8, 2019

@zctang See train.txt in your preprocessed data directory. It should contain speaker ID in the last column. See also

return (audio_filename, mel_filename, timesteps, text, speaker_id)

@zctang
Copy link

zctang commented May 8, 2019

@zctang See train.txt in your preprocessed data directory. It should contain speaker ID in the last column. See also

return (audio_filename, mel_filename, timesteps, text, speaker_id)

OK, I see. Thank you.

@harirawat
Copy link

@skyw did you pre-process the CMU dataset to generate cmu_arctic-mel-00001.npy even when you are using the pre-trained model?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants