GitHub - eirene-aisa/glow-tts-practice: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Glow-TTS Official Repository

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Multispeaker enabled Glow-tts

Glow-TTS with a Korean cleaner and multispeaker training enabled (with reference to several related issues).

This repo is recommended as a reference for multispeaker training.

_custom: files with this suffix run with the Korean cleaners.

_custom_multi: files with this suffix run with the Korean cleaners, for multispeaker training.

A single-speaker Korean demo using KSS is available. link

Korean cleaner

Taken from Pitchtron: https://github.com/hash2430/pitchtron

Solved issues

Because of the apex dependency (commit: 37cdaf4), I used PyTorch 1.3.0 instead of 1.2.0.
For the multispeaker setting:
- The filelist should be in the following format (related issue):
  
  audio_path(*.wav)|speaker_id|transcript
- Adding n_speakers and gin_channels to the config is recommended (related issue).
- (TextMelLoader, TextMelCollate) should be replaced with (TextMelSpeakerLoader, TextMelSpeakerCollate) in init.py and train.py.
  
  Also, change (x, x_lengths, y, y_lengths) to (x, x_lengths, y, y_lengths, g).
- The speaker information (g) must be passed explicitly to FlowGenerator (related issue):
  
  generator(x=x, x_lengths=x_lengths, y=y, y_lengths=y_lengths, g=g, gen=False) (I do not know why this is needed.)
'Gradient overflow' may be caused by a problem in the data (related issue).
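
Since data problems are one suspected cause of trouble, it may help to validate the multispeaker filelist before training. The following is a minimal sketch, not code from this repo: the names Utterance, parse_filelist and count_speakers are hypothetical, and it assumes speaker IDs are integers starting at 0.

```python
from typing import List, NamedTuple


class Utterance(NamedTuple):
    audio_path: str
    speaker_id: int
    transcript: str


def parse_filelist(lines: List[str]) -> List[Utterance]:
    """Parse 'audio_path|speaker_id|transcript' lines, raising on malformed rows."""
    utterances = []
    for line_no, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first two separators only, so a '|' inside the
        # transcript is kept as part of the transcript.
        parts = line.split("|", 2)
        if len(parts) != 3:
            raise ValueError(f"line {line_no}: expected 3 fields, got {len(parts)}")
        path, speaker, text = parts
        if not path.endswith(".wav"):
            raise ValueError(f"line {line_no}: audio path is not a .wav file: {path}")
        if not text.strip():
            raise ValueError(f"line {line_no}: empty transcript")
        # int() raises ValueError if the speaker ID is not an integer.
        utterances.append(Utterance(path, int(speaker), text))
    return utterances


def count_speakers(utterances: List[Utterance]) -> int:
    """Smallest n_speakers covering every ID (assumption: IDs start at 0)."""
    return max(u.speaker_id for u in utterances) + 1
```

The value returned by count_speakers is a candidate for n_speakers in the config. To check a real file, pass the result of open(path, encoding="utf-8").read().splitlines() to parse_filelist.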

1. Environments (edited)

Python==3.6.9
pytorch==1.3.0
cython==0.29.12
librosa==0.7.1
numpy==1.16.4
scipy==1.5.4
nltk==3.6.5
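
To compare an existing environment against these pins, something like the sketch below could be used. It is an illustration only: find_mismatches and parse_freeze are hypothetical helper names, the input is assumed to be the text printed by pip freeze, and the PyTorch pin is keyed as torch because that is the name of the pip distribution.

```python
# Pins copied from the list above; PyTorch is distributed on pip as "torch".
PINNED = {
    "torch": "1.3.0",
    "cython": "0.29.12",
    "librosa": "0.7.1",
    "numpy": "1.16.4",
    "scipy": "1.5.4",
    "nltk": "3.6.5",
}


def parse_freeze(text):
    """Parse `pip freeze` style 'name==version' lines into a lowercase-name dict."""
    versions = {}
    for line in text.splitlines():
        name, sep, version = line.strip().partition("==")
        if sep:
            # Drop local version tags such as '+cu100' before comparing.
            versions[name.lower()] = version.split("+")[0]
    return versions


def find_mismatches(pinned, installed):
    """Return {name: (wanted, found)} for each pin that is missing or differs."""
    mismatches = {}
    for name, wanted in pinned.items():
        found = installed.get(name)  # None means the package is not installed
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

Calling find_mismatches(PINNED, parse_freeze(freeze_text)) returns an empty dict when every pinned package is installed at the listed version.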

2. Pre-requisites

Please check the official repository.

3. Training Example

sh train_custom_multi_ddi.sh configs/base.json base
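
The command above takes a JSON config, and for multispeaker training this README recommends adding n_speakers and gin_channels to it. A minimal sketch of that edit follows. Several things here are assumptions rather than facts from this repo: that the two keys belong under a "model" section, the helper name add_speaker_settings, the placeholder config contents, and the example values 4 and 256.

```python
import copy
import json


def add_speaker_settings(config, n_speakers, gin_channels, section="model"):
    """Return a copy of config with n_speakers and gin_channels set under `section`.

    Assumption: the training script reads both keys from the `section` block.
    """
    if n_speakers < 2:
        raise ValueError("multispeaker training needs at least 2 speakers")
    if gin_channels < 1:
        raise ValueError("gin_channels must be positive")
    updated = copy.deepcopy(config)  # leave the caller's dict untouched
    block = updated.setdefault(section, {})
    block["n_speakers"] = n_speakers
    block["gin_channels"] = gin_channels
    return updated


# Placeholder config; a real one would be loaded with json.load from the config file.
example = {"train": {}, "data": {}, "model": {"hidden_channels": 192}}
multi = add_speaker_settings(example, n_speakers=4, gin_channels=256)
print(json.dumps(multi["model"], indent=2))
```

Writing the result back with json.dump gives a config file that can be passed as the first argument of the training script.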

4. Inference Example

See inference.ipynb

