A TensorFlow implementation of Google's Tacotron, a deep neural network architecture for text-to-speech described in several papers.
It targets English and Korean in particular.
highly inspired by here
- Python 3.x (preferred)
- TensorFlow 1.x
- matplotlib
- librosa
- numpy
- tqdm
1. Install the requirements: `python -m pip install -r requirements.txt`
2. Edit `config.py`
3. Start training: `python train.py`
Dataset | Samples | Size |
---|---|---|
LJSpeech-1.1 | 13100 | about 30 GB is needed |
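
Before training, it can help to confirm that the dataset is complete. The sketch below reads the `metadata.csv` file shipped with LJSpeech-1.1 (pipe-separated clip id, raw transcript, normalized transcript) and pairs each clip with its text. The function name `load_ljspeech_metadata` and the `dataset_dir` argument are hypothetical and chosen for illustration; this is not the loader in `datasets/ljspeech.py`, only a minimal check under the assumption that the archive was extracted unchanged.

```python
import csv
from pathlib import Path


def load_ljspeech_metadata(dataset_dir):
    """Return (wav_path, text) pairs read from LJSpeech's metadata.csv.

    Hypothetical helper for illustration; not part of this repository.
    """
    dataset_dir = Path(dataset_dir)
    pairs = []
    with open(dataset_dir / "metadata.csv", encoding="utf-8", newline="") as f:
        # Fields are pipe-separated: clip id | raw transcript | normalized transcript.
        # QUOTE_NONE keeps literal quotation marks inside transcripts intact.
        reader = csv.reader(f, delimiter="|", quoting=csv.QUOTE_NONE)
        for row in reader:
            if len(row) < 2:
                continue  # skip blank or malformed lines
            clip_id = row[0]
            # Prefer the normalized transcript; fall back to the raw one if absent.
            text = row[2] if len(row) > 2 and row[2] else row[1]
            pairs.append((dataset_dir / "wavs" / f"{clip_id}.wav", text))
    return pairs
```

Calling `len(load_ljspeech_metadata("path/to/LJSpeech-1.1"))` on a complete copy should give the sample count listed in the table above; the path here is a placeholder.
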
│
├── assets
│ └── images (readme images)
├── datasets
│ ├── ljspeech.py (LJSpeech 1.1 DataSet)
│ └── ...
├── model
│ └── log data (readme images)
├── config.py (whole configuration)
├── dataloader.py (data loading stuff)
├── model.py (lots of TTS models)
├── modules.py (lots of modules frequently used at model)
├── synthesize.py (inference)
├── train.py (model training)
├── utils.py (useful utils)
└── tfutils.py (useful TF utils)
soon!
HyeongChan Kim / @kozistr