iSTFT's generator and Avocodo's discriminators. Use faster
branch for ultra fast training and inference compared to original hifi-gan
.
python train.py --config config_v1.json
- Two of iSTFT Generator version can be used with Avocodo discriminaators.
- Losses should similar to HiFi-GAN.
faster
branch of this repo is 50 % speed up (0.33 s/b) in training and 60 % improvement in inference speed.- Quality is not as good as Avocodo and ISTFTNet.
- https://github.com/rishikksh20/iSTFTNet-pytorch
- https://github.com/rishikksh20/Avocodo-pytorch
- https://github.com/jik876/hifi-gan
@inproceedings{kaneko2022istftnet,
title={{iSTFTNet}: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform},
author={Takuhiro Kaneko and Kou Tanaka and Hirokazu Kameoka and Shogo Seki},
booktitle={ICASSP},
year={2022},
}
@misc{https://doi.org/10.48550/arxiv.2206.13404,
doi = {10.48550/ARXIV.2206.13404},
url = {https://arxiv.org/abs/2206.13404},
author = {Bak, Taejun and Lee, Junmo and Bae, Hanbin and Yang, Jinhyeok and Bae, Jae-Sung and Joo, Young-Sun},
keywords = {Audio and Speech Processing (eess.AS), Artificial Intelligence (cs.AI), Sound (cs.SD), FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Computer and information sciences, FOS: Computer and information sciences},
title = {Avocodo: Generative Adversarial Network for Artifact-free Vocoder},
publisher = {arXiv},
year = {2022},
copyright = {arXiv.org perpetual, non-exclusive license}
}