add waveglow #92

upvenly · 2023-06-01T09:03:34Z

No description provided.

training/nvidia/WaveGlow-pytorch/README.md

shh2000 · 2023-06-02T01:47:07Z

Besides tacotron2_common, which other files were copied without modification? Please comment.

training/benchmarks/WaveGlow/pytorch/config/mutable_params.py

training/benchmarks/WaveGlow/pytorch/dataloaders/dataloader.py

training/benchmarks/WaveGlow/pytorch/loss/loss_function.py

training/benchmarks/WaveGlow/pytorch/model/model.py

training/benchmarks/WaveGlow/pytorch/run_pretraining.py

training/benchmarks/WaveGlow/pytorch/train/trainer.py

training/benchmarks/faster_rcnn/pytorch/train/trainer.py

training/nvidia/WaveGlow-pytorch/README.md

shh2000 · 2023-06-02T03:23:46Z

training/benchmarks/WaveGlow/pytorch/model/models.py

+              jittable=False):
+    """ Code chooses a model based on name"""
+    model = None
+    if model_name == 'Tacotron2':


Remove redundant code.

training/benchmarks/WaveGlow/pytorch/model/models.py

training/benchmarks/WaveGlow/pytorch/utils/utils.py

training/benchmarks/WaveGlow/pytorch/train/evaluator.py

training/benchmarks/WaveGlow/pytorch/train/trainer.py

training/benchmarks/WaveGlow/pytorch/train/trainer_adapter.py

training/benchmarks/WaveGlow/pytorch/utils/utils.py

upvenly · 2023-06-05T02:46:44Z

The following files are taken from the reference source code and do not need review:
training/benchmarks/WaveGlow/pytorch/model/models.py
training/benchmarks/WaveGlow/pytorch/model/model.py
training/benchmarks/WaveGlow/pytorch/model/model_parser.py
training/benchmarks/WaveGlow/pytorch/dataloaders/data_function.py
training/benchmarks/WaveGlow/pytorch/filelists
training/benchmarks/WaveGlow/pytorch/loss/loss_function.py
training/benchmarks/WaveGlow/pytorch/tacotron2_common
training/benchmarks/WaveGlow/pytorch/utils

yuzhou03 · 2023-06-05T07:26:29Z

training/benchmarks/WaveGlow/pytorch/README.md

+### Model information
+- Introduction
+
+The WaveGlow model is a flow-based generative model that generates audio samples from Gaussian distribution using mel-spectrogram conditioning (Figure 2). During training, the model learns to transform the dataset distribution into spherical Gaussian distribution through a series of flows. One step of a flow consists of an invertible convolution, followed by a modified WaveNet architecture that serves as an affine coupling layer. During inference, the network is inverted and audio samples are generated from the Gaussian distribution. Our implementation uses 512 residual channels in the coupling layer.


Watch the pronouns: we/us/our.
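
For readers of this thread, the affine coupling step described in the quoted README paragraph can be sketched as below. This is a minimal, pure-Python illustration and not the code under review: `toy_conditioner` is a hypothetical stand-in for the modified WaveNet network, the mel-spectrogram conditioning and the invertible convolution are omitted, and the input length is assumed to be even.

```python
import math


def toy_conditioner(x_a):
    """Hypothetical stand-in for the WaveNet-like network.

    Returns a log-scale and a shift for each element of the untouched half.
    """
    log_s = [0.5 * math.tanh(v) for v in x_a]
    t = [0.1 * v for v in x_a]
    return log_s, t


def coupling_forward(x):
    """Affine coupling: keep the first half, scale and shift the second half."""
    half = len(x) // 2
    x_a, x_b = x[:half], x[half:]
    log_s, t = toy_conditioner(x_a)
    y_b = [b * math.exp(ls) + ti for b, ls, ti in zip(x_b, log_s, t)]
    # The log-determinant of the Jacobian is the sum of the log-scales.
    log_det = sum(log_s)
    return x_a + y_b, log_det


def coupling_inverse(y):
    """Invert the coupling; the first half is unchanged, so log_s and t can be recomputed."""
    half = len(y) // 2
    y_a, y_b = y[:half], y[half:]
    log_s, t = toy_conditioner(y_a)
    x_b = [(b - ti) * math.exp(-ls) for b, ls, ti in zip(y_b, log_s, t)]
    return y_a + x_b


x = [0.3, -1.2, 0.7, 2.0]
y, log_det = coupling_forward(x)
x_rec = coupling_inverse(y)
```

Because the first half passes through unchanged, the inverse can recompute the same scale and shift, which is what lets the network be inverted at inference time as the paragraph describes.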

yuzhou03 · 2023-06-05T07:40:53Z

training/benchmarks/faster_rcnn/pytorch/train/trainer.py

There is no need to modify the faster_rcnn code.

yuzhou03 · 2023-06-05T07:42:45Z

training/nvidia/WaveGlow-pytorch/README.md

+### Run results
+| Training resources | Config file     | Runtime (s) | Target val_loss | Converged val_loss | Performance (samples/s) |
+| ------------------ | --------------- | ----------- | --------------- | ------------------ | ----------------------- |
+| 1 node, 8 GPUs     | config_A100x1x8 |             |                 |                    |                         |


Please fill in the run data.

training/benchmarks/WaveGlow/pytorch/train/trainer_adapter.py

training/benchmarks/driver/helper.py

upvenly added 10 commits May 16, 2023 17:43

add wav2vec2

7bc933c

add wav2vec2

53c0314

support-different-image

91e0825

update waveglow

4916362

update waveglow

0089955

update waveglow

887cfbb

update waveglow

dfbcb23

update waveglow

a680dc8

update waveglow

265ebac

update waveglow

44e35d1

upvenly requested review from yuzhou03, shh2000 and Ox7c000000 June 1, 2023 09:03

upvenly added 5 commits June 1, 2023 17:46

update

885380c

update

f1f7898

update

9e8dc0c

update

f00658d

update

d53ace4

yuzhou03 reviewed Jun 1, 2023

training/nvidia/WaveGlow-pytorch/README.md

update

e80b2ce