Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add waveglow #92

Merged
merged 28 commits into from
Jun 6, 2023
Merged

add waveglow #92

merged 28 commits into from
Jun 6, 2023

Conversation

upvenly
Copy link
Collaborator

@upvenly upvenly commented Jun 1, 2023

No description provided.

@shh2000
Copy link
Collaborator

shh2000 commented Jun 2, 2023

除了tacotron2_common之外,还有哪些文件是copy的未经修改,请comment

jittable=False):
""" Code chooses a model based on name"""
model = None
if model_name == 'Tacotron2':
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

rm redundent

@upvenly
Copy link
Collaborator Author

upvenly commented Jun 5, 2023

以下文件参考源代码,不需要review:
training/benchmarks/WaveGlow/pytorch/model/models.py
training/benchmarks/WaveGlow/pytorch/model/model.py
training/benchmarks/WaveGlow/pytorch/model/model_parser.py
training/benchmarks/WaveGlow/pytorch/dataloaders/data_function.py
training/benchmarks/WaveGlow/pytorch/filelists
training/benchmarks/WaveGlow/pytorch/loss/loss_function.py
training/benchmarks/WaveGlow/pytorch/tacotron2_common
training/benchmarks/WaveGlow/pytorch/utils

### 模型信息
- Introduction

The WaveGlow model is a flow-based generative model that generates audio samples from Gaussian distribution using mel-spectrogram conditioning (Figure 2). During training, the model learns to transform the dataset distribution into spherical Gaussian distribution through a series of flows. One step of a flow consists of an invertible convolution, followed by a modified WaveNet architecture that serves as an affine coupling layer. During inference, the network is inverted and audio samples are generated from the Gaussian distribution. Our implementation uses 512 residual channels in the coupling layer.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

注意代词, we/us/our

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

无需修改faster_rcnn的代码

### 运行情况
| 训练资源 | 配置文件 | 运行时长(s) | 目标val_loss | 收敛val_loss | 性能(samples/s) |
| -------- | --------------- | ----------- | ------------ | ------------ | --------------- |
| 单机8卡 | config_A100x1x8 | | | | |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

补上运行数据

@upvenly upvenly merged commit ca824b1 into FlagOpen:main Jun 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants