Reproducing GENet accuracy #3

pawopawo · 2020-08-09T13:42:04Z

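I came across your GENet and am very interested in it. I would like to reproduce the results in the paper, but the training details in the paper are not very clear. With batch size 1024, lr 0.5, weight decay 1e-4, 360 epochs, 5 warm-up epochs, cosine learning rate decay, and no dropout, the GENet-normal architecture only reached an accuracy of 76.1.

Could you share the training strategy for the GENet-normal architecture: lr, batch size, weight decay, dropout rate, number of epochs, the learning rate decay schedule, and whether warm-up was used? I would really appreciate your help.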

MingLin-home · 2020-08-12T17:51:56Z

We will update our draft this week to include more detailed training parameters. We use cosine lr decay with 5 warm-up epochs, weight decay 4e-5, lr 0.1, and batch size 256.
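For readers who want to try these settings, here is a minimal sketch of a cosine schedule with linear warm-up. The reply above gives only the peak lr (0.1) and the warm-up length (5 epochs). The total epoch count (360, taken from the question), the warm-up starting at zero, the final lr of zero, and the function name `lr_at_epoch` are all assumptions made for illustration, not the authors' confirmed recipe.

```python
import math


def lr_at_epoch(epoch, base_lr=0.1, warmup_epochs=5, total_epochs=360, min_lr=0.0):
    """Return the learning rate for a (possibly fractional) epoch index.

    Assumed shape: linear warm-up from 0 to base_lr over warmup_epochs,
    then cosine decay from base_lr down to min_lr at total_epochs.
    """
    if epoch < warmup_epochs:
        # Linear warm-up phase.
        return base_lr * epoch / warmup_epochs
    # Fraction of the post-warm-up training that has elapsed, clamped to [0, 1].
    progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
    progress = min(progress, 1.0)
    # Cosine decay phase.
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))
```

Passing a fractional epoch such as `step / steps_per_epoch` lets the same function drive a per-iteration update.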

pawopawo · 2020-09-03T10:48:18Z

How much improvement does distillation bring to the results in the paper?

MingLin-home · 2020-09-05T23:23:45Z

The main purpose of the teacher network is to help the student network escape bad local minima. There is about a 1% accuracy drop without the help of the teacher network in the early training stages.

pawopawo · 2020-09-06T03:22:25Z

So distillation does not affect the final accuracy, and only makes convergence faster?

MingLin-home · 2020-09-11T03:06:46Z

Without a teacher network, training quickly gets stuck at around 60 epochs. With a teacher network, accuracy keeps increasing as you train longer. It seems that which teacher network you use is not important, which is weird to us too.
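The thread does not say which distillation loss was used. As a rough illustration only, the sketch below shows one common formulation: cross-entropy on the hard label blended with a temperature-scaled KL term against the teacher's output. The function names, the weight `alpha`, and the `temperature` value are all hypothetical and are not taken from the GENet code.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label, alpha=0.5, temperature=1.0):
    """Blend hard-label cross-entropy with KL(teacher || student).

    alpha and temperature are illustrative assumptions, not values from the thread.
    """
    # Cross-entropy of the student against the ground-truth class index.
    ce = -math.log(softmax(student_logits)[label])
    # KL divergence between temperature-softened teacher and student outputs.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        t * (math.log(t) - math.log(s))
        for t, s in zip(p_teacher, p_student)
        if t > 0.0
    )
    # The T^2 factor keeps the soft-target term on a comparable scale across temperatures.
    return (1.0 - alpha) * ce + alpha * temperature ** 2 * kl
```

With `alpha=0.0` this reduces to plain cross-entropy, which corresponds to training without a teacher.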

MingLin-home closed this as completed Aug 12, 2020

MingLin-home added a commit that referenced this issue Oct 7, 2020

Merge pull request #3 from idstcv/master

b07edf2

merge from idstcv to minglin-home

MingLin-home added a commit that referenced this issue Oct 7, 2020

Merge pull request #10 from MingLin-home/master

f3d18b1

Merge pull request #3 from idstcv/master
