fix adamw apply gradient #30130
Conversation
Thanks for your contribution!
e0f6567 to 7b8a46f
python/paddle/optimizer/adamw.py (Outdated)
    assert param.dtype == paddle.fluid.core.VarDesc.VarType.FP32, \
        "the type of coeff(float) and parameter(%s) is not consistent." % (param.dtype)
else:
    assert self._coeff.dtype == param.dtype, \
Requiring users to use double feels a bit inconvenient. When coeff is a float, could we drop the requirement on param.dtype? Is it the case that decay_coeff = 1.0 - self._coeff * learning_rate below will necessarily take self._coeff's dtype instead of learning_rate's dtype?
Do you mean removing this whole check entirely?
Done.
1.0 - self._coeff * learning_rate returns a tensor with self._coeff's dtype; if self._coeff and learning_rate have different dtypes, a cast is automatically inserted to convert the type.
Removing the check means any dtype is supported; rewriting the expression as 1.0 - learning_rate * self._coeff makes it follow learning_rate's dtype.
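Below is a minimal sketch (not the PR's code; the tensor value and eager-mode usage are assumed for illustration) of why 1.0 - learning_rate * self._coeff follows learning_rate's dtype when the coefficient is a plain Python float:

```python
# Illustrative sketch only; assumes Paddle 2.x dygraph mode.
import paddle

learning_rate = paddle.to_tensor(0.001, dtype='float32')
coeff = 0.01  # plain Python float, as stored in the optimizer's _coeff

# The scalar coefficient adopts the tensor operand's dtype, so the result
# keeps learning_rate's dtype and no dtype check on param is needed.
decay_coeff = 1.0 - learning_rate * coeff
print(decay_coeff.dtype)  # paddle.float32
```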
LGTM
LGTM
LGTM
PR types
Bug fixes
PR changes
APIs
Describe
paddle.optimizer.AdamW is incompatible (see #29794). The weight decay update param = param - param * lr * coeff is optimized as follows: param = param * (1.0 - lr * coeff).
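As a sketch of this rewrite (illustrative only; the parameter values below are made up), both formulations compute the same decayed parameter, but the second folds the decay into a single scale:

```python
# Illustrative sketch of the algebraic rewrite; values are made up.
import paddle

param = paddle.to_tensor([1.0, -2.0, 3.0], dtype='float32')
lr, coeff = 0.01, 0.1

# Before: subtract the decay term explicitly.
decayed_before = param - param * lr * coeff

# After (this PR): scale the parameter once by (1.0 - lr * coeff).
decayed_after = param * (1.0 - lr * coeff)

print(paddle.allclose(decayed_before, decayed_after))  # True
```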