Support float16 when using ClipGradByGlobalNorm. #33565
Conversation
Thanks for your contribution!
Sorry to inform you that 0e38f7f's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
zhhsplendid left a comment
LGTM
zhiqiu left a comment
LGTM
PR types
Bug fixes
PR changes
APIs
Describe
This PR supports gradient clipping (ClipGradByGlobalNorm) when training with AMP (auto mixed precision).
The grad_clip operation fails on mixed-precision tensors: inside grad_clip, operations such as sum and reduce_sum either do not support the fp16 data type or cannot sum a list of tensors with mixed precisions.
This PR makes grad_clip (ClipGradByGlobalNorm) sum the fp16 tensors and the fp32 tensors separately when computing the global norm used as the scaling factor.
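The idea can be illustrated with a short, hypothetical sketch (not the PR's actual implementation) in Paddle's dygraph mode: the squared norms of fp16 gradients and fp32 gradients are accumulated separately, the fp16 partial sum is cast to fp32 once, and the combined fp32 global norm is then used to rescale every gradient in its own dtype. The helper name clip_by_global_norm_mixed and the plain list-of-tensors interface are assumptions for illustration only; the sketch also assumes reduce_sum/add_n accept fp16 inputs on the device, which is part of what this PR enables.

```python
import paddle

def clip_by_global_norm_mixed(grads, clip_norm=1.0):
    # Hypothetical sketch of the dtype-aware global-norm clipping described above.
    # Squared norms of fp16 and fp32 gradients are accumulated separately so that
    # fp16 tensors are never summed together with fp32 tensors.
    sum_sq_fp16, sum_sq_fp32 = [], []
    for g in grads:
        sq = paddle.sum(paddle.square(g))  # scalar in the gradient's own dtype
        if g.dtype == paddle.float16:
            sum_sq_fp16.append(sq)
        else:
            sum_sq_fp32.append(sq)

    total = paddle.zeros([1], dtype='float32')
    if sum_sq_fp16:
        # reduce all fp16 partial sums first, then cast the result to fp32 once
        total = total + paddle.cast(paddle.add_n(sum_sq_fp16), 'float32')
    if sum_sq_fp32:
        total = total + paddle.add_n(sum_sq_fp32)

    global_norm = paddle.sqrt(total)
    max_norm = paddle.to_tensor([clip_norm], dtype='float32')
    # scale < 1.0 whenever the global norm exceeds clip_norm, 1.0 otherwise
    scale = max_norm / paddle.maximum(global_norm, max_norm)

    # multiply each gradient by the scale cast to that gradient's own dtype
    return [g * paddle.cast(scale, g.dtype) for g in grads]
```

With the fix merged, user code does not need any of this: passing grad_clip=paddle.nn.ClipGradByGlobalNorm(clip_norm) to the optimizer works as usual for AMP training, and the dtype-aware summation happens inside the clipper.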