Modify the calculation logic of LambOptimizer #29313
Conversation
Commits:
- test=develop
- test=develop
- test=develop
- test=develop
- …into lamb_temp

Sorry to inform you that 6464340's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
T mom1 = moment1_[i];
T mom2 = moment2_[i];
T p = param_[i];
T beta1_pow = *beta1_pow_;
Why do you need to initialize `T beta1_pow` and `T beta2_pow` here? Can you use the pointer directly?
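To illustrate the question, here is a minimal hypothetical sketch (the struct name and members are placeholders modeled on the diff, not the actual Paddle functor): since `beta1_pow_` points at a single value, it can be dereferenced at the point of use instead of being copied into a local `T` first.

```cpp
#include <cstddef>

// Hypothetical functor fragment, only to illustrate the review comment.
template <typename T>
struct LambMomentSketch {
  const T* beta1_pow_;  // single-element tensor data
  const T* moment1_;
  T* moment1_out_;

  void operator()(std::size_t i) const {
    // Instead of: T beta1_pow = *beta1_pow_;
    // dereference the pointer directly where the value is needed.
    moment1_out_[i] = moment1_[i] / (static_cast<T>(1) - *beta1_pow_);
  }
};
```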
moment2_out_[i] = mom2;
trust_ratio_div_[i] = mom1 / (sqrt(mom2) + epsilon_) + weight_decay_ * p;

mom1_unbiased = mom1 / (1 - beta1_pow);
Write `T mom1_unbiased = mom1 / (1 - beta1_pow);` then you can do less constructing for `T`, same for `mom2_unbiased`.
Thanks, done.
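For reference, a minimal self-contained sketch of the suggested pattern (the function name is hypothetical and the variable names only mirror the diff; this is not the actual Paddle kernel): declaring and initializing in one statement constructs each `T` exactly once, with the bias correction this PR adds.

```cpp
#include <cmath>

// Hypothetical helper illustrating the declare-and-initialize style,
// including the bias correction of both moments.
template <typename T>
T TrustRatioDivSketch(T mom1, T mom2, T beta1_pow, T beta2_pow, T p,
                      T epsilon, T weight_decay) {
  // One construction per variable; no default-construct-then-assign.
  T mom1_unbiased = mom1 / (static_cast<T>(1) - beta1_pow);
  T mom2_unbiased = mom2 / (static_cast<T>(1) - beta2_pow);
  return mom1_unbiased / (std::sqrt(mom2_unbiased) + epsilon) +
         weight_decay * p;
}
```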
    self._cache_founf_inf = True
else:
    if optimizer.__class__.__name__ in ["Lamb", "LambOptimizer"]:
        from paddle.fluid.clip import ClipGradByGlobalNorm
Can we import at the file header?
Thanks, done.
def run_simple_conv(inp_np, use_scaler=True):
    paddle.seed(10)
    paddle.framework.random._manual_program_seed(10)
    with fluid.dygraph.guard():
On Paddle 2.0, should we use paddle.disable_static() / paddle.enable_static() instead of dygraph.guard?
Thanks, this unittest has been deleted.
optimizer = paddle.optimizer.Lamb(
    learning_rate=0.01, parameters=model.parameters())
scaler = fluid.dygraph.AmpScaler(init_loss_scaling=1024)
data = fluid.dygraph.to_variable(inp_np)
Same for these fluid APIs: should we use the Paddle 2.0 API?
Thanks, this unittest has been deleted.
class TestSparseLambOp(unittest.TestCase):
class gTestSparseLambOp(unittest.TestCase):
Please remove the meaningless `g` in this class name.
Thanks, done.
framework::TensorCopy(beta1_pow, platform::CUDAPlace(), &beta1_pow_gpu);
framework::TensorCopy(beta2_pow, platform::CUDAPlace(), &beta2_pow_gpu);
beta1_pow and beta2_pow each hold only one value. You don't need to do a copy here. Please refer to Adam.
Thanks, done.
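A rough sketch of what the comment suggests (the helper name is hypothetical, and it assumes the test keeps `beta1_pow`/`beta2_pow` as single-element CPU tensors): because each tensor holds exactly one value, the test can read that value on the host and pass it along directly, without a device copy.

```cpp
#include "paddle/fluid/framework/tensor.h"

// Hypothetical test helper: read the single beta_pow value on the host
// instead of copying the one-element tensor to the GPU with TensorCopy.
template <typename T>
T GetBetaPow(const paddle::framework::Tensor& beta_pow) {
  // Assumes beta_pow lives on CPU and holds exactly one element.
  return beta_pow.data<T>()[0];
}
```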
LGTM for op_function_generator.cc
LGTM.
LGTM
* Modify the calculation logic of LambOptimizer
PR types: Performance optimization
PR changes: OPs
Describe:
Modify the calculation logic of Lamb so that it corresponds to the paper.
The calculation logic from the paper is shown in the first figure, and the current Lamb calculation logic in Paddle is shown in the second figure (figures omitted). Compared with the paper, Paddle's implementation lacks the step marked by the red box in the first figure, i.e. the bias correction of the first and second moments.
So the main purpose of this PR is to complete the calculation of LambOptimizer.
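For reference, this is the per-parameter update rule from the LAMB paper that the PR brings the implementation in line with; the bias-correction step on the second line is the part that was previously missing (notation follows the paper, not Paddle's variable names):

```latex
\begin{aligned}
m_t &= \beta_1 m_{t-1} + (1-\beta_1)\,g_t, &
v_t &= \beta_2 v_{t-1} + (1-\beta_2)\,g_t^2 \\
\hat{m}_t &= \frac{m_t}{1-\beta_1^{\,t}}, &
\hat{v}_t &= \frac{v_t}{1-\beta_2^{\,t}}
\quad\text{(bias correction, previously missing)} \\
r_t &= \frac{\hat{m}_t}{\sqrt{\hat{v}_t}+\epsilon} + \lambda\,w_{t-1}, &
w_t &= w_{t-1} - \eta_t\,\frac{\phi(\lVert w_{t-1}\rVert)}{\lVert r_t\rVert}\,r_t
\end{aligned}
```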
Document preview link: http://10.136.157.23:8090/documentation/docs/zh/api/index_cn.html?reviewVersion=jenkins-doc-review-167
Lamb: http://10.136.157.23:8090/documentation/docs/en/api/paddle/optimizer/Lamb_en.html
LambOptimizer: http://10.136.157.23:8090/documentation/docs/en/api/paddle/fluid/optimizer/LambOptimizer_en.html