
Fixed previous weight handling for DCASGD optimizer. #5140

Merged · 9 commits merged into apache:master on Mar 5, 2017

Conversation

@sergeykolychev (Contributor) commented Feb 24, 2017

While porting the latest Python changes to the Perl interface, I stumbled upon something that looks like a small logic error.

@piiswrong (Contributor) commented:

Without a copy, the previous weight will always be the same as weight. It should have been stored as a state.

@sergeykolychev (Contributor, Author) commented Feb 25, 2017

@piiswrong
The logic still does not compute (I may be missing something): weight - previous_weight is always 0.
Can you work out what it should be from https://arxiv.org/pdf/1609.08326.pdf, page 5?
They seem to keep weight_bak for several iterations and only update it when a 'pull request' arrives.
I do not see anything async about the current code; do you have time to check this?
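For reference, the delay-compensated update on page 5 of the paper looks roughly like the sketch below. This is a minimal NumPy-style rendering of the formula, not MXNet code; `lamda` is the paper's variance-control coefficient and `weight_bak` is the stale snapshot the gradient was computed against.

```python
import numpy as np

def dcasgd_step(weight, weight_bak, grad, lr=0.1, wd=1e-4, lamda=0.04):
    """One delay-compensated ASGD step (sketch after arXiv:1609.08326).

    grad was computed against the stale weight_bak; the elementwise
    lamda * grad * grad * (weight - weight_bak) term approximates the
    missing Hessian correction for that staleness.
    """
    weight -= lr * (grad + wd * weight
                    + lamda * grad * grad * (weight - weight_bak))
    return weight

# weight_bak is refreshed only when the worker pulls fresh weights,
# so it can lag behind weight for several iterations:
w = np.zeros(3)
w_bak = w.copy()                      # snapshot taken at pull time
g = np.array([0.5, -0.2, 0.1])
w = dcasgd_step(w, w_bak, g)
```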

Moved saving the previous weight back to before updating the weight. The original bug was in the momentum == 0.0 block, it seems.
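Put differently, the delta has to be computed while `previous_weight` still holds the old snapshot; only then is the snapshot refreshed and the update applied, otherwise `weight - previous_weight` collapses to 0. A hedged sketch of that ordering for the momentum == 0.0 branch (NDArray-style in-place ops; names are illustrative, not necessarily the merged code):

```python
# Illustrative ordering only; lamda is the delay-compensation coefficient.
delta = -lr * (grad + wd * weight
               + lamda * grad * grad * (weight - previous_weight))
previous_weight[:] = weight   # in-place copy *before* updating the weight
weight[:] += delta            # apply the step last
```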
@sergeykolychev (Contributor, Author) commented:

I do not know if it actually implements the paper (I do not yet know the inner workings of mxnet that deeply), but the logic now computes in my mind.

```python
else:
    weight[:] += -lr * (grad + wd * weight)
    self.weight_previous[index] = weight
mom, previous_weight = state
```
Review comment (Contributor) on the diff above:
This wastes memory and compute. Better to use different states and test for momentum == 0.
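One shape that suggestion could take, assuming an mxnet-style `create_state`; this is a hypothetical sketch, not the merged implementation:

```python
import mxnet as mx

def create_state(self, index, weight):
    # Allocate a momentum buffer only when momentum is actually used,
    # but always keep a real copy of the weight (not a reference) for
    # the delay-compensation term.
    if self.momentum == 0.0:
        return (None, weight.copy())
    return (mx.nd.zeros(weight.shape, weight.context), weight.copy())
```

The update method can then branch on whether the momentum buffer is None instead of allocating and updating a zero momentum on every step.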

@sergeykolychev (Contributor, Author) commented:

@piiswrong
Eric, could you please review the latest change and see if it makes sense to you.

@piiswrong merged commit a23608f into apache:master on Mar 5, 2017