
add lookahead optimiser #969


Closed · wants to merge 3 commits

Conversation

chengchingwen (Member)

This adds an implementation of the Lookahead optimiser from this paper, based on the PyTorch implementation.

state::IdDict
end

const MomentumOptim = Union{Momentum, RMSProp, Nesterov, ADAM, RADAM, AdaMax, ADAGrad, ADADelta, AMSGrad, NADAM}
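For context, here is a minimal sketch of how a Lookahead wrapper can sit on top of Flux's old mutating apply!(opt, x, Δ) optimiser API. This is an illustration only: the field names, the defaults, and the trick of rewriting Δ at a sync step are assumptions for the sketch, not the PR's actual code.

import Flux
import Flux.Optimise: apply!

# Sketch only: `inner`, `alpha`, `k` and `state` are illustrative names.
mutable struct Lookahead
  inner              # fast (inner) optimiser, e.g. Flux.ADAM()
  alpha::Float64     # slow-weight interpolation step α
  k::Int             # synchronise every k inner steps
  state::IdDict      # per-parameter (slow-weight copy, step counter)
end

Lookahead(inner = Flux.ADAM(); alpha = 0.5, k = 5) = Lookahead(inner, alpha, k, IdDict())

function apply!(o::Lookahead, x, Δ)
  slow, t = get!(() -> (copy(x), 0), o.state, x)
  Δ = apply!(o.inner, x, Δ)                     # step proposed by the inner optimiser
  t += 1
  if t % o.k == 0
    fast = x .- Δ                               # fast weights after this inner step
    slow .= slow .+ o.alpha .* (fast .- slow)   # slow update: φ ← φ + α(θ − φ)
    Δ .= x .- slow                              # so update! lands exactly on the slow weights
  end
  o.state[x] = (slow, t)
  return Δ
end

Since Flux's update! only goes through apply!, such a wrapper would drop into the usual Flux.train!(loss, ps, data, Lookahead(Flux.ADAM())) loop, and the inner optimiser's own state (e.g. ADAM's moment estimates) is simply maintained across sync steps.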
Member

Perhaps an AbstractOptimiser type would help with some of the repetition.

Member Author

But some optimisers don't have a momentum term. Maybe we also need an AbstractMomentumOptimiser?

Member

I'm wondering if we could use traits here?

chengchingwen (Member Author) · Dec 16, 2019

https://docs.julialang.org/en/v1/manual/methods/#Trait-based-dispatch-1
If you are talking about this, I guess it's possible.

Member Author

abstract type OptimiserStyle end
struct Momentumlike <: OptimiserStyle end
struct NonMomentumlike <: OptimiserStyle end

has_momentum(o::Type{Descent}) = NonMomentumlike()
has_momentum(o::Type{Momentum}) = Momentumlike()
has_momentum(o::Type{ADAM}) = Momentumlike()

like this?
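(Purely for illustration, and using hypothetical names not present in this PR, such a trait would typically be consumed via dispatch along these lines:)

handle_state!(o) = handle_state!(has_momentum(typeof(o)), o)
handle_state!(::Momentumlike, o)    = nothing   # momentum-carrying optimisers: adjust buffers here
handle_state!(::NonMomentumlike, o) = nothing   # stateless optimisers: nothing to do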

Member

Something like that, yeah.


chengchingwen (Member Author)

@dhairyagandhi96 I defined an AbstractOptimiser and the GradientStyle traits.

CarloLucibello (Member)

I would step back to the original problem and just add the Lookahead optimizer.

chengchingwen (Member Author)

@CarloLucibello Sure! Should I revert back to commit 510cfaa or 14956c2?

CarloLucibello (Member)

Yes please. If we want to introduce a type hierarchy for the optimizers, let's first discuss it in a separate issue/PR.

chengchingwen (Member Author)

@CarloLucibello Reverted.

CarloLucibello (Member) commented Jul 4, 2020

In the paper, I read:
"""
We have the choice of maintaining, interpolating, or resetting the internal state (e.g. momentum) of
the inner optimizer. We evaluate this tradeoff on the CIFAR dataset (where every choice improves
convergence) in Appendix D.1 and maintain internal state for the other experiments.
....
D.1 Inner Optimizer State
Throughout our paper, we maintain the state of our inner optimizer for simplicity. For SGD with
heavy-ball momentum, this corresponds to preserving the momentum. Here, we present a sensitivity
study by comparing the convergence of Lookahead when maintaining the momentum, interpolating
the momentum, and resetting the momentum. All three improve convergence versus SGD.
"""
So let's keep this simple and just maintain the state.

This also needs a rebase.
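(For reference, and assuming the Lookahead sketch above plus an inner optimiser that keeps its per-parameter buffers in a state::IdDict field as Flux's ADAM does, "maintain" vs. "reset" would amount to something like the hypothetical helper below; none of this is in the PR:)

# Maintaining the inner state = not touching `o.inner.state` at a sync step (the minimal choice).
# Resetting it would simply clear the buffers so momentum restarts from zero:
reset_inner_state!(o::Lookahead) = empty!(o.inner.state)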

CarloLucibello (Member)

So let's keep this simple and just maintain the state.

I'm saying this because, for the specific needs of this PR, we would probably need a state setter/getter API for the optimizers, so we should think about it more carefully in a separate issue/PR; then we can revisit Lookahead once we have that in place.
So I think it's better to just merge the minimal Lookahead implementation here, with no state reset.

chengchingwen (Member Author)

@DhairyaLGandhi Rebased and updated.

CarloLucibello (Member)

Sorry, we dropped the ball here. Closing since optimisers are now in Optimisers.jl.
