
add step! #1833

Open · wants to merge 12 commits into base: master

Conversation

@FelixBenning commented Jan 13, 2022:

Add step!, as suggested in #666, as a single step of train!. This allows more exotic optimisers to simply overload step! and still be used through the train! wrapper.
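As a rough sketch of the idea (the body below is illustrative, not the PR's final code, and assumes the implicit Params style that train! used at the time):

# Hypothetical sketch: one optimisation step, factored out of train!.
# `loss` is a zero-argument closure over the current batch of data.
function step!(loss, params, opt)
  gs = Flux.gradient(loss, params)         # implicit-style gradient w.r.t. Params
  Flux.Optimise.update!(opt, params, gs)   # apply the optimiser's update rule
end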

PR Checklist

  • Tests are added
  • Entry in NEWS.md
  • Documentation, if applicable
  • API changes require approval from a committer (different from the author, if applicable)

@DhairyaLGandhi (Member) commented:

Similar to #1017, which should be complete.

@darsnack (Member) commented:

True, sorry I missed that. Let's just move ahead with this one since it has a docstring and Felix put in some time recently.

@darsnack (Member) left a review:

A couple of docstring changes; the implementation looks good to me.

Can you also update the "Custom Training loops" docs to use this new function, and add the docstring into the docs at that location?

One other suggestion is to call this trainstep!, since step! is very generic and we might want to hold onto that name for something else.

FelixBenning and others added 3 commits on January 18, 2022:
Co-authored-by: Kyle Daruwalla <daruwalla.k.public@icloud.com>
@@ -81,29 +81,29 @@ batchmemaybe(x) = tuple(x)
 batchmemaybe(x::Tuple) = x

 """
-    step!(loss, params, opt)
+    optimstep!(loss, params, opt)
@FelixBenning (Author) commented:

I suggest optimstep! instead of trainstep!, to indicate that this is the optimiser interface and to keep the ML jargon to a minimum.
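To illustrate the extension point being named here (a hypothetical optimiser, not code from this PR; ScaledGD and its update rule are invented for the example):

# Hypothetical: an exotic optimiser specialises optimstep! and
# inherits the train! loop unchanged.
struct ScaledGD <: Flux.Optimise.AbstractOptimiser
  eta::Float64
end

function optimstep!(loss, params, opt::ScaledGD)
  gs = Flux.gradient(loss, params)  # loss is a zero-argument closure
  for p in params
    p .-= opt.eta .* gs[p]          # custom update rule goes here
  end
end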

@mcabbott (Member) commented Mar 20, 2022:

One vote for something evoking train! to stress that they are closely related.

If the longer-term plan is to use Optimisers.jl, this may not fit with train! at all; see some recent discussion in #1902 (comment). In that case there will be an implicit-style train! & Params story, and an explicit-style gradient plus Optimisers.update!. With such a divide, this function wants to be clearly on the train! & Params side.

Maybe it should just be 3-arg train!? Without a data iterator, there is no iteration, that's all:

train!(loss, ::Params, data, ::AbstractOptimiser)  # calls loss(d...) for d in data
train!(loss, ::Params, ::AbstractOptimiser)        # calls loss() since there is no data
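A minimal sketch of that reduction (assuming the implicit Params API; this is a reading of the suggestion, not merged code):

# The 3-arg method is a single step: no data iterator, so one gradient
# evaluation of loss() and one optimiser update.
function train!(loss, params::Flux.Params, opt::Flux.Optimise.AbstractOptimiser)
  gs = Flux.gradient(loss, params)
  Flux.Optimise.update!(opt, params, gs)
end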

# Calculate the gradients of the parameters
# with respect to the loss function
grads = Flux.gradient(parameters) do
  loss(x, y)
end
# Update the parameters based on the chosen optimiser
Flux.Optimise.update!(opt, parameters, grads)
@FelixBenning (Author) commented:

This snippet sits right at the beginning of the docs, rather than in the Custom Training Loop section. It seems to me that section might now either be redundant or instead demonstrate how to plug in a custom gradient calculation.

FelixBenning and others added 2 commits on May 24, 2022:
Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>