Closed
Description
It would be nice to have a easy and rule independent way of changing the learning rate like in Flux.jl.
Right now the best way i can think of is making another state tuple while changing the first argument but then changing the learning rate is not rule independent.
@mcabbott suggested
Although perhaps there ought to be one, like
st = setup(rule, st)
without zero-ing the momenta. Maybe make an issue?
related: #15 (seems to be not active)
Metadata
Metadata
Assignees
Labels
No labels