
Optimise a subset of parameters #35

Closed
@mcabbott

Description


Flux's trainable works like this:

julia> Flux.trainable(BatchNorm(2, relu))  # this avoids half the parameters
(Float32[0.0, 0.0], Float32[1.0, 1.0])

julia> Functors.children(BatchNorm(2, relu))   # this sees them all, for |> gpu
(λ = NNlib.relu, β = Float32[0.0, 0.0], γ = Float32[1.0, 1.0], μ = Float32[0.0, 0.0], σ² = Float32[1.0, 1.0], ϵ = 1.0f-5, momentum = 0.1f0, affine = true, track_stats = true, active = nothing, chs = 2)

This doesn't seem great: it relies on objectid to know which parameters those really are. So something like this:

function _trainable_walk(f, x)
  func, re = functor(x)      # all children of x, plus a reconstructor
  nb = trainable(x)          # Flux-style tuple of the trainable values
  re(map(c -> c in nb ? f(c) : c, func))   # apply f only to children found in nb
end

will not work correctly when, say, β === SA[0.0, 0.0] === μ.
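To make that concrete, here is a minimal sketch of my own (using StaticArrays; not from the issue): with isbits static arrays, β and μ are indistinguishable both by objectid and by ==, so matching the children against the trainable tuple wrongly marks μ as trainable.

using StaticArrays

β = SA[0.0, 0.0]            # trainable shift
μ = SA[0.0, 0.0]            # running mean, not trainable, yet β === μ here
nb = (β, SA[1.0, 1.0])      # what a Flux-style trainable might return: (β, γ)

# Marking each child by membership in nb cannot tell β and μ apart:
map((β = β, γ = SA[1.0, 1.0], μ = μ)) do c
  c in nb ? :train : :freeze
end
# (β = :train, γ = :train, μ = :train)  -- μ is wrongly marked trainable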

How should it work?

  • One idea would be to clone the @functor macro, to have @trainable BatchNorm (β, γ)? In fact this case is even worse: the existing definition checks a field's value (affine) at run time, although we could probably move affine into the type.

  • Another idea would be just to have trainable(::BatchNorm) = (:β, :γ) return the symbols. That's much easier to write and perhaps less mysterious. It might be slower, but do we care? Or it might not be, if the symbols are known from the type. It would also be easy to allow Flux-style tuples of values as a fallback, by detecting NTuple{N,Symbol} etc., making it easier to have both old- and new-style definitions at once. A rough sketch of the symbol-based idea follows this list.
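Here is that rough sketch (my own, with illustrative names; it assumes Flux's BatchNorm and Functors.functor, and defines a fresh trainable rather than extending any package's method): trainable returns field names, and the walk reconstructs through functor, applying f only to the named children.

using Flux: BatchNorm, relu
using Functors: functor

trainable(x) = ()                      # fallback: nothing extra is trainable
trainable(::BatchNorm) = (:β, :γ)      # names, not values

function _trainable_walk(f, x)
  func, re = functor(x)                # NamedTuple of all children, plus reconstructor
  names = trainable(x)
  new = map(keys(func), values(func)) do k, c
    k in names ? f(c) : c              # match by field name, not by value
  end
  re(NamedTuple{keys(func)}(new))
end

_trainable_walk(c -> 10 .* c, BatchNorm(2, relu))  # scales only β and γ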

Whichever form it takes, trainable would be used during setup, in just one pass. After that, the tree of optimiser states should tell you whether or not to update a given array, so update need never call it.
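For illustration only (the velocity field name is made up, and this is not a claim about the package's actual representation), such a state tree for a BatchNorm might carry real state only at β and γ, with an inert marker everywhere else:

using Flux: BatchNorm, relu

model = BatchNorm(2, relu)
state = (λ = nothing, β = (velocity = zeros(Float32, 2),),
         γ = (velocity = zeros(Float32, 2),),
         μ = nothing, σ² = nothing, ϵ = nothing, momentum = nothing,
         affine = nothing, track_stats = nothing, active = nothing, chs = nothing)
# update can walk `model` and `state` in parallel and skip `nothing` entries,
# so it never needs to call trainable again after setup.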

What might call it more often is destructure, which I think should walk only the trainable parameters, and which will sometimes be called in a loop.
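As a usage sketch of the behaviour wanted here, assuming a trainable-aware destructure like the one Optimisers.jl now exports (it did not exist when this was written):

using Optimisers: destructure
using Flux: BatchNorm, relu

flat, re = destructure(BatchNorm(2, relu))
length(flat)   # expected to be 4: only β and γ (2 + 2); μ, σ², etc. stay inside re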
