We want to be able to perform some generic things on parameters, such as weight norm, weight dropout or L2 loss (see #59), in a unified and straightforward way.
When we have modules where the parameters are hidden inside the RETURNN layer (e.g. Linear), any such logic could be quite counter-intuitive, complicated and potentially even buggy. I expect that when we can directly see all parameters in the returnn-common code, this becomes much easier (see e.g. the code behind torch.nn.utils.weight_norm, which is quite simple, but would be tricky if the parameters were hidden inside RETURNN layers).
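For illustration, here is a minimal sketch of such weight-norm logic in PyTorch (simplified, only handling the norm over dim 0; this is not the actual torch.nn.utils.weight_norm implementation, and the helper name apply_weight_norm is just made up here):

```python
import torch


def apply_weight_norm(module: torch.nn.Module, name: str = "weight") -> torch.nn.Module:
    """Reparametrize module.<name> as w = g * v / ||v|| (norm over dim 0)."""
    weight = getattr(module, name)
    # Per-output-unit norm, shaped so it broadcasts against the weight.
    norm = weight.reshape(weight.shape[0], -1).norm(dim=1)
    norm = norm.reshape(-1, *([1] * (weight.dim() - 1)))
    g = torch.nn.Parameter(norm.detach().clone())
    v = torch.nn.Parameter(weight.detach().clone())
    # Replace the original parameter by g and v (same trick as torch.nn.utils.weight_norm).
    del module._parameters[name]
    module.register_parameter(name + "_g", g)
    module.register_parameter(name + "_v", v)

    def recompute(mod, inputs):
        # Recompute the effective weight before every forward pass.
        v_norm = v.reshape(v.shape[0], -1).norm(dim=1).reshape_as(g)
        setattr(mod, name, g * v / v_norm)

    recompute(module, None)  # make module.<name> available right away
    module.register_forward_pre_hook(recompute)
    return module


layer = apply_weight_norm(torch.nn.Linear(8, 16))
out = layer(torch.randn(2, 8))
```

This only works so nicely because the weight is a plain, directly visible attribute of the module; the same few lines would be awkward to write against a parameter that only exists inside a RETURNN layer.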
There are actually not many such modules:
- Linear
- Conv
- TransposedConv
- BatchNorm
- RelativePositionalEncoding
We also need to have a functional variant of the RecLayer (rwth-i6/returnn#817).
That's all. And they are all very simple to reimplement using pure functional modules, e.g. dot etc.
Specifically:
- Linear: Use dot (see the sketch after this list)
- Conv: Use the functional variant of ConvLayer
- TransposedConv: Use the functional variant of TransposedConvLayer
- BatchNorm: Reimplement, maybe even more efficiently by more directly wrapping the fused TF ops
- RelativePositionalEncoding: Reimplement anyway, see the discussion in #55 (Transformer Modules)
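As a rough sketch of the Linear case (stand-in classes only; the real returnn-common version would use its own Parameter/dot API and dim tags, numpy just stands in for the actual computation):

```python
import numpy


class Parameter:
    """Stand-in for the Variable/Parameter module wrapping a single tf.Variable."""
    def __init__(self, shape):
        self.data = numpy.zeros(shape, dtype="float32")


def dot(a, b):
    """Stand-in for the functional dot primitive."""
    return numpy.matmul(a, b)


class Linear:
    """Linear reimplemented such that its parameters are plain, visible attributes."""
    def __init__(self, in_dim: int, out_dim: int):
        self.weight = Parameter((in_dim, out_dim))
        self.bias = Parameter((out_dim,))

    def __call__(self, x):
        # Weight norm, weight dropout, L2 loss etc. can now operate directly
        # on self.weight / self.bias before this call happens.
        return dot(x, self.weight.data) + self.bias.data


out = Linear(5, 3)(numpy.ones((2, 5), dtype="float32"))
print(out.shape)  # (2, 3)
```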
So then the only module which directly wraps a tf.Variable is the Variable module (or maybe rename it to Parameter, to be more consistent with PyTorch). We can also easily implement functions like parameters() and named_parameters() for modules, and then follow very similar simple logic for things like weight norm etc. as in PyTorch.
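A minimal sketch of parameters() / named_parameters() in plain Python (names just mirroring PyTorch), once every parameter is an explicit attribute of some module:

```python
from typing import Iterator, Tuple


class Parameter:
    """Stand-in for the module wrapping a single tf.Variable."""
    def __init__(self, shape):
        self.shape = shape


class Module:
    def named_parameters(self, prefix: str = "") -> Iterator[Tuple[str, Parameter]]:
        # Recursively walk the attributes: yield Parameters directly,
        # and recurse into sub-modules with an extended name prefix.
        for name, value in vars(self).items():
            if isinstance(value, Parameter):
                yield prefix + name, value
            elif isinstance(value, Module):
                yield from value.named_parameters(prefix=prefix + name + ".")

    def parameters(self) -> Iterator[Parameter]:
        for _, param in self.named_parameters():
            yield param


class Linear(Module):
    """Example module with explicit weight/bias parameters."""
    def __init__(self, in_dim: int, out_dim: int):
        self.weight = Parameter((in_dim, out_dim))
        self.bias = Parameter((out_dim,))


class Model(Module):
    def __init__(self):
        self.encoder = Linear(5, 7)
        self.output = Linear(7, 3)


print([name for name, _ in Model().named_parameters()])
# ['encoder.weight', 'encoder.bias', 'output.weight', 'output.bias']
```

With such an iterator, the generic things from above (L2 loss over all parameters, weight dropout, weight norm) reduce to a simple loop over named_parameters(), the same way it works in PyTorch.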