> This is very nice, thanks!
>
> It would be useful to open an issue to discuss the need for the `Linear` layer here. Hopefully we can make the builtins more flexible so this kind of thing is less necessary.

*Originally posted by @MikeInnes in FluxML/model-zoo#115 (comment)*
The primary need for making a new `Linear` type was that the bias initializer only receives the output dimension. That is intuitive, but it becomes a problem when a bias initialization relies on more than the output dimension. For example, PyTorch's default `nn.Linear` layer bounds its bias initialization by `1/sqrt(fan_in)`, which depends on the input dimension. Relevant code:
```python
import math
from torch.nn import init

def reset_parameters(self):
    # Kaiming-uniform weight init with a = sqrt(5)
    init.kaiming_uniform_(self.weight, a=math.sqrt(5))
    if self.bias is not None:
        # The bias bound depends on fan_in, i.e. the *input* dimension
        fan_in, _ = init._calculate_fan_in_and_fan_out(self.weight)
        bound = 1 / math.sqrt(fan_in)
        init.uniform_(self.bias, -bound, bound)
```
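
For comparison, here is a minimal sketch of what such a custom layer might look like in Flux. The `Linear` struct and its constructor below are assumptions mirroring PyTorch's scheme, not the exact code from the linked model-zoo PR; the point is only that the bias bound needs the input dimension, which a bias initializer that receives just the output dimension cannot compute.

```julia
using Flux

# Hypothetical custom Linear layer whose bias init depends on fan-in,
# mirroring PyTorch's default scheme (a sketch, not Flux's Dense).
struct Linear{W<:AbstractMatrix,B<:AbstractVector}
    weight::W
    bias::B
end

function Linear(in::Integer, out::Integer)
    # PyTorch's kaiming_uniform_ with a = sqrt(5) reduces to
    # U(-1/sqrt(fan_in), 1/sqrt(fan_in)); the bias uses the same
    # fan_in-dependent bound.
    bound  = Float32(1 / sqrt(in))
    weight = (2 .* rand(Float32, out, in) .- 1) .* bound
    bias   = (2 .* rand(Float32, out) .- 1) .* bound
    Linear(weight, bias)
end

(l::Linear)(x) = l.weight * x .+ l.bias

Flux.@functor Linear  # register fields as trainable parameters
```

As a usage sketch, `Linear(784, 32)` builds a layer whose bias bound `1/sqrt(784)` reflects the input dimension, something an initializer that is only handed `out` has no way to express.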