Differences from the paper author's official implementation

The implementation of the PyTorch version you released is different from the official version of TensorFlow released by the author. According to the official implementation published in the paper, the author's code implementation skips some parameters according to their names when calculating. But in your implementation, it seems that all parameters are directly involved in the calculation.
Their implementation:
https://github.com/tensorflow/addons/blob/master/tensorflow_addons/optimizers/lamb.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Differences from the paper author's official implementation #10

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Differences from the paper author's official implementation #10

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions