Skip to content

Conversation

singraber
Copy link
Member

Implements decoupled Kalman filter (DEKF) NN training. Different levels of decoupling are possible:

  • No decoupling (GEKF), limiting case of single weight group.
  • Element-wise decoupling (ED-GEKF), see also https://doi.org/10.1021/acs.jctc.5b00211
  • Per-layer decoupling.
  • Per-node decoupling (NDEKF).
  • Per-weight decoupling (full decoupling), limiting case.

@singraber singraber added the enhancement New feature or request label Sep 5, 2020
singraber and others added 9 commits September 6, 2020 00:55
- Weights and biases grouped by neuron.
- Simplifies node decoupling implementation => block submatrices.
- Old weight ordering available via compile flag
(-DALTERNATIVE_WEIGHT_ORDERING)
- Weight file output remains unchanged.
- Works now with limits instead of a group mask.
- Groups are assumed to be contiguous.
- Implemented rank 0 only update via new communicator.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant