
Refactor for convnets #58

Merged
milancurcic merged 14 commits into modern-fortran:main from refactor-for-convnets on May 4, 2022

Conversation

@milancurcic (Member) commented Apr 29, 2022

The original neural-fortran code was limited in application because the network type was hardcoded for dense (fully-connected) layers. This PR introduces a large refactor of the library to allow extending it to other network architectures (convolutional for imagery and model data, recurrent for time series, etc.).

Key changes:

  • Define forward and backward passes on the layer instead of the network (fixes #41: Define forward and backward passes on the layer instead of the network)
  • Weights are now defined on the same layer as biases and outputs, as is more conventional, rather than on the preceding layer as it was in the original code. There is no significant practical implication here other than that the algorithm implementation is easier to track in the conventional form.
  • Input layers are now their own layer type, similar to the Keras API. 1-d (for dense networks) and 3-d (for convolutional networks) input layers are provided under the same generic name (input); a usage sketch follows this list.
  • Losses are in their own module, although it still contains only one function (quadratic).
  • The convolutional layer is only a placeholder for now.
  • Tests, though minimal for now, are quiet (they don't produce verbose output).
  • Source file names and modules are now prefixed with nf_ instead of mod_, to minimize the chance of name clashes with other libraries that may enter the same namespace in a user application.
  • Preprocessor macros are no longer used.
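
Roughly, a minimal use of the new API looks like this. This is a sketch only; the module and procedure names assumed here (nf_network, nf_layer_constructors, network, input, dense, output) follow the nf_ convention described above but may differ slightly in the merged code:

    ! Sketch of the new Keras-style construction: an input layer followed by
    ! dense layers, assembled from an array of layer constructors.
    program dense_network_sketch
      use nf_network, only: network
      use nf_layer_constructors, only: input, dense
      implicit none
      type(network) :: net
      real :: x(784), y(10)

      net = network([input(784), dense(30), dense(10)])

      call random_number(x)
      y = net % output(x)   ! forward pass through all layers
      print *, y
    end program dense_network_sketch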

What's not there anymore:

  • Support for real64 or real128. Rationale: Not too useful to begin with, and can be easily added if anybody asks for it.
  • save and load methods to save and load pre-trained networks. Rationale: we'll be adding support for HDF5 I/O soon, and I assume most people who used save and load did it via FKB rather than the upstream neural-fortran.

A nice side effect of this refactor is that the MNIST training example runs about 135% (2.35 times) faster than the original code, most likely because this time around I was careful to minimize copies and re-allocations. This result is with ifort-2021.3 using -Ofast on an Intel E5-1650.
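
For illustration, the copy/re-allocation point boils down to patterns like the following (hypothetical code, not taken from this PR): allocate work arrays once before the training loops rather than relying on automatic reallocation of allocatable left-hand sides inside them.

    ! Hypothetical illustration of the pattern, not code from this PR.
    subroutine train_sketch(x, n_epochs)
      implicit none
      real, intent(in) :: x(:,:)         ! one input sample per column
      integer, intent(in) :: n_epochs
      real, allocatable :: activation(:)
      integer :: i, n

      allocate(activation(size(x, 1)))   ! single allocation up front

      do n = 1, n_epochs
        do i = 1, size(x, 2)
          ! Assigning to a section of an existing array copies in place;
          ! assigning to the bare allocatable may incur reallocation checks.
          activation(:) = x(:, i)
          ! ... forward pass, backward pass, weight update ...
        end do
      end do
    end subroutine train_sketch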

Known issues:

  • With higher optimization levels on GFortran (anything above -O0), the network does not converge as expected, and this is true for all 3 included examples. For example, the MNIST example reaches accuracy in the high 80% range in one epoch and then slowly drops in subsequent epochs. The same behavior occurs with GFortran 9.4.0 and 10.3.0. The issue goes away with -O0 and doesn't appear at any optimization level with ifort. I hope to diagnose and resolve this before the merge.

TODO before merging:

  • Tag and release v0.2.0 from the main branch

CC @katherbreen

@milancurcic milancurcic added the enhancement (New feature or request) label Apr 29, 2022
@milancurcic milancurcic requested a review from rouson April 29, 2022 18:54
@milancurcic milancurcic self-assigned this Apr 29, 2022
@milancurcic (Member, Author) commented:

Known issues:

With higher optimization levels on GFortran (anything above -O0), the network does not converge as expected, and this is true for all 3 included examples. For example, the MNIST example reaches accuracy in the high 80% range in one epoch and then slowly drops in subsequent epochs. The same behavior occurs with GFortran 9.4.0 and 10.3.0. The issue goes away with -O0 and doesn't appear at any optimization level with ifort. I hope to diagnose and resolve this before the merge.

Adding -fno-frontend-optimize allows GFortran to generate code that runs correctly (the examples converge) at any optimization level, including -Ofast. So -ffrontend-optimize, which is enabled by default at any optimization level above -O0, seems to cause the issue. I don't know exactly why yet. From the GFortran manual:

       -ffrontend-optimize
           This option performs front-end optimization, based on manipulating parts of the Fortran parse tree.
           Enabled by default by any -O option except -O0 and -Og. Optimizations enabled by this option include:

           * inlining calls to "MATMUL",
           * elimination of identical function calls within expressions,
           * removing unnecessary calls to "TRIM" in comparisons and assignments,
           * replacing TRIM(a) with "a(1:LEN_TRIM(a))" and
           * short-circuiting of logical operators (".AND." and ".OR.").

           It can be deselected by specifying -fno-frontend-optimize.

Of these, inlining calls to "MATMUL" and elimination of identical function calls within expressions seem like candidates for the cause of the issue. I don't know whether this list of optimizations is complete or only a subset.
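
For illustration, here is a self-contained, hypothetical example (not taken from neural-fortran) of the two kinds of expressions those candidate optimizations rewrite: a repeated function call within a single expression, and a small MATMUL that the front end may inline.

    ! Hypothetical example of expressions that -ffrontend-optimize may rewrite.
    program frontend_opt_sketch
      implicit none
      real :: w(3,3), z(3), a(3)

      call random_number(w)
      call random_number(z)

      ! Two identical calls in one expression: a candidate for
      ! "elimination of identical function calls within expressions".
      a = sigma(z) * (1.0 - sigma(z))

      ! A candidate for front-end inlining of MATMUL on small arrays.
      a = matmul(w, a)

      print *, a

    contains

      elemental function sigma(x) result(res)
        real, intent(in) :: x
        real :: res
        res = 1.0 / (1.0 + exp(-x))
      end function sigma

    end program frontend_opt_sketch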

@milancurcic milancurcic merged commit 4b211b4 into modern-fortran:main May 4, 2022
@milancurcic milancurcic deleted the refactor-for-convnets branch May 4, 2022 19:22