
N3

Code accompanying the paper on evolving large neural networks from small ones.

This codebase is uv-compatible and pip-installable.

Authors

Anil Radhakrishnan, John F. Lindner, Scott T. Miller, Sudeshna Sinha, William L. Ditto

Link to paper

Key Results

  • Growing networks can dynamically evolve their size during gradient descent, helping to solve nonlinear regression and classification problems.
  • We present two different algorithms:
    • an auxiliary-weight algorithm, implemented in Mathematica as a conceptual proof of concept, in which an additional weight whose value dictates the size of the network is added to a standard multi-layer perceptron (MLP); see the sketch after this list
    • a more general controller-MLP scheme, implemented in Python using JAX, that separates the auxiliary weight from the network, allows more flexible architectures, and can be efficiently vectorized and parallelized in standard deep learning frameworks
  • We show improved results for growing networks over static (grown) networks in both pipelines for regression and classification tasks.
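
To make the auxiliary-weight idea concrete, here is a minimal JAX sketch in which a scalar size parameter soft-masks the hidden units of an MLP, so gradient descent can grow the effective width alongside the ordinary weights. All names here (init_params, forward, size, and so on) are illustrative assumptions, not this repository's API.

    import jax
    import jax.numpy as jnp

    def init_params(key, in_dim, max_hidden, out_dim):
        k1, k2 = jax.random.split(key)
        return {
            "W1": 0.1 * jax.random.normal(k1, (in_dim, max_hidden)),
            "b1": jnp.zeros(max_hidden),
            "W2": 0.1 * jax.random.normal(k2, (max_hidden, out_dim)),
            "b2": jnp.zeros(out_dim),
            "size": jnp.array(1.0),  # auxiliary weight: effective hidden width
        }

    def forward(params, x):
        # Soft mask: hidden units with index below `size` are (mostly) active.
        idx = jnp.arange(params["W1"].shape[1])
        mask = jax.nn.sigmoid(params["size"] - idx)
        h = jnp.tanh(x @ params["W1"] + params["b1"]) * mask
        return h @ params["W2"] + params["b2"]

    def loss(params, x, y):
        return jnp.mean((forward(params, x) - y) ** 2)

    @jax.jit
    def train_step(params, x, y, lr=1e-2):
        # The size parameter receives a gradient like any other weight.
        grads = jax.grad(loss)(params, x, y)
        return jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)

The network grows whenever enlarging size would reduce the loss, since gradient descent then pushes size upward and unmasks more hidden units.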

Installation

We recommend using uv to manage Python and install the package.

Then, you can simply git clone the repository and run,

uv pip install .

to install the package with all dependencies.
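
For example, assuming the default GitHub remote for this repository:

    git clone https://github.com/NonlinearArtificialIntelligenceLab/N3
    cd N3
    uv pip install .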

Usage

The notebooks in the nbs directory illustrate the pipeline for training and testing the growing and grown networks on regression and classification tasks. To run the notebooks, install the additional dependencies noted in the pyproject.toml file.

The scripts in the scripts directory mirror the notebooks but add argument parsing for command-line use and batch processing. They can be launched with the uv run command.

Statistical analysis of the run results can be performed with the analysis notebook in the nbs directory.

The Mathematica notebooks used for this work can be accessed in the mathematica directory.

For running grid searches and sweeps, edit the config/sweeps.yaml config file as needed and launch the sweep with scripts/run_sweep.py.
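
For example, after editing the sweep configuration:

    uv run scripts/run_sweep.py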

Code References

  • Equinox: PyTorch-like neural network modules for JAX
  • JAX: accelerator-oriented array computation and program transformation
  • Optax: gradient processing and optimization library for JAX
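
For orientation, here is a minimal sketch of how these three libraries typically fit together in a regression training step; this is standard Equinox/Optax usage, not code taken from this repository:

    import equinox as eqx
    import jax
    import jax.numpy as jnp
    import optax

    key = jax.random.PRNGKey(0)
    model = eqx.nn.MLP(in_size=1, out_size=1, width_size=16, depth=2, key=key)
    optim = optax.adam(1e-3)
    opt_state = optim.init(eqx.filter(model, eqx.is_array))

    @eqx.filter_jit
    def train_step(model, opt_state, x, y):
        def loss_fn(m):
            pred = jax.vmap(m)(x)  # vectorize the per-sample model over the batch
            return jnp.mean((pred - y) ** 2)
        loss, grads = eqx.filter_value_and_grad(loss_fn)(model)
        updates, opt_state = optim.update(grads, opt_state)
        model = eqx.apply_updates(model, updates)
        return model, opt_state, loss

    # Toy data: fit y = sin(3x) on 64 points.
    x = jnp.linspace(-1.0, 1.0, 64).reshape(-1, 1)
    y = jnp.sin(3.0 * x)
    model, opt_state, mse = train_step(model, opt_state, x, y)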
