This project builds a neural network from scratch in Python to recognize handwritten digits from the MNIST dataset. The code is organized in a modular fashion, making it easy to reuse and extend in future projects.
- Neural Network from Scratch: Implemented without using any machine learning libraries.
- Modular Architecture: Separated modules for dataset handling, network architecture, layers, activations, loss functions, optimizers, and utilities.
- Training Pipeline: Includes forward and backward propagation, loss computation, weight updates, and validation (see the minimal sketch after this list).
- Testing: Unit and integration tests ensure reliability of individual components and the overall training process.
- Jupyter Notebook Integration: Interactive notebook for training, visualization, and analysis of results.
- Extensibility: Well-documented and structured for future enhancements and integration into other projects.
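The training pipeline mentioned above reduces to four steps: a forward pass, a loss computation, a backward pass, and a parameter update. The sketch below is purely illustrative; it assumes NumPy is available (check `requirements.txt`) and uses hypothetical shapes and variable names rather than the project's actual classes.

```python
import numpy as np

# Illustrative single training step for a one-hidden-layer classifier.
# Shapes: x is (batch, 784), y is (batch, 10) one-hot. All names are hypothetical.

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_step(x, y, W1, b1, W2, b2, lr=0.1):
    # Forward propagation
    h = np.maximum(0, x @ W1 + b1)                # ReLU hidden layer
    p = softmax(h @ W2 + b2)                      # class probabilities

    # Cross-entropy loss
    loss = -np.mean(np.sum(y * np.log(p + 1e-12), axis=1))

    # Backward propagation (gradients of the loss w.r.t. parameters)
    dz2 = (p - y) / x.shape[0]                    # combined softmax + cross-entropy gradient
    dW2 = h.T @ dz2
    db2 = dz2.sum(axis=0)
    dh = dz2 @ W2.T
    dz1 = dh * (h > 0)                            # ReLU derivative
    dW1 = x.T @ dz1
    db1 = dz1.sum(axis=0)

    # Gradient-descent weight update (in place)
    W1 -= lr * dW1
    b1 -= lr * db1
    W2 -= lr * dW2
    b2 -= lr * db2
    return loss
```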
```
mnist_nn/
├── README.md                      # Project overview and instructions (this file)
├── LICENSE                        # License information
├── requirements.txt               # List of required Python libraries and their versions
├── data/
│   └── mnist/                     # MNIST dataset files (download and extract here)
├── docs/
│   └── design_documentation.md    # Detailed design decisions and future extension plans
├── notebooks/
│   └── mnist_nn.ipynb             # Jupyter Notebook for interactive training and visualization
└── src/
    ├── dataset/                   # Module for loading and preprocessing the MNIST dataset
    │   └── dataset.py
    ├── neuralnet/                 # Neural network core module
    │   ├── __init__.py
    │   ├── network.py             # NeuralNetwork class for training and inference
    │   ├── layers.py              # Dense layer implementation and others
    │   ├── activations.py         # Activation functions and their derivatives
    │   ├── losses.py              # Loss functions and gradients (e.g., cross-entropy)
    │   ├── optimizers.py          # Simple optimizers (e.g., gradient descent)
    │   └── utils.py               # Utility functions (e.g., weight initialization, one-hot encoding)
    └── tests/                     # Unit and integration tests
        └── test_network.py
```
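The utilities listed for `src/neuralnet/utils.py` (weight initialization, one-hot encoding) are standard building blocks. For orientation only, here is a hedged sketch of what such helpers typically look like; the actual function names and signatures in the module may differ.

```python
import numpy as np

# Hypothetical helpers in the spirit of src/neuralnet/utils.py; names are illustrative.

def one_hot(labels, num_classes=10):
    """Convert integer labels of shape (N,) into one-hot vectors of shape (N, num_classes)."""
    out = np.zeros((labels.shape[0], num_classes))
    out[np.arange(labels.shape[0]), labels] = 1.0
    return out

def he_init(fan_in, fan_out, seed=0):
    """He/Kaiming-style weight initialization, a common default for ReLU layers."""
    rng = np.random.default_rng(seed)
    return rng.normal(0.0, np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))
```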
- Clone the Repository:

  ```bash
  git clone https://github.com/rishimule/mnist_nn.git
  cd mnist_nn
  ```
- Set Up the Conda Environment:

  Create a new conda environment with Python 3.11 and install the required dependencies:

  ```bash
  conda create -n mnist_nn python=3.11
  conda activate mnist_nn
  pip install -r requirements.txt
  ```
- Download the MNIST Dataset:

  Navigate to the `data/mnist` directory and run the following commands to download and extract the dataset:

  ```bash
  cd data/mnist
  wget http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz
  wget http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz
  wget http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz
  wget http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz
  gunzip *.gz
  cd ../../
  ```

  or

  ```bash
  cd data/mnist
  wget https://ossci-datasets.s3.amazonaws.com/mnist/train-images-idx3-ubyte.gz
  wget https://ossci-datasets.s3.amazonaws.com/mnist/train-labels-idx1-ubyte.gz
  wget https://ossci-datasets.s3.amazonaws.com/mnist/t10k-images-idx3-ubyte.gz
  wget https://ossci-datasets.s3.amazonaws.com/mnist/t10k-labels-idx1-ubyte.gz
  gunzip *.gz
  cd ../../
  ```
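The downloaded files use the IDX binary format: a big-endian header (magic number and dimension sizes) followed by raw bytes. As a rough illustration of how a loader such as `src/dataset/dataset.py` might read them, here is a hedged sketch; the function names are hypothetical and the project's actual loader and preprocessing may differ.

```python
import struct
import numpy as np

# Illustrative IDX readers; function names are hypothetical, not the project's API.

def read_idx_images(path):
    """Read an IDX3 image file into a (num_images, 28*28) float array scaled to [0, 1]."""
    with open(path, "rb") as f:
        magic, num, rows, cols = struct.unpack(">IIII", f.read(16))  # 16-byte header
        data = np.frombuffer(f.read(), dtype=np.uint8)
    return data.reshape(num, rows * cols).astype(np.float32) / 255.0

def read_idx_labels(path):
    """Read an IDX1 label file into a (num_labels,) integer array."""
    with open(path, "rb") as f:
        magic, num = struct.unpack(">II", f.read(8))                 # 8-byte header
        return np.frombuffer(f.read(), dtype=np.uint8).astype(np.int64)

# Example usage after the download step above:
# x_train = read_idx_images("data/mnist/train-images-idx3-ubyte")
# y_train = read_idx_labels("data/mnist/train-labels-idx1-ubyte")
```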
You can train the network using the provided training script:

```bash
python -m src.train
```

This script will load the MNIST dataset, initialize the network architecture, run the training loop, and evaluate the model on a test set.
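For orientation, such a script typically wires the modules together along the lines below. The imports, class, and method names (`load_mnist`, `NeuralNetwork`, `fit`, `evaluate`) are hypothetical placeholders, not necessarily the project's actual API; consult the source under `src/` and the notebook for the real interfaces.

```python
# Hypothetical outline of a training script; names are placeholders, not the project's API.
from src.dataset.dataset import load_mnist          # assumed loader, signature may differ
from src.neuralnet.network import NeuralNetwork     # class name taken from the tree above

def main():
    # 1. Load and preprocess the data
    x_train, y_train, x_test, y_test = load_mnist("data/mnist")

    # 2. Build the architecture (e.g., 784 -> 128 -> 10)
    net = NeuralNetwork(layer_sizes=[784, 128, 10])

    # 3. Train and validate
    net.fit(x_train, y_train, epochs=10, batch_size=64, learning_rate=0.1)

    # 4. Evaluate on the held-out test set
    print("test accuracy:", net.evaluate(x_test, y_test))

if __name__ == "__main__":
    main()
```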
To run the interactive Jupyter Notebook:
- Start Jupyter Notebook:

  ```bash
  jupyter notebook
  ```

- Open `notebooks/mnist_nn.ipynb` to view and run the notebook cells, which include dataset loading, model configuration, training, and visualization of results.
To run the unit and integration tests, execute the following command from the project root:
```bash
python -m src.tests.test_network
```
This will run tests for activation functions, loss functions, layer operations, and the overall training pipeline.
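A common way to test such components is to compare analytical gradients against numerical (finite-difference) gradients. The snippet below is a generic illustration of that idea, not a copy of `test_network.py`; the sigmoid check is a stand-in for whichever functions the test suite actually covers.

```python
import numpy as np

# Generic finite-difference gradient check; an illustration, not the project's test code.
def numerical_grad(f, x, eps=1e-5):
    """Estimate d f(x) / d x element-wise via central differences."""
    grad = np.zeros_like(x)
    it = np.nditer(x, flags=["multi_index"])
    while not it.finished:
        i = it.multi_index
        orig = x[i]
        x[i] = orig + eps
        plus = f(x)
        x[i] = orig - eps
        minus = f(x)
        x[i] = orig
        grad[i] = (plus - minus) / (2 * eps)
        it.iternext()
    return grad

def test_sigmoid_gradient():
    sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
    x = np.random.randn(3, 4)
    analytic = sigmoid(x) * (1 - sigmoid(x))                  # derivative of sigmoid
    numeric = numerical_grad(lambda z: sigmoid(z).sum(), x)   # sum makes f scalar-valued
    assert np.allclose(analytic, numeric, atol=1e-6)
```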
- Additional Layer Types: Extend `layers.py` to include convolutional, pooling, or recurrent layers.
- Advanced Optimizers: Implement optimizers like Adam or RMSProp in `optimizers.py` (see the sketch after this list).
- Enhanced Data Handling: Incorporate data augmentation and advanced preprocessing techniques.
- Dynamic Configuration: Enable dynamic network configuration using external configuration files or command-line arguments.
- Integration: The modular design allows easy integration of this codebase into larger projects or frameworks.
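As a starting point for the optimizer item above, here is a hedged sketch of the standard Adam update rule (Kingma & Ba, 2015). The class interface is hypothetical and would need to be adapted to whatever conventions `optimizers.py` already uses.

```python
import numpy as np

class Adam:
    """Standard Adam update; the interface here is a hypothetical example, not the project's API."""
    def __init__(self, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m, self.v, self.t = {}, {}, 0    # first/second moment estimates and step counter

    def update(self, params, grads):
        """Update each parameter array in `params` in place using its gradient in `grads`."""
        self.t += 1
        for key, g in grads.items():
            self.m[key] = self.beta1 * self.m.get(key, 0) + (1 - self.beta1) * g
            self.v[key] = self.beta2 * self.v.get(key, 0) + (1 - self.beta2) * g * g
            m_hat = self.m[key] / (1 - self.beta1 ** self.t)   # bias-corrected first moment
            v_hat = self.v[key] / (1 - self.beta2 ** self.t)   # bias-corrected second moment
            params[key] -= self.lr * m_hat / (np.sqrt(v_hat) + self.eps)
```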
This project is licensed under the MIT License.
Contributions are welcome! Please fork the repository and submit a pull request for any enhancements or bug fixes.
- MNIST Dataset (Yann LeCun)
- Stack Overflow
- This project was developed as an educational tool to demonstrate building neural networks from scratch.