- Project Overview
- Dataset
- Models
- Project Structure
- Installation
- Usage
- Results
- Documentation
- Contributing
This repository contains machine learning and deep learning models designed to predict customer spending patterns. The project demonstrates the application of data preprocessing, feature engineering, and model development to forecast customer expenditure.
The project uses a custom dataset (MOCK_DATA.csv
) containing customer information. This data is cleaned and preprocessed to create cleaned_customer_data.csv
, which is used for training and testing our models.
Our deep learning model uses TensorFlow and Keras:
- Multiple dense layers with dropout for regularization
- ReLU activation for hidden layers, linear activation for output layer
- Adam optimizer
- L2 regularization and dropout to prevent overfitting
For details, see Deep Learning Model Details.
The ensemble model combines multiple algorithms:
- Random Forest, Gradient Boosting, and Neural Network regressors
- Voting Regressor as the ensemble method
For more information, refer to Ensemble Model Details.
.
├── datasets/
│ ├── MOCK_DATA.csv
│ └── cleaned_customer_data.csv
├── keras/
│ ├── deeplearn.py
│ └── info.md
├── scikitlearn/
│ ├── ensemble.py
│ └── info.md
├── preprocess/
│ ├── datacleaner.py
│ └── info.md
├── README.md
└── requirements.txt
-
Clone the repository:
git clone https://github.com/cloaky233/NeuralNetworks.git cd NeuralNetworks
-
Install required packages:
pip install -r requirements.txt
-
Data Preprocessing:
python preprocess/datacleaner.py
-
Run Deep Learning Model:
python keras/deeplearn.py
-
Run Ensemble Model:
python scikitlearn/ensemble.py
The performance metrics for each model are printed after running the respective scripts. These include Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE).
Contributions to improve the project are welcome. Please fork the repository and submit a pull request with your proposed changes.