Machine Failure Prediction

This project predicts machine failures using logistic regression. The dataset is downloaded from Kaggle and contains sensor readings collected from various machines. The notebook covers data preprocessing, feature scaling, model training from scratch, evaluation, and visualization of results. Machine failure prediction is important for preventing unnecessary maintenance costs, scheduling maintenance during non-operational hours, and keeping productivity high. An ML model like this one can help detect failures early and make use of sensor data as it arrives in real time.

Link to the dataset: https://www.kaggle.com/datasets/umerrtx/machine-failure-prediction-using-sensor-data

Link to my Kaggle notebook: https://www.kaggle.com/code/specialterminator69/regularized-logistic-regression-from-scratch

Overview

The notebook contains the following key steps:

Data Import and Exploration
Data Preprocessing
Handling Imbalanced Dataset
Feature Scaling
Data Splitting
Model Training
Model Evaluation
Visualization of Results

Prerequisites

To run the code in this notebook, install the libraries listed in the requirements.txt file. Place the file in your working directory and run the following command in the terminal:

pip install -r requirements.txt

Setup

Clone this repository or download the notebook file.
Ensure you have the required libraries installed.
Place the data.csv file in the same directory as the notebook.

Steps

Data Import and Exploration:

The data is imported using pandas, and basic exploration is conducted to understand the structure, data types, and summary statistics. Duplicate rows are identified and counted.

Data Preprocessing:

Any duplicate rows found in the dataset are dropped to ensure data quality.
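
A minimal sketch of the duplicate check and removal described in the two steps above, assuming pandas. The small DataFrame and its column names are hypothetical stand-ins; in the notebook the frame would instead be loaded from data.csv.

```python
import pandas as pd

# Hypothetical stand-in for the sensor data; column names are illustrative only.
df = pd.DataFrame({
    "sensor_a": [1.0, 2.0, 2.0, 3.0],
    "sensor_b": [0.5, 0.7, 0.7, 0.9],
    "fail":     [0, 1, 1, 0],
})

# Count fully duplicated rows (every occurrence after the first one).
n_duplicates = int(df.duplicated().sum())

# Drop the duplicates and renumber the index so later positional code stays simple.
df_clean = df.drop_duplicates().reset_index(drop=True)

print(f"duplicates found: {n_duplicates}, rows kept: {len(df_clean)}")
```

Counting before dropping makes it easy to confirm afterwards that exactly that many rows disappeared.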

Handling Imbalanced Dataset:

The dataset is checked for class imbalance in the target variable. SMOTE (Synthetic Minority Over-sampling Technique) is used to balance the dataset by oversampling the minority class.
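
To illustrate what SMOTE does, here is a simplified NumPy sketch of its core idea: each synthetic point is placed on the line segment between a minority sample and one of its nearest minority neighbours. This is not the notebook's code and not a library implementation; the function name `smote_like_oversample`, its parameters, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def smote_like_oversample(X_min, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority
    samples and their k nearest minority neighbours (simplified SMOTE)."""
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)

    # Pairwise Euclidean distances within the minority class.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    base = rng.integers(0, len(X_min), size=n_new)              # random minority samples
    picked = neighbours[base, rng.integers(0, k, size=n_new)]   # one neighbour for each
    gap = rng.random((n_new, 1))                                # interpolation factor in [0, 1)
    return X_min[base] + gap * (X_min[picked] - X_min[base])

# Toy minority class with four samples (hypothetical values).
X_minority = np.array([[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]])
X_synthetic = smote_like_oversample(X_minority, n_new=6)
print(X_synthetic.shape)
```

Because every synthetic point is an interpolation between two real minority samples, the new points stay inside the region the minority class already occupies rather than being exact copies.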

Feature Scaling:

The features are scaled using MinMaxScaler to bring all values into the range between 0 and 1. Putting the features on a comparable scale helps gradient descent converge more reliably.
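
The min-max transformation maps each feature x to (x - min) / (max - min). The sketch below reproduces that formula with plain NumPy so the arithmetic is visible; the helper names `fit_min_max` and `apply_min_max` and the sample values are hypothetical, not taken from the notebook.

```python
import numpy as np

def fit_min_max(X):
    """Learn the per-column minimum and range from X."""
    X = np.asarray(X, dtype=float)
    col_min = X.min(axis=0)
    col_range = X.max(axis=0) - col_min
    col_range[col_range == 0] = 1.0  # constant columns map to 0 instead of dividing by zero
    return col_min, col_range

def apply_min_max(X, col_min, col_range):
    """Scale X with previously learned statistics."""
    return (np.asarray(X, dtype=float) - col_min) / col_range

# Hypothetical readings: two features on very different scales.
X = np.array([[10.0, 200.0], [20.0, 400.0], [15.0, 300.0]])
col_min, col_range = fit_min_max(X)
X_scaled = apply_min_max(X, col_min, col_range)
print(X_scaled)
```

Keeping the fit and apply steps separate allows the same minimum and range to be reused on new data later.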

Data Splitting:

The dataset is split into training and testing sets using a 70:30 ratio, so the model is evaluated on data it did not see during training.
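
A shuffled 70:30 split can be sketched with NumPy as follows. The function name `shuffle_split`, the seed, and the toy arrays are assumptions for illustration; the notebook may use a library helper instead.

```python
import numpy as np

def shuffle_split(X, y, test_fraction=0.3, seed=42):
    """Shuffle the row indices, then hold out test_fraction of them for testing."""
    rng = np.random.default_rng(seed)
    indices = rng.permutation(len(X))
    n_test = int(round(len(X) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    return X[train_idx], X[test_idx], y[train_idx], y[test_idx]

# Ten hypothetical samples with two features each.
X = np.arange(20, dtype=float).reshape(10, 2)
y = np.array([0, 1] * 5)

X_train, X_test, y_train, y_test = shuffle_split(X, y)
print(len(X_train), len(X_test))
```

Fixing the seed keeps the split reproducible between runs.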

Model Training:

A regularized logistic regression model is trained from scratch using gradient descent. Functions for computing cost, gradients, and applying gradient descent are defined and used to optimize the model parameters.
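
The three pieces mentioned above (cost, gradients, gradient descent) can be sketched as below. This is an illustrative version, not the notebook's code: it assumes L2 regularization applied to the weights but not the bias, and the function names, learning rate `alpha`, regularization strength `lambda_`, and toy data are all hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def compute_cost(X, y, w, b, lambda_):
    """Mean cross-entropy plus an L2 penalty on the weights (assumed form)."""
    m = len(y)
    p = np.clip(sigmoid(X @ w + b), 1e-12, 1 - 1e-12)  # avoid log(0)
    cross_entropy = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
    return cross_entropy + (lambda_ / (2 * m)) * np.sum(w ** 2)

def compute_gradient(X, y, w, b, lambda_):
    """Gradients of the regularized cost with respect to w and b."""
    m = len(y)
    error = sigmoid(X @ w + b) - y
    dw = (X.T @ error) / m + (lambda_ / m) * w
    db = np.mean(error)
    return dw, db

def gradient_descent(X, y, alpha=0.5, lambda_=0.1, iterations=2000):
    """Start from zero parameters and repeatedly step against the gradient."""
    w, b = np.zeros(X.shape[1]), 0.0
    history = []
    for _ in range(iterations):
        dw, db = compute_gradient(X, y, w, b, lambda_)
        w, b = w - alpha * dw, b - alpha * db
        history.append(compute_cost(X, y, w, b, lambda_))
    return w, b, history

# Toy, already-scaled data: low readings are healthy (0), high readings fail (1).
X = np.array([[0.1], [0.2], [0.3], [0.7], [0.8], [0.9]])
y = np.array([0, 0, 0, 1, 1, 1])

w, b, history = gradient_descent(X, y)
predictions = (sigmoid(X @ w + b) >= 0.5).astype(int)
print(predictions, round(history[-1], 4))
```

Recording the cost at every iteration gives a quick check that the learning rate is reasonable: the values in `history` should decrease steadily.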

Model Evaluation:

The model is evaluated using various metrics such as accuracy, precision, recall, and F1 score. A confusion matrix and classification report are generated to provide detailed insights into the model's performance on each class.
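
For reference, all four metrics follow from the counts in the confusion matrix. The sketch below computes them directly with NumPy; the function name `binary_metrics` and the label vectors are hypothetical and are not the notebook's results.

```python
import numpy as np

def binary_metrics(y_true, y_pred):
    """Confusion-matrix counts and the metrics derived from them (positive class = 1)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {
        "confusion_matrix": [[tn, fp], [fn, tp]],  # rows: actual 0/1, columns: predicted 0/1
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

# Hypothetical labels, chosen only to exercise every cell of the matrix.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```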

Visualization of Results:

Confusion matrix and evaluation scores are visualized using matplotlib and seaborn. Bar plots are used to display accuracy, precision, recall, and F1 score for easy comparison.

Results

The model performed well on both classes, reaching 92% accuracy, precision, recall, and F1 score. The confusion matrix shows only a small number of misclassified examples.

Conclusion

This notebook provides a comprehensive approach to predicting machine failures using logistic regression implemented from scratch. Follow the steps to preprocess data, train the model, evaluate its performance, and visualize the results.
