heart-disease-ml-classifier

A PyTorch model that predicts the risk of heart disease from a combination of symptoms, lifestyle factors, and medical history. It is built on a dataset of 70,000+ samples and achieves approximately 99.27% accuracy on test data.

Download the model here

Kaggle Dataset link

Overview

This project uses PyTorch to build a neural network classifier for heart disease risk prediction. The model analyzes several medical predictor variables to determine if a patient is at risk of heart disease.

Dataset

The dataset contains medical predictor variables from the heart_disease_risk_dataset_earlymed.csv file.

It contains 18 medical predictors of heart disease:

Chest Pain: Presence of chest pain (Yes/No)
Shortness of Breath: Difficulty breathing (Yes/No)
Fatigue: Feeling of tiredness (Yes/No)
Palpitations: Irregular heartbeat sensations (Yes/No)
Dizziness: Feeling lightheaded (Yes/No)
Swelling: Edema in extremities (Yes/No)
Pain Arms Jaw Back: Pain radiating to arms/jaw/back (Yes/No)
Cold Sweats Nausea: Presence of cold sweats or nausea (Yes/No)
High BP: High blood pressure diagnosis (Yes/No)
High Cholesterol: High cholesterol diagnosis (Yes/No)
Diabetes: Presence of diabetes (Yes/No)
Smoking: Current smoking status (Yes/No)
Obesity: Obesity status (Yes/No)
Sedentary Lifestyle: Physical inactivity (Yes/No)
Family History: Family history of heart disease (Yes/No)
Chronic Stress: Ongoing stress condition (Yes/No)
Gender: Patient's gender (Male/Female)
Age: Age of patient in years

Output variable:

Risk: Risk of Heart Disease (low/high)
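
The predictors above have to be numeric before they can be fed to a neural network. The following is a minimal sketch of one way to encode them with pandas. The two-row sample, the column names, and the Yes/No, Male/Female, and low/high mappings are assumptions based on the list above; the actual CSV headers and value encoding may differ.

```python
import pandas as pd

# Hypothetical two-row sample. Column names and string values mirror the
# README's feature list and are NOT confirmed to match the real CSV.
sample = pd.DataFrame(
    {
        "Chest Pain": ["Yes", "No"],
        "Smoking": ["No", "Yes"],
        "Gender": ["Male", "Female"],
        "Age": [54, 37],
        "Risk": ["high", "low"],
    }
)

# Assumed mappings from categorical labels to numbers.
yes_no = {"Yes": 1.0, "No": 0.0}
gender = {"Male": 1.0, "Female": 0.0}
risk = {"low": 0, "high": 1}

encoded = sample.copy()
for column in ["Chest Pain", "Smoking"]:
    encoded[column] = encoded[column].map(yes_no)
encoded["Gender"] = encoded["Gender"].map(gender)
encoded["Risk"] = encoded["Risk"].map(risk)

features = encoded.drop(columns=["Risk"])  # model inputs
labels = encoded["Risk"]                   # integer class index per row
```

With the full dataset, the same idea would apply to all 18 predictor columns, leaving Age as a plain number.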

Requirements

Python 3.8+
PyTorch
pandas
scikit-learn
matplotlib

Usage

Clone the repository
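Create a virtual environment with py -m venv .venv and activate it with .venv/Scripts/activate (these are Windows commands; on macOS or Linux, use python3 -m venv .venv and source .venv/bin/activate)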
Install dependencies: pip install -r requirements.txt
Run the model: python main.py

Model Architecture

Input layer: 18 features
Hidden layer 1: 64 neurons with ReLU activation
Hidden layer 2: 28 neurons with ReLU activation
Output layer: 2 neurons (Binary classification)
Optimization: Adam optimizer with learning rate 0.005
Loss function: Cross Entropy Loss
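
The architecture listed above can be sketched in a few lines of PyTorch. The layer sizes, optimizer, learning rate, and loss function come from the list; expressing the network as an nn.Sequential is an assumption, and main.py may organize it differently (for example, as an nn.Module subclass).

```python
import torch
from torch import nn

# Layer sizes follow the README. The nn.Sequential layout is an assumption
# about how the network is defined in main.py.
model = nn.Sequential(
    nn.Linear(18, 64),  # input layer -> hidden layer 1
    nn.ReLU(),
    nn.Linear(64, 28),  # hidden layer 1 -> hidden layer 2
    nn.ReLU(),
    nn.Linear(28, 2),   # raw logits for the two risk classes
)

# CrossEntropyLoss expects raw logits and integer class labels,
# so no softmax layer is needed at the end of the network.
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.005)

# Count trainable weights and biases across the three linear layers.
param_count = sum(p.numel() for p in model.parameters())
print(param_count)  # 3094
```

The count breaks down as (18*64 + 64) + (64*28 + 28) + (28*2 + 2) = 3094, which makes this a very small network.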

Results

The model is trained for 1000 epochs. Training progress is visualized in a loss plot that is generated automatically and saved as 'loss_plot.png'. The model achieves approximately 99.27% accuracy on the test set, and a fixed random seed (392) makes the results reproducible.
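
As an illustration of the training and evaluation procedure described above, here is a minimal sketch of a seeded full-batch training loop. It runs on random placeholder data with a synthetic labeling rule and only 20 epochs, so it does not reproduce the reported accuracy; the data, the 160/40 split, and the epoch count are all assumptions made for the example.

```python
import torch
from torch import nn

torch.manual_seed(392)  # seed value stated in the README

# Random placeholder data standing in for the real dataset (assumption).
X = torch.rand(200, 18)
y = (X[:, 0] > 0.5).long()  # synthetic labeling rule, for illustration only

# Simple hold-out split: first 160 rows for training, last 40 for testing.
X_train, y_train = X[:160], y[:160]
X_test, y_test = X[160:], y[160:]

model = nn.Sequential(
    nn.Linear(18, 64), nn.ReLU(),
    nn.Linear(64, 28), nn.ReLU(),
    nn.Linear(28, 2),
)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.005)

losses = []
for epoch in range(20):  # the README reports 1000 epochs on the real data
    optimizer.zero_grad()
    loss = criterion(model(X_train), y_train)
    loss.backward()
    optimizer.step()
    losses.append(loss.item())  # values like these would feed the loss plot

# Evaluate on the held-out rows without tracking gradients.
model.eval()
with torch.no_grad():
    predictions = model(X_test).argmax(dim=1)
    accuracy = (predictions == y_test).float().mean().item()
```

Setting the seed before creating the data and the model is what fixes the initial weights and makes repeated runs comparable.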

Model Persistence

The trained model is saved to 'heart_disease_classifier_model.pth' for later use.
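
A saved .pth file can be reloaded for inference later. The sketch below assumes the file holds a state_dict (the weights only), which is a common PyTorch convention; the README does not say whether main.py saves a state_dict or the whole model object. The build_model helper is hypothetical, and the file is written to a temporary directory so the example does not overwrite the real checkpoint.

```python
import os
import tempfile

import torch
from torch import nn


def build_model():
    """Hypothetical helper that rebuilds the architecture from the README."""
    return nn.Sequential(
        nn.Linear(18, 64), nn.ReLU(),
        nn.Linear(64, 28), nn.ReLU(),
        nn.Linear(28, 2),
    )


model = build_model()

# Save only the learned weights (assumed format, see the note above).
path = os.path.join(tempfile.mkdtemp(), "heart_disease_classifier_model.pth")
torch.save(model.state_dict(), path)

# Rebuild the same architecture, then load the saved weights into it.
restored = build_model()
restored.load_state_dict(torch.load(path))
restored.eval()  # switch to inference mode

# Both models should now produce identical outputs for the same input.
sample = torch.zeros(1, 18)
with torch.no_grad():
    same = torch.equal(model(sample), restored(sample))
```

Loading a state_dict requires the architecture to be rebuilt in code first, which is why the layer sizes must match the saved weights exactly.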

License

MIT License

aAa1928/heart-disease-ml-classifier

Folders and files

Latest commit

History

Repository files navigation

heart-disease-ml-classifier

Overview

Dataset

Requirements

Usage

Model Architecture

Results

Model Persistence

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages