Skip to content

The πƒπ’πšπ›πžπ­πžπ¬ 𝐏𝐫𝐞𝐝𝐒𝐜𝐭𝐒𝐨𝐧 π’π²π¬π­πžπ¦ is a machine learning-based project designed to predict the likelihood of a person having ππ’πšπ›πžπ­πžπ¬ π›πšπ¬πžπ on key medical diagnostic measurements. It utilizes data science techniques and predictive modeling to analyze health parameters such as glucose level, blood pressure, BMI

Notifications You must be signed in to change notification settings

Abhay-art-git/Diabetes-Prediction-using-Machine-Learning-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

19 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🩺 Diabetes Prediction Using Machine Learning

πŸ“˜ Overview

The Diabetes Prediction System is a machine learning project that predicts whether a person is likely to have diabetes based on their medical and lifestyle data.
It leverages supervised learning algorithms to analyze various health parameters such as glucose level, BMI, blood pressure, insulin level, and age.

By training on medical datasets (like the PIMA Indian Diabetes Dataset), this system can help in early detection of diabetes, enabling better medical decisions and preventive measures.


🎯 Objectives

  • To analyze health-related data and identify key factors influencing diabetes.
  • To build an accurate machine learning model that can predict diabetes risk.
  • To demonstrate the power of data science in healthcare prediction and diagnosis.

🧠 Algorithms Used

  • Logistic Regression
  • Random Forest Classifier
  • Support Vector Machine (SVM)
  • K-Nearest Neighbors (KNN)
  • XGBoost (optional for advanced tuning)

The model with the best accuracy and ROC-AUC score is selected for deployment.


🧩 Dataset

The project typically uses the PIMA Indian Diabetes Dataset, available from Kaggle or UCI Machine Learning Repository.

Dataset features include:

  • Pregnancies
  • Glucose
  • Blood Pressure
  • Skin Thickness
  • Insulin
  • BMI
  • Diabetes Pedigree Function
  • Age
  • Outcome (1 = Diabetic, 0 = Non-Diabetic)

βš™οΈ Workflow

  1. Data Collection – Load and inspect the dataset.
  2. Data Preprocessing – Handle missing values, normalize data, and split into train/test sets.
  3. Model Training – Train multiple ML algorithms for comparison.
  4. Evaluation – Compare models using accuracy, precision, recall, and F1-score.
  5. Prediction – Predict whether a person has diabetes based on input features.
  6. Deployment (Optional) – Deploy using Flask/Streamlit for a web interface.

πŸ“Š Performance Metrics

  • Accuracy
  • Precision
  • Recall
  • F1 Score
  • ROC-AUC Curve

πŸ’» Technologies Used

  • Programming Language: Python
  • Libraries: NumPy, Pandas, Scikit-learn, Matplotlib, Seaborn
  • Optional Frameworks: Streamlit / Flask for Web UI
  • Dataset Source: Kaggle - PIMA Indian Diabetes Dataset

πŸš€ How to Run the Project

  1. Clone this repository:
    git clone https://github.com/Abhay-art-git/diabetes-prediction-ml.git

About

The πƒπ’πšπ›πžπ­πžπ¬ 𝐏𝐫𝐞𝐝𝐒𝐜𝐭𝐒𝐨𝐧 π’π²π¬π­πžπ¦ is a machine learning-based project designed to predict the likelihood of a person having ππ’πšπ›πžπ­πžπ¬ π›πšπ¬πžπ on key medical diagnostic measurements. It utilizes data science techniques and predictive modeling to analyze health parameters such as glucose level, blood pressure, BMI

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published