The Diabetes Prediction System is a machine learning project that predicts whether a person is likely to have diabetes based on their medical and lifestyle data.
It leverages supervised learning algorithms to analyze various health parameters such as glucose level, BMI, blood pressure, insulin level, and age.
By training on medical datasets (like the PIMA Indian Diabetes Dataset), this system can help in early detection of diabetes, enabling better medical decisions and preventive measures.
- To analyze health-related data and identify key factors influencing diabetes.
- To build an accurate machine learning model that can predict diabetes risk.
- To demonstrate the power of data science in healthcare prediction and diagnosis.
- Logistic Regression
- Random Forest Classifier
- Support Vector Machine (SVM)
- K-Nearest Neighbors (KNN)
- XGBoost (optional for advanced tuning)
The model with the best accuracy and ROC-AUC score is selected for deployment.
The project typically uses the PIMA Indian Diabetes Dataset, available from Kaggle or UCI Machine Learning Repository.
Dataset features include:
- Pregnancies
- Glucose
- Blood Pressure
- Skin Thickness
- Insulin
- BMI
- Diabetes Pedigree Function
- Age
- Outcome (1 = Diabetic, 0 = Non-Diabetic)
- Data Collection β Load and inspect the dataset.
- Data Preprocessing β Handle missing values, normalize data, and split into train/test sets.
- Model Training β Train multiple ML algorithms for comparison.
- Evaluation β Compare models using accuracy, precision, recall, and F1-score.
- Prediction β Predict whether a person has diabetes based on input features.
- Deployment (Optional) β Deploy using Flask/Streamlit for a web interface.
- Accuracy
- Precision
- Recall
- F1 Score
- ROC-AUC Curve
- Programming Language: Python
- Libraries: NumPy, Pandas, Scikit-learn, Matplotlib, Seaborn
- Optional Frameworks: Streamlit / Flask for Web UI
- Dataset Source: Kaggle - PIMA Indian Diabetes Dataset
- Clone this repository:
git clone https://github.com/Abhay-art-git/diabetes-prediction-ml.git