Diabetes Prediction Capstone

Predict diabetes using classic ML models on the Pima Indians Diabetes dataset (or a compatible healthcare CSV).

Project Structure

.
├─ README.md
├─ requirements.txt
├─ app.py                      # Streamlit app for inference
├─ .gitignore
├─ LICENSE
├─ diabetes_capstone.ipynb
├─ diabetes_best_model.joblib  # created after running notebook
└─ data/
   └─ README.md               # how to obtain/place datasets

Dataset

Preferred: the Kaggle Pima Indians Diabetes Database: https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database
Alternatively, place a compatible CSV in your Downloads folder. The notebook searches for these filenames by default:
- diabetes.csv
- Disease_symptom_and_patient_profile_dataset.csv
- healthcare_dataset.csv
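
The default search described above could be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the function name `find_dataset` and the list of search directories are assumptions, and only the three filenames come from this README.

```python
from pathlib import Path

# Filenames taken from the list above.
CANDIDATE_FILES = [
    "diabetes.csv",
    "Disease_symptom_and_patient_profile_dataset.csv",
    "healthcare_dataset.csv",
]


def find_dataset(search_dirs, candidates=CANDIDATE_FILES):
    """Return the first candidate file that exists, or None if none is found.

    Directories are checked in order; within a directory, filenames are
    checked in the order given in `candidates`.
    """
    for directory in search_dirs:
        for name in candidates:
            path = Path(directory) / name
            if path.is_file():
                return path
    return None


if __name__ == "__main__":
    # Hypothetical search locations; adjust to match your setup.
    found = find_dataset([Path.home() / "Downloads", Path("data"), Path(".")])
    print(found if found else "No dataset found in the search directories.")
```

Checking directories in a fixed order keeps the result predictable when more than one compatible CSV is present.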

Target column should be one of: Outcome, target, diabetes, or class. Adjust in the notebook if different.
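
One way to pick the target column from a CSV header is sketched below. The helper names are hypothetical, and the case-insensitive matching is an assumption made for convenience; the notebook may match names exactly.

```python
import csv

# Accepted target names, as listed in this README.
TARGET_CANDIDATES = ("Outcome", "target", "diabetes", "class")


def detect_target_column(header, candidates=TARGET_CANDIDATES):
    """Return the header name matching the first candidate found.

    Matching ignores case and surrounding whitespace (an assumption).
    Raises ValueError when no candidate is present.
    """
    lowered = {name.strip().lower(): name for name in header}
    for candidate in candidates:
        match = lowered.get(candidate.lower())
        if match is not None:
            return match
    raise ValueError(
        f"No target column found; expected one of {list(candidates)}"
    )


def read_header(csv_path):
    """Read only the first row (the header) of a CSV file."""
    with open(csv_path, newline="") as handle:
        return next(csv.reader(handle))
```

For example, `detect_target_column(read_header("diabetes.csv"))` returns `"Outcome"` for the Kaggle Pima file, whose last column has that name.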

Quickstart

Create environment and install dependencies

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt

Launch the notebook

jupyter notebook diabetes_capstone.ipynb

Run all cells

The notebook performs EDA and preprocessing, then trains Logistic Regression, Random Forest, SVM, and Gradient Boosting classifiers.
It compares their metrics and saves the best model as diabetes_best_model.joblib.
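
The train-and-compare step could look roughly like the sketch below. It is an illustration under stated assumptions, not the notebook's code: the data is synthetic (eight features, like the Pima dataset), ROC AUC with 5-fold cross-validation is an assumed selection metric, and the function name `compare_models` and all hyperparameters are placeholders.

```python
import joblib
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def build_models():
    """Return the four model families named in this README."""
    return {
        # Scaling is applied to the models that are sensitive to feature scale.
        "LogisticRegression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
        "RandomForest": RandomForestClassifier(n_estimators=100, random_state=42),
        "SVM": make_pipeline(StandardScaler(), SVC()),
        "GradientBoosting": GradientBoostingClassifier(random_state=42),
    }


def compare_models(X, y, cv=5):
    """Return the mean cross-validated ROC AUC for each model."""
    return {
        name: cross_val_score(model, X, y, cv=cv, scoring="roc_auc").mean()
        for name, model in build_models().items()
    }


if __name__ == "__main__":
    # Synthetic stand-in for the real dataset (assumption for illustration).
    X, y = make_classification(n_samples=300, n_features=8, random_state=42)
    scores = compare_models(X, y)
    best_name = max(scores, key=scores.get)
    # Refit the winner on all data and save it under the README's filename.
    best_model = build_models()[best_name].fit(X, y)
    joblib.dump(best_model, "diabetes_best_model.joblib")
    print(best_name, round(scores[best_name], 3))
```

Wrapping the scaler and estimator in one pipeline means the saved file carries its own preprocessing, so the app can call `predict` on raw feature values.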

Export to PDF (for submission)

From the notebook UI: File → Print Preview → Print to PDF
Or via CLI (nbconvert's PDF export requires a LaTeX installation that provides xelatex):

jupyter nbconvert --to pdf diabetes_capstone.ipynb

Reproducibility

All steps are captured in the notebook.
requirements.txt pins core packages for consistent runs.

License

MIT License — see LICENSE.

Streamlit App

Run locally:

pip install -r requirements.txt
streamlit run app.py

Usage:

Option 1: Upload a CSV with the same feature columns used during training.
Option 2: Use the manual input form (typical numeric features from Pima dataset).
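
Before uploading a CSV, it can help to confirm that it has the expected feature columns. The sketch below assumes the model was trained on the eight feature columns of the Kaggle Pima CSV; whether app.py performs a similar check is not stated here, and `missing_features` is a hypothetical helper.

```python
import csv
import io

# Feature columns of the Kaggle Pima Indians Diabetes CSV (all except Outcome).
PIMA_FEATURES = [
    "Pregnancies", "Glucose", "BloodPressure", "SkinThickness",
    "Insulin", "BMI", "DiabetesPedigreeFunction", "Age",
]


def missing_features(csv_text, expected=PIMA_FEATURES):
    """Return the expected feature names absent from the CSV header."""
    header = next(csv.reader(io.StringIO(csv_text)), [])
    present = {name.strip() for name in header}
    return [name for name in expected if name not in present]


if __name__ == "__main__":
    sample = "Glucose,BMI,Age\n120,31.5,42\n"
    print("Missing columns:", missing_features(sample))
```

An empty result means every expected column is present; extra columns such as Outcome are ignored by this check.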

Deployment (optional):

Push this repo to GitHub. On Streamlit Cloud (or similar), create a new app pointing to app.py.
