💊 Drug Prediction — Decision Tree

A clear and interpretable baseline for predicting drug categories from patient features with a Decision Tree classifier. It is designed to be interview-friendly, emphasizing clarity, step-by-step decisions, and interpretability.

📂 Project Structure

├── Drug_Prediction_DecisionTree_polished.ipynb # Notebook with the full workflow
└── README.md # Project documentation

⚙️ Skills & Tech

Python, Jupyter Notebook
pandas, NumPy — Data handling
matplotlib, seaborn — Visualization
scikit-learn — DecisionTreeClassifier, model evaluation
EDA, Preprocessing, Model interpretation

📝 Project Overview

This notebook demonstrates a complete machine learning workflow for predicting drug categories:

Exploratory Data Analysis (EDA) – Inspect dataset distribution and patterns
Preprocessing – Encoding categorical features, handling data types
Model Training – Decision Tree Classifier
Evaluation – Accuracy, interpretability, decision paths
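
The four steps above can be sketched end to end as follows. This is a minimal illustration, not the notebook's actual code: the data is synthetic, the column names (`Age`, `Sex`, `BP`, `Cholesterol`, `Na_to_K`, `Drug`) and category spellings are assumptions, and the labeling rule is invented so the tree has a pattern to learn.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OrdinalEncoder
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (NOT the real dataset); column names are assumed.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "Age": rng.integers(15, 75, n),
    "Sex": rng.choice(["F", "M"], n),
    "BP": rng.choice(["LOW", "NORMAL", "HIGH"], n),
    "Cholesterol": rng.choice(["NORMAL", "HIGH"], n),
    "Na_to_K": rng.uniform(6, 38, n).round(2),
})
# Toy labeling rule invented for this demo only.
df["Drug"] = np.where(
    df["Na_to_K"] > 15, "DrugY",
    np.where(df["BP"] == "HIGH", "DrugA", "DrugX"),
)

# Hold out a stratified test set so class proportions match in both splits.
X, y = df.drop(columns="Drug"), df["Drug"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42
)

# Encode the categorical columns, pass numeric ones through, then fit the tree.
categorical = ["Sex", "BP", "Cholesterol"]
model = Pipeline([
    ("encode", ColumnTransformer(
        [("cat", OrdinalEncoder(), categorical)], remainder="passthrough")),
    ("tree", DecisionTreeClassifier(max_depth=4, random_state=42)),
])
model.fit(X_train, y_train)

accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Test accuracy on the synthetic data: {accuracy:.2f}")
```

Wrapping the encoder and the classifier in one `Pipeline` keeps the encoding fitted on the training split only, which avoids leaking information from the test rows.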

📊 Dataset

Features:
- Age: Age of the patient
- Sex: Male/Female
- Blood Pressure: Low / Normal / High
- Cholesterol: Normal / High
- Na_to_K ratio: Sodium-to-Potassium ratio in the blood
Target: Drug type (DrugA, DrugB, DrugC, DrugX, DrugY)
Size: 200 samples
Source: UCI / educational dataset
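
Since Blood Pressure and Cholesterol have a natural order, one reasonable way to encode them is with explicit, ordered integer codes. The sketch below assumes hypothetical column names and category spellings (`F`/`M`, `LOW`/`NORMAL`/`HIGH`); the rows are made up for illustration and are not taken from the dataset.

```python
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder

# Made-up rows for illustration only; NOT taken from the real dataset.
# Column names and category spellings are assumptions.
df = pd.DataFrame({
    "Age": [23, 47, 61, 35],
    "Sex": ["F", "M", "M", "F"],
    "BP": ["HIGH", "LOW", "NORMAL", "LOW"],
    "Cholesterol": ["HIGH", "NORMAL", "HIGH", "NORMAL"],
    "Na_to_K": [25.4, 13.1, 9.8, 17.2],
})

categorical = ["Sex", "BP", "Cholesterol"]
# Listing the categories explicitly fixes the integer order (LOW < NORMAL < HIGH)
# instead of relying on the encoder's default alphabetical sort.
encoder = OrdinalEncoder(categories=[
    ["F", "M"],
    ["LOW", "NORMAL", "HIGH"],
    ["NORMAL", "HIGH"],
])

encoded = df.copy()
encoded[categorical] = encoder.fit_transform(df[categorical]).astype(int)
print(encoded)
```

With an ordered encoding, a tree split such as `BP <= 0.5` reads directly as "blood pressure is Low", which keeps the decision paths easy to interpret.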

▶️ How to Run

Clone or download this repository:

git clone https://github.com/Shamir-Havas/Drug-Prediction-Decision-Tree.git
cd Drug-Prediction-Decision-Tree

Install dependencies:

pip install -r requirements.txt

Open Jupyter Notebook and run the workflow:

jupyter notebook Drug_Prediction_DecisionTree_polished.ipynb

Run all cells:

Kernel → Restart & Run All

📊 Results

🔹 Category Counts (figure: category_counts.png)

🔹 Decision Tree Visualization (figure: decision_tree.png)

🔹 Model Accuracy (figure: accuracy.png)

🔍 Model Explainability

Decision Tree Visualization: Interpretable decision paths using plot_tree

Classification Report: Precision, recall, F1-score
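
Both explainability outputs can be sketched in a few lines. The notebook uses `plot_tree`; this example swaps in `export_text`, scikit-learn's text counterpart, so it runs without a plotting backend. The data is synthetic, and the feature names, the `BP_code` encoding, and the labeling rule are assumptions made for the demo.

```python
import numpy as np
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# Synthetic, already-encoded features (NOT the real dataset).
rng = np.random.default_rng(1)
n = 200
na_to_k = rng.uniform(6, 38, n)
bp_code = rng.integers(0, 3, n)  # assumed encoding: 0=Low, 1=Normal, 2=High
X = np.column_stack([na_to_k, bp_code])
# Toy labeling rule invented for this demo only.
y = np.where(na_to_k > 15, "DrugY", np.where(bp_code == 2, "DrugA", "DrugX"))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

# Text rendering of the decision paths (plot_tree draws the same structure).
rules = export_text(tree, feature_names=["Na_to_K", "BP_code"])
print(rules)

# Per-class precision, recall, and F1-score on the held-out rows.
report = classification_report(y_test, tree.predict(X_test), zero_division=0)
print(report)
```

Each indented line in the printed rules is one split, so a prediction can be traced from the root to a leaf by following the thresholds.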

🚀 Future Improvements

Hyperparameter tuning with GridSearchCV / RandomizedSearchCV

Cross-validation (e.g., Stratified K-Fold) for robustness

Try ensemble methods (Random Forest, XGBoost)

Domain-specific validation & feature engineering
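
As a starting point for the first two items, a grid search over tree hyperparameters with stratified folds might look like the sketch below. The data is synthetic, and the parameter grid, feature encoding, and labeling rule are illustrative assumptions rather than tuned recommendations.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.tree import DecisionTreeClassifier

# Synthetic, already-encoded features (NOT the real dataset).
rng = np.random.default_rng(2)
n = 200
na_to_k = rng.uniform(6, 38, n)
bp_code = rng.integers(0, 3, n)  # assumed encoding: 0=Low, 1=Normal, 2=High
X = np.column_stack([na_to_k, bp_code])
# Toy labeling rule invented for this demo only.
y = np.where(na_to_k > 15, "DrugY", np.where(bp_code == 2, "DrugA", "DrugX"))

# Stratified folds keep the class proportions similar in every split.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

# Hypothetical grid; adjust the ranges to the real data.
param_grid = {
    "max_depth": [2, 3, 4, None],
    "criterion": ["gini", "entropy"],
    "min_samples_leaf": [1, 5],
}
search = GridSearchCV(
    DecisionTreeClassifier(random_state=0),
    param_grid,
    cv=cv,
    scoring="accuracy",
)
search.fit(X, y)

print("Best parameters:", search.best_params_)
print(f"Mean cross-validated accuracy: {search.best_score_:.3f}")
```

On a dataset of only 200 samples, the cross-validated score is usually a steadier estimate than a single train/test split, which is why the two improvements pair well.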

📦 Requirements

pandas==2.0.3
numpy==1.25.2
matplotlib==3.7.2
seaborn==0.12.2
scikit-learn==1.3.0
