This repository contains assignments and projects for the Machine Learning course completed by Team 3.
- AbdelRahman Hesham Zakaria (1210148)
- Shehab Tarek ElHadary (1210366)
- Omar Walid Mohamed (1210269)
Preprocessing.ipynb- Initial data preprocessing techniquesPreprocessing1.ipynb- Advanced preprocessing methods- Datasets: Healthcare stroke data and medical insurance data
task4a updated.ipynb- Loan approval classification using:- Logistic Regression
- Hard Margin SVM
- Soft Margin SVM with RBF Kernel
- Dataset: Loan approval data
Assignment 5.ipynb- Customer churn prediction analysis- Dataset: Churn dataset
CART ML Report.pdf- Comprehensive report on CART algorithm covering:- Binary recursive partitioning
- Splitting criteria (Gini impurity, entropy)
- Pruning techniques
- Model strengths and limitations
Assignment 7.ipynb- Enhanced customer churn analysis- Dataset: Churn dataset
- Python - Primary programming language
- Jupyter Notebook - Development environment
- scikit-learn - Machine learning library
- pandas - Data manipulation
- numpy - Numerical computing
- matplotlib & seaborn - Data visualization
- Data preprocessing and feature engineering
- Classification algorithms (Logistic Regression, SVM)
- Decision Trees and CART
- Model evaluation and comparison
- Confusion matrices and performance metrics
- Customer churn prediction
- Loan approval prediction
- Datasets should be placed in their respective assignment folders for notebooks to run correctly
- Clone the repository
- Ensure you have Python 3.x installed with required libraries:
pip install jupyter pandas numpy scikit-learn matplotlib seaborn
- Place the required CSV datasets in the appropriate assignment folders
- Open the desired notebook:
jupyter notebook
This repository is for educational purposes as part of a university Machine Learning course.
Course: Machine Learning
Academic Year: 2024-2025 (First Term)