Project in data science
Titanic survival data Explore by implementing a decision tree in sci-kit-learn
The aim of this workbook is to provide practice with metrics specifically related to classification problems. To this end, we will use the spam dataset.
The aim of this workbook is to provide practice with metrics specifically related to regression problems. To this end, we will use the Boston Housing dataset.
In this mini-lab, we'll fit a decision tree model to some sample data. This initial model will overfit heavily. Then we'll use Grid Search to find better parameters for this model, to reduce the overfitting.
In this mini-lab, we'll try to optimize a number of models using Randomized Grid Searching in this notebook.
Find Donors for CharityML with Kaggle: CharityML is a fictitious charity organization that provides financial support. In an effort to improve donor outreach effectiveness, I had to build an algorithm that best identifies potential donors. I evaluated and optimized several different supervised learners (AdaBoost, Decision Tree & Logistic Regression) to determine which algorithm will provide the highest donation yield.