In this repo, you'll find tutorials of a variety of data analysis projects on real world datasets that you need to perform in order to build ML models! I will be explaining every step and what results mean during my work! Enjoy learning !
You can find the datasets I used in this repo or online !
1st Project: Rhode Island dataset from the Stanford Open Policing Project
- 'sopp.csv' is the Rhode Island dataset from the Stanford Open Policing Project, made available under the Open Data Commons Attribution License
2nd Project: Socioeconomic Indicators in Chicago (Dataset Here)
- The city of Chicago released a dataset of socioeconomic data to the Chicago City Portal. It contains a selection of six socioeconomic indicators of public health significance and a “hardship index,” for each Chicago community area, for the years 2008 – 2012.
3rd Project: Customer Segmentation with K-Means (Dataset Here)
- Despite the simplicity of K-means, it is vastly used for clustering in many data science applications, especially useful if we need to quickly discover insights from unlabeled data. In this notebook, I'll be using K-means for customer segmentation.