This project aims to cluster the students of a public school in Chicago. The available data set contains information about each student's contact details, progress, family environment, soft-skills scores, and more.
- Treat the data set
- Understand the data and its important characteristics
- Cluster the students using different models
  - K-means
  - DBSCAN
  - Gaussian mixture
  - Agglomerative hierarchical clustering
- Evaluate and compare the resulting clusters
  - Dendrogram
  - Silhouette plot
  - Elbow plot
  - Silhouette score
  - Calinski-Harabasz (CH) index
  - Davies-Bouldin (DB) index
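
The workflow listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes scikit-learn is available, uses synthetic blobs in place of the student data set, and the helper names `fit_all` and `score` as well as all hyperparameters (`n_clusters=3`, `eps=0.3`, `min_samples=5`) are hypothetical choices made for the example. The plots (dendrogram, silhouette plot, elbow plot) are left out to keep it short.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, DBSCAN, KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the student features (assumption: the real
# data set is not reproduced here).
X, _ = make_blobs(
    n_samples=300,
    centers=[[0, 0], [5, 5], [-5, 5]],
    cluster_std=0.6,
    random_state=0,
)
X = StandardScaler().fit_transform(X)


def fit_all(X, n_clusters=3):
    """Fit the four models and return one label array per model."""
    return {
        "kmeans": KMeans(
            n_clusters=n_clusters, n_init=10, random_state=0
        ).fit_predict(X),
        "dbscan": DBSCAN(eps=0.3, min_samples=5).fit_predict(X),
        "gaussian_mixture": GaussianMixture(
            n_components=n_clusters, random_state=0
        ).fit_predict(X),
        "agglomerative": AgglomerativeClustering(
            n_clusters=n_clusters
        ).fit_predict(X),
    }


def score(X, labels):
    """Return (silhouette, CH index, DB index), or None if undefined."""
    mask = labels != -1  # DBSCAN labels noise points as -1
    if np.unique(labels[mask]).size < 2:
        return None  # the metrics need at least two clusters
    Xc, lc = X[mask], labels[mask]
    return (
        silhouette_score(Xc, lc),         # higher is better, in [-1, 1]
        calinski_harabasz_score(Xc, lc),  # higher is better
        davies_bouldin_score(Xc, lc),     # lower is better
    )


results = {name: score(X, labels) for name, labels in fit_all(X).items()}
for name, scores in results.items():
    print(name, scores)
```

Note that DBSCAN noise points are dropped before scoring, so its scores are not strictly comparable with those of the other three models.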