Skip to content

Commit

Permalink
a note about the data used
Browse files Browse the repository at this point in the history
  • Loading branch information
jlooper committed Jun 18, 2021
1 parent b333541 commit 98a12e5
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion 5-Clustering/1-Visualize/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Introduction to clustering

Clustering is a type of [Unsupervised Learning](https://wikipedia.org/wiki/Unsupervised_learning) that presumes that a dataset is unlabelled. It uses various algorithms to sort through unlabeled data and provide groupings according to patterns it discerns in the data.
Clustering is a type of [Unsupervised Learning](https://wikipedia.org/wiki/Unsupervised_learning) that presumes that a dataset is unlabelled or that its inputs are not matched with predefined outputs. It uses various algorithms to sort through unlabeled data and provide groupings according to patterns it discerns in the data.

[![No One Like You by PSquare](https://img.youtube.com/vi/ty2advRiWJM/0.jpg)](https://youtu.be/ty2advRiWJM "No One Like You by PSquare")

Expand Down Expand Up @@ -211,6 +211,8 @@ df.describe()
| 75% | 2017 | 242098.5 | 31 | 0.8295 | 0.403 | 0.87575 | 0.000234 | 0.164 | -3.331 | 0.177 | 125.03925 | 4 |
| max | 2020 | 511738 | 73 | 0.966 | 0.954 | 0.995 | 0.91 | 0.811 | 0.582 | 0.514 | 206.007 | 5 |

> 🤔 If we are working with clustering, an unsupervised method that does not require labeled data, why are we showing this data with labels? In the data exploration phase, they come in handy, but they are not necessary for the clustering algorithms to work. You could just as well remove the column headers and refer to the data by column number.
Look at the general values of the data. Note that popularity can be '0', which show songs that have no ranking. Let's remove those shortly.

Use a barplot to find out the most popular genres:
Expand Down

0 comments on commit 98a12e5

Please sign in to comment.