The data in this repo comes from Spotify and
Genius. Thank you to the authors of the
spotifyr
and geniusr
packages for making it easy to access data from
these platforms!
There are 3 data sets about or related to the Spice Girls:
studio_album_tracks
: Audio features of each song from the three studio albums by the Spice Girls. From Spotify.related artists
: Artists deemed to be similar to the Spice Girls, with info about each artist including their musical genres and follower numbers. Includes a row with details for the Spice Girls, for comparison purposes. From Spotify.lyrics
: Lyrics of each song from the three studio albums by the Spice Girls. From Genius.
Credit: Jacquie Tran
A data dictionary for each data set is provided here.
The R code below uses the studio_album_tracks
data set to produce
summary statistics for selected audio features.
# Load libraries
library(dplyr)
# Read data into R
studio_album_tracks <- readr::read_csv("https://github.com/jacquietran/spice_girls_data/raw/main/data/studio_album_tracks.csv")
# For each album, calculate mean values for danceability, energy, and valence
studio_album_tracks %>%
group_by(album_name) %>%
summarise(
danceability_mean = mean(danceability),
energy_mean = mean(energy),
valence_mean = mean(valence)) %>%
ungroup() %>%
# Set factor levels of album_name
mutate(
album_name = factor(
album_name, levels = c("Spice", "Spiceworld", "Forever"))) %>%
arrange(album_name)
#> # A tibble: 3 x 4
#> album_name danceability_mean energy_mean valence_mean
#> <fct> <dbl> <dbl> <dbl>
#> 1 Spice 0.694 0.731 0.756
#> 2 Spiceworld 0.570 0.776 0.642
#> 3 Forever 0.695 0.722 0.583
spotifyr
: https://www.rcharlie.com/spotifyr/index.htmlgeniusr
: https://ewenme.github.io/geniusr/