SpotifyAPI-Data-Engineering-Project

This project uses ETL (Extract, Transform, and Load) pipeline to extract data from Spotify using its API and loads the data to a data source(AWS Athena). The entire pipeline will be built using Amazon Web Services (AWS).

About Dataset / API

The API contains information about music, artists, albums, and songs. Spotify API

Services Used

AWS S3: Amazon S3 (Simple Storage Service) is a highly scalable object storage service that can store and retrieve any amount of data from anywhere on the web. It is commonly used to store and distribute large media files, data backups, and static website files.

AWS Lambda: AWS Lambda is a serverless computing service that lets you run your code without managing servers. You can use lambda to run code in response to events like changes in S3, DynamoDB, or other AWS services.

AWS CloudWatch: Amazon CloudWatch is a monitoring< service for AWS resources and the applications you run on them. You can use CloudWatch to collect and track metrics, collect and monitor log files, and set alarms.

Glue Crawler: AWS Glue Crawler is a fully managed service that automatically crawls your data sources, identifies datatypes, and infers schema to create an AWS Glue Data Catalog.

Data Catalog: AWS Data Catalog is a fully managed metadata repository that makes it easy to discover and manage data in AWS. Data Catalog can be used with other AWS Services, such as AWS Athena.

Amazon Athena: Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena can be used to analyze data in Glue Data Catalog or in other S3 buckets

Install Packages

pip install pandas
pip install numpy
pip install spotipy

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
architecture		architecture
data		data
README.md		README.md
ingest.py		ingest.py
spotify.ipynb		spotify.ipynb
spotipy_layer.zip		spotipy_layer.zip
transform_load.py		transform_load.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpotifyAPI-Data-Engineering-Project

About Dataset / API

Services Used

Install Packages

About

Releases

Packages

Languages

Undisputed-jay/SpotifyAPI-Data-Engineering-Project

Folders and files

Latest commit

History

Repository files navigation

SpotifyAPI-Data-Engineering-Project

About Dataset / API

Services Used

Install Packages

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages