jarretjeter/Spotify-Data-Loader

Spotify Dataloader

By Jarret Jeter

A data pipeline that uses Python classes to load CSV data into a SQL database.

Technologies Used

  • Python
  • Pandas
  • SQL
  • Beekeeper Studio
  • Docker

Description

The main.py file defines a DataLoader class with reusable code that reads a CSV file and applies pandas and SQLAlchemy methods to its contents. It also defines functions that create a SQLAlchemy engine and create the tables that are loaded into the SQL database.
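As a rough illustration of that design, here is a minimal sketch of what such a class might look like. It is not the code from main.py: the method names, the sample CSV, and the `artists` table are hypothetical, and an in-memory SQLite database stands in for MariaDB so the snippet runs without Docker.

```python
import io

import pandas as pd
from sqlalchemy import create_engine, text


class DataLoader:
    """Hypothetical sketch of a reusable CSV-to-SQL loader."""

    def __init__(self, source):
        # source can be a file path or any file-like object pandas can read
        self.df = pd.read_csv(source)

    def head(self, n=5):
        # Return the first n rows for a quick look at the data
        return self.df.head(n)

    def load_to_db(self, con, table_name):
        # index=False keeps the DataFrame index out of the SQL table
        self.df.to_sql(table_name, con=con, if_exists="replace", index=False)


# Made-up sample data, used only for this illustration
csv_text = "id,name,popularity\n1,Artist A,80\n2,Artist B,65\n"
loader = DataLoader(io.StringIO(csv_text))

# Assumption: SQLite in memory replaces the MariaDB container here
engine = create_engine("sqlite:///:memory:")
with engine.begin() as conn:
    loader.load_to_db(conn, "artists")
    row_count = conn.execute(text("SELECT COUNT(*) FROM artists")).scalar()
```

Keeping the read step in the constructor and the write step in a separate method lets one loader instance be inspected or transformed before anything is written to the database.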

Setup/Installation Requirements

  • Make sure you have a text editor such as Visual Studio Code installed.
  • Have Python 3.7 installed.
  • Install Beekeeper Studio and Docker.
  • Clone this repository (https://github.com/jarretjeter/Spotify-Data-Loader.git) from GitHub onto your local computer.
  • Run the start_db.sh script to create a Docker container running the MariaDB database.
  • In Beekeeper Studio, create a new connection with Connection Type: MariaDB, Connection Mode: Host and Port, Host: localhost, Port: 3306, User: root, Password: mysql, then connect.
  • Click the dropdown menu on the left, then select the spotify database.
  • Now you're ready to run the code in main.py and see the results in Beekeeper Studio.
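The same connection settings can be expressed as a SQLAlchemy URL. The sketch below is an assumption about how a script could connect, not the engine code in main.py; in particular the `mysql+pymysql` driver name is a guess, and `make_engine` is a hypothetical helper.

```python
from sqlalchemy import create_engine
from sqlalchemy.engine import URL

# Connection settings taken from the Beekeeper Studio step above.
# Assumption: the PyMySQL driver is used to talk to MariaDB.
db_url = URL.create(
    drivername="mysql+pymysql",
    username="root",
    password="mysql",
    host="localhost",
    port=3306,
    database="spotify",
)


def make_engine():
    # Hypothetical helper: calling it requires PyMySQL to be installed,
    # and using the engine requires the Docker container to be running.
    return create_engine(db_url)
```

Building the URL does not open a connection, so it can be checked before the container is started.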

Known Bugs

  • The "Unnamed: 0" column is still loaded into the database even though it is not defined in the SQLAlchemy table schema.
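One possible cause, offered as an assumption since the CSV itself is not shown here: pandas gives the name "Unnamed: 0" to a first column that has no header, which typically happens when a DataFrame was saved to CSV together with its index. The sketch below uses made-up data to show two ways to keep that column out of the DataFrame before it is loaded.

```python
import io

import pandas as pd

# Made-up CSV whose first column has an empty header, as produced by
# DataFrame.to_csv() when the index is written out
csv_text = ",name,popularity\n0,Artist A,80\n1,Artist B,65\n"

# Default read: pandas names the headerless column "Unnamed: 0"
raw = pd.read_csv(io.StringIO(csv_text))

# Option 1: drop the column explicitly before loading
cleaned = raw.drop(columns=["Unnamed: 0"])

# Option 2: tell pandas the first column is the index
indexed = pd.read_csv(io.StringIO(csv_text), index_col=0)
```

With option 2 the values become the DataFrame index, so they will still be written to the database unless `to_sql` is called with `index=False`.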

License

If you have any questions, please email me at jarretjeter@gmail.com

MIT

Copyright (c) 5/17/2022 Jarret Jeter
