A proof-of-concept extract-transform-load (ETL) pipeline using publicly available data about Canadian business licenses in Toronto, Edmonton, and Calgary. The data was downloaded in CSV format from the open data portals on the municipal websites and read into Pandas DataFrames, where the transformations were applied. Once the transformations were complete, the data was loaded into a PostgreSQL database.
- Programming languages: Python
- Relevant libraries: Pandas, NumPy
- Environment: Jupyter Notebook
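
The extract, transform, and load steps described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual notebook code: the sample rows, the source column names, the shared schema, and the table name `business_licences` are all hypothetical, and an in-memory SQLite database stands in for PostgreSQL so the example runs without a database server.

```python
import io
import sqlite3

import pandas as pd

# Hypothetical sample rows standing in for the downloaded city CSV files.
TORONTO_CSV = """Licence No.,Operating Name,Category,Issued
T001,Maple Cafe,EATING ESTABLISHMENT,2019-03-01
T002,  North Cabs ,TAXICAB OWNER,2020-07-15
"""
CALGARY_CSV = """LICENCENUMBER,TRADENAME,LICENCETYPES,FIRST_ISS_DT
C100,Bow River Books,RETAIL DEALER,2018/11/20
"""

# Assumed mapping from each city's column names to one shared schema.
COLUMN_MAPS = {
    "Toronto": {
        "Licence No.": "licence_number",
        "Operating Name": "business_name",
        "Category": "licence_type",
        "Issued": "issue_date",
    },
    "Calgary": {
        "LICENCENUMBER": "licence_number",
        "TRADENAME": "business_name",
        "LICENCETYPES": "licence_type",
        "FIRST_ISS_DT": "issue_date",
    },
}
TARGET_COLUMNS = ["licence_number", "business_name", "licence_type", "issue_date"]


def extract(csv_text):
    """Read one city's CSV into a DataFrame, keeping every value as text."""
    return pd.read_csv(io.StringIO(csv_text), dtype=str)


def transform(df, city):
    """Rename columns to the shared schema and normalise names and dates."""
    out = df.rename(columns=COLUMN_MAPS[city])[TARGET_COLUMNS].copy()
    out["business_name"] = out["business_name"].str.strip()
    # Store dates as ISO 8601 strings so every city uses the same format.
    out["issue_date"] = pd.to_datetime(out["issue_date"]).dt.strftime("%Y-%m-%d")
    out["city"] = city
    return out


def run_pipeline(conn):
    """Extract and transform each city, then load the combined table."""
    frames = [
        transform(extract(TORONTO_CSV), "Toronto"),
        transform(extract(CALGARY_CSV), "Calgary"),
    ]
    combined = pd.concat(frames, ignore_index=True)
    combined.to_sql("business_licences", conn, if_exists="replace", index=False)
    return len(combined)


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # stand-in for PostgreSQL
    print(run_pipeline(connection), "rows loaded")
```

Keeping one column mapping per city means a third source such as Edmonton could be added by supplying another mapping rather than another transformation function. For a real PostgreSQL target, `DataFrame.to_sql` would typically be given a SQLAlchemy engine instead of the SQLite connection used here.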