Skip to content

sudhxan/Azure-DataEngineering-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Tokyo Olympics Azure Data Engineering Project

Overview

This project showcases my experience with various Azure services for data engineering. I have utilized Azure Data Factory, Azure Data Lake Storage, Azure App Services, Azure Databricks, and Azure Synapse Analytics to build an end-to-end data pipeline and perform analytics on the data.

Technologies Used

  • Azure Data Factory: Used for orchestrating and automating data movement and transformation. I have used the HTTP request method to extract data and stored it in Azure Data Lake Storage.

  • Azure Data Lake Storage: Used as a scalable data lake for storing raw and processed data.

  • Azure App Services: Hosted the web application for data visualization and interaction.

  • Azure Databricks: Utilized for data transformation and running machine learning models.

  • Azure Synapse Analytics: Used for running SQL queries and performing analytics on the data.

    Data Factory Workings:

Screenshot (669)

Synapse Analytics Workings:

Screenshot (684)

Project Structure

  • ETL Pipeline: Created using Azure Data Factory to extract data, transform it using Databricks, and load it into Data Lake Storage.
  • Web Application: Hosted on Azure App Services, providing interactive dashboards for data visualization.
  • Analytics: Leveraged Azure Synapse Analytics for running SQL queries and generating insights from the data.

Key Learnings

This project has deepened my understanding of Azure services for data engineering, especially in building scalable and efficient data pipelines, performing data transformations at scale, and leveraging analytics for data-driven insights.


About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors