Projects done in the Data Engineer Nanodegree Program by Udacity.com
-
Updated
Dec 8, 2022 - Jupyter Notebook
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Udacity Data Engineering Nanodegree Program
A library for data warehouse and data integration pattern and architecture documentation.
Modeled for longitudinal storage and reporting of P-20W data, the Common Education Data Standards (CEDS) Data Warehouse implements star schema data warehouse normalization techniques for improved query performance.
The Common Education Data Standards (CEDS) Data Warehouse Parquet (DW Parquet) standard is designed for data engineering and data science needs in the cloud. The DW Parquet Models mirror the SQL-based CEDS Data Warehouse. Parquet files are designed for rapid and distributed reporting across multiple technology stacks, data processing and BI tool…
A blockchain enabled OLAP Data Warehouse. It is the implementation / application part of my thesis as an undergraduate Computer Science student of Athens University of Economics and Business
This is a repository to hold the files and notebooks produced throughout my Udacity's Nanodegree Data Engineering program.
Multiwoven Documentation
Personal Projects submitted for the Data Engineer Nanodegree from Udacity.
Students will build an ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team.
Production-ready ELT pipelines using Airflow
Contains all files from my data science projects
Cloud Data Warehouse of Sparkify Data using Redshift
Udacity Data Engineering Nanodegree
project developed during the course Data Warehouses @ Master in Data Science and Engineering, Faculty of Engineering, University of Porto
RAGDWAREuz - Retrieval-Augmented Generation for a DataWarehouse Academic Retrieval Engine
Phân tích dữ liệu tai nạn giao thông tại Chicago: Tiền xử lý và xây dựng kho dữ liệu, phân tích và trực quan hóa dữ liệu bằng OLAP (SSIS, SSAS, Power BI, Looker Studio), đồng thời phát triển mô hình học máy phân loại mức độ nghiêm trọng của tai nạn.
Follow along with materials in the book "Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses and data lakes" (Lipp, 2023)
Data Engineering Lab, powered by TITAN and ReGeneration
🛍️ Modern E-commerce Data Warehouse built with dbt, PostgreSQL & Python. Features dimensional modeling, automated testing, CI/CD pipeline, and comprehensive analytics for customer insights, product performance, and marketing ROI. 📊✨
Add a description, image, and links to the data-warehouses topic page so that developers can more easily learn about it.
To associate your repository with the data-warehouses topic, visit your repo's landing page and select "manage topics."