Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
-
Updated
Nov 21, 2023 - Python
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
implementing an end-to-end tweets ETL/Analysis pipeline.
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
Data warehousing date dimension and time dimension builders written in Python.
This project demonstrates the creation of a Data Warehouse using SQL Server 2022. It includes the design of dimension and fact tables, ETL processes for data integration, Python scripts for synthetic data generation, and SQL queries for KPI analysis to support business decision-making.
Using dbt to load(seed) and do some transformations and then finally load that data to some Cloud Warehouse
Batch ETL pipeline project on GCP to load and transform daily flight data using Spark to update tables in BigQuery. The pipeline is automated using Airflow.
This project goal is to design a Data Platform for retail Data Analytics.
Banking Data Warehouse Pipeline
Date dimension based on Iranian calendar
Data Modeling with Postgres and Python
How to manage SCD2 with Apache Hive 1.1 and HBase 1.2 w/o HiveQL UPDATE operation
Check daily covid information
A comprehensive LLM data processing system designed to transform raw multi-format data into high-quality training datasets optimized for Large Language Models.
⚡ Automatically produce a data model on your database using its information schema using GenAI.
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
End-to-end EV charging demand analytics pipeline using Python, SQL Server, AWS S3, Excel, and Power BI. Analyzes session-level charging data to identify peak hours, city-wise demand, daily trends, and station utilization.
Implementation ETL with Python for data integration workflows.
Add a description, image, and links to the datawarehousing topic page so that developers can more easily learn about it.
To associate your repository with the datawarehousing topic, visit your repo's landing page and select "manage topics."