Create a data lake on AWS S3 to store dimensional tables after processing data with Spark on an AWS EMR cluster
Updated Oct 10, 2019 - Python
Build a data warehouse from scratch, including full load, daily incremental load, schema design, and SCD Types 1 and 2.
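For readers unfamiliar with slowly changing dimensions, here is a minimal sketch of SCD Type 2 logic in plain Python; the table shape and field names (`customer_id`, `city`, `valid_from`, `is_current`) are invented for illustration, not taken from the project above:

```python
from datetime import date

def apply_scd2(dimension, incoming, today):
    """Apply SCD Type 2: expire the current row for a changed key
    and append a new versioned row, preserving history."""
    for row in incoming:
        current = next(
            (r for r in dimension
             if r["customer_id"] == row["customer_id"] and r["is_current"]),
            None,
        )
        if current is None:
            # Brand-new key: insert as the current version.
            dimension.append({**row, "valid_from": today,
                              "valid_to": None, "is_current": True})
        elif current["city"] != row["city"]:
            # Attribute changed: close the old version, open a new one.
            # (SCD Type 1 would instead overwrite the row in place.)
            current["valid_to"] = today
            current["is_current"] = False
            dimension.append({**row, "valid_from": today,
                              "valid_to": None, "is_current": True})
    return dimension

dim = apply_scd2([], [{"customer_id": 1, "city": "Oslo"}],
                 today=date(2024, 1, 1))
dim = apply_scd2(dim, [{"customer_id": 1, "city": "Bergen"}],
                 today=date(2024, 6, 1))
# dim now holds two versions: the expired Oslo row and the current Bergen row
```

The key design choice in Type 2 is that the dimension table grows over time: every historical attribute value stays queryable via its validity interval.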
A Flask application that converts an informational model of a decision problem into a snowflaked star schema
Building Data Warehouse and ETL pipelines using Amazon S3 and Redshift
Simple scripts for data cleaning, ETL transformations, and data reorganisation
Batch and streaming data pipelines for Formula 1 racing data from multiple sources and APIs, built on Databricks with PySpark; the data is modeled into a star schema for analysis in Power BI.
Transformed raw HR data into a star schema using GCP and Cloud SQL, wrote SQL queries for business reporting, and analyzed trends such as age vs. income, performance, and hiring by gender. Visualized insights in Tableau for data-driven HR decisions. Tools: Google Cloud SQL (Postgres), GCP, Tableau.
Open-source Supply Chain analytics on Microsoft Fabric: a scalable Bronze-Silver-Gold pipeline with automated CSV ingestion, Delta Lake transforms, semantic modeling (DAX & RLS) and interactive Power BI reports. Join to enhance pipelines, refine models, and build next-gen supply-chain insights!
Model a star schema from raw normalized Olympic Games data using dbt, Postgres, Airflow, and Docker
Data Modeling with Apache Cassandra
ETL pipeline that extracts and transforms student athlete academic performance data, then populates a data warehouse using a star schema dimensional model.
ETL Pipeline that Scrapes, Cleans, and Loads Book Data into PostgreSQL, then builds a Star-Schema Data Warehouse for Optimized Analysis.
University lab exercises with processing big data.
Udacity project: implementing an ETL process on a PostgreSQL DB to create a star schema data model
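As a general illustration of the star-schema pattern these projects share, here is a minimal sketch using SQLite from the Python standard library; the sales fact and its dimensions are invented for the example and do not come from any project listed here:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Dimension tables hold descriptive attributes keyed by surrogate keys;
# the fact table holds numeric measures plus one foreign key per dimension.
cur.executescript("""
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE fact_sales (
    date_key    INTEGER REFERENCES dim_date(date_key),
    product_key INTEGER REFERENCES dim_product(product_key),
    amount      REAL
);
""")

cur.execute("INSERT INTO dim_date VALUES (20240101, 2024, 1)")
cur.executemany("INSERT INTO dim_product VALUES (?, ?, ?)",
                [(1, "Widget", "Hardware"), (2, "Gadget", "Hardware")])
cur.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                [(20240101, 1, 9.5), (20240101, 2, 20.0)])

# A typical star-schema query: join the fact to a dimension and aggregate.
cur.execute("""
    SELECT p.category, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p ON p.product_key = f.product_key
    GROUP BY p.category
""")
result = cur.fetchall()
print(result)  # [('Hardware', 29.5)]
```

Because every join goes directly from the fact table to a single dimension, queries stay simple and predictable, which is the main reason analytics warehouses favor this shape.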
Creating a data warehouse using AWS Redshift.
All-in-one slice-and-dice module
A full-stack data engineering portfolio project with ingestion, batch processing, star schema modeling, orchestration, and analytics dashboard.
An Airflow + dbt project that models e-commerce data using DuckDB and a mini star schema.
End-to-end sales pipeline: CSV → Parquet → star schema → RDS Postgres. Orchestrated with Airflow; infrastructure via Terraform on AWS.
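The CSV-to-star-schema step in pipelines like this can be sketched in plain Python; the column names and in-memory "tables" below are assumptions for illustration, whereas a real pipeline would write Parquet files and load Postgres:

```python
import csv
import io

# A tiny stand-in for a raw sales CSV file.
raw = io.StringIO(
    "order_id,customer,product,amount\n"
    "1,Alice,Widget,10.0\n"
    "2,Bob,Widget,15.0\n"
    "3,Alice,Gadget,20.0\n"
)

# Build dimension tables by assigning surrogate keys to distinct values.
customers, products, facts = {}, {}, []
for row in csv.DictReader(raw):
    cust_key = customers.setdefault(row["customer"], len(customers) + 1)
    prod_key = products.setdefault(row["product"], len(products) + 1)
    # Fact rows keep only the measures and surrogate keys.
    facts.append((int(row["order_id"]), cust_key, prod_key,
                  float(row["amount"])))

# customers == {'Alice': 1, 'Bob': 2}; facts reference dimensions by key
```

The same split (distinct descriptive values out to dimensions, keys and measures into the fact) is what the Spark or dbt transform stage performs at scale in the projects above.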
This project builds a real-time food delivery analytics pipeline using AWS Kinesis, PySpark, Redshift, and QuickSight, with automated deployments via CodeBuild.