Real-Time Event Streaming & Change Data Capture
Running an ETL pipeline with COBOL on Kubernetes
In the following post, we will learn how to build a data pipeline using a combination of open-source software (OSS), including Debezium, Apache Kafka, and Kafka Connect.
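As a rough illustration of how the Debezium piece fits in, the sketch below registers a PostgreSQL source connector through the Kafka Connect REST API. The hostname, credentials, database, and table names are placeholders, and some configuration keys differ between Debezium versions (topic.prefix is the 2.x name).

    #!/usr/bin/env bash
    # Register a Debezium PostgreSQL source connector with Kafka Connect.
    # Assumes Kafka Connect listens on localhost:8083; the database host,
    # credentials, and table list below are placeholders.
    curl -s -X POST http://localhost:8083/connectors \
      -H "Content-Type: application/json" \
      -d '{
        "name": "inventory-cdc",
        "config": {
          "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
          "database.hostname": "postgres",
          "database.port": "5432",
          "database.user": "debezium",
          "database.password": "change-me",
          "database.dbname": "inventory",
          "topic.prefix": "cdc",
          "table.include.list": "public.orders"
        }
      }'

Once the connector is running, row-level changes from public.orders appear as events on a Kafka topic (cdc.public.orders with this configuration) that downstream consumers can subscribe to.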
Finnhub data streaming pipeline for real-time analysis of Bitcoin trades.
Ozone Analytics provides modular data pipelines for streaming, flattening, storage, and visualization—powered by Flink, Kafka, PostgreSQL, Drill, MinIO, and Superset.
A scalable data warehouse solution designed for AI-driven traffic analytics using vehicle trajectory data from swarm UAVs. Built with Airflow for orchestration, dbt for data transformation, PostgreSQL for storage, and Redash for visualization.
Extract, Transform, and Load (ETL) processes are used when flexibility, speed, and scalability in handling data are crucial to an organization.
A data engineering project.
This repository includes all files that make up the design and unification of the AdventureWorks and WideWorldAdventure databases.
Weather ETL pipeline built entirely with Bash scripts and a cron job (no manual work).
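A minimal sketch of what such a Bash-only weather ETL can look like is below; the API endpoint, JSON field names, and output path are assumptions, and it relies on curl and jq being installed.

    #!/usr/bin/env bash
    # weather_etl.sh -- extract current weather from an HTTP API, transform the
    # JSON payload into a CSV row, and load (append) it into a local data file.
    set -euo pipefail

    API_URL="https://api.example.com/v1/current?city=Berlin"   # placeholder endpoint
    OUT_FILE="$HOME/data/weather.csv"

    # Extract: fetch the raw JSON payload.
    raw=$(curl -sf "$API_URL")

    # Transform: pull out the fields of interest (field names are assumptions).
    row=$(echo "$raw" | jq -r '[.timestamp, .temperature, .humidity] | @csv')

    # Load: append to the CSV, writing a header on first run.
    mkdir -p "$(dirname "$OUT_FILE")"
    [ -f "$OUT_FILE" ] || echo "timestamp,temperature,humidity" > "$OUT_FILE"
    echo "$row" >> "$OUT_FILE"

The "no manual work" part comes from scheduling the script with cron, for example an hourly entry such as: 0 * * * * /home/user/weather_etl.sh >> /home/user/weather_etl.log 2>&1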
Building a data warehouse from scratch using PostgreSQL as the primary DBMS.
ETL process which loads and transforms Medicare hospital data using Python and Hive
A simple shell script that performs an ETL pipeline using Bash scripting in a Linux environment.
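For a file-based pipeline of this kind, chaining standard Unix tools is usually enough; the sketch below uses made-up paths and a made-up column layout, filtering and reordering a CSV with awk before loading the result into a target directory.

    #!/usr/bin/env bash
    # Minimal file-based ETL: extract a source CSV, transform it with awk,
    # and load the result into a target directory. Paths and the column
    # layout (id,date,amount) are illustrative only.
    set -euo pipefail

    SRC="/tmp/source/sales.csv"     # extracted input (placeholder)
    DEST_DIR="/tmp/warehouse"       # load target (placeholder)

    mkdir -p "$DEST_DIR"

    # Transform: keep the header plus rows with a positive amount (column 3),
    # reordering columns to id,amount,date.
    awk -F',' 'NR == 1 || $3 > 0 { print $1 "," $3 "," $2 }' "$SRC" \
      > "$DEST_DIR/sales_clean.csv"

    echo "Loaded $(($(wc -l < "$DEST_DIR/sales_clean.csv") - 1)) rows into $DEST_DIR"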
This project implements an extract, transform, and load (ETL) process.
Built an ETL pipeline that extracts climate data from an API, transforms it by combining all of the extracted data into a single file, and loads that file into an output folder.
Demonstrates how to set up a robust data engineering environment by containerizing Hadoop ecosystem components and other essential services with Docker. The setup includes Hadoop (HDFS, YARN), Apache Hive, PostgreSQL, and Apache Airflow, all configured to work together seamlessly.
A shell script that ensures recent updates to sensitive files are regularly backed up, enhancing data security and reducing manual effort.
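One way to implement that with standard tools is sketched below: find files modified within the last day and archive them into a timestamped tarball. The source and backup directories, and the one-day window, are assumptions.

    #!/usr/bin/env bash
    # Back up files modified within the last 24 hours into a timestamped
    # tarball. Source and backup directories are placeholders; a daily
    # cron job would keep this running without manual effort.
    set -euo pipefail

    SRC_DIR="/etc/secure-configs"              # directory of sensitive files (assumed)
    BACKUP_DIR="/var/backups/secure-configs"   # backup destination (assumed)
    STAMP=$(date +%Y%m%d_%H%M%S)

    mkdir -p "$BACKUP_DIR"

    # Collect recently modified files and archive them only if any exist.
    mapfile -t changed < <(find "$SRC_DIR" -type f -mtime -1)
    if [ "${#changed[@]}" -gt 0 ]; then
      tar -czf "$BACKUP_DIR/backup_$STAMP.tar.gz" "${changed[@]}"
      echo "Backed up ${#changed[@]} file(s) to backup_$STAMP.tar.gz"
    else
      echo "No recently modified files to back up."
    fi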
A robust real-time data streaming pipeline using Apache Kafka for event ingestion, Apache Flink for real-time processing, and PostgreSQL for storage and analytics. Designed for low-latency insights and scalable data workflows.
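To make the ingestion side concrete, here is a minimal sketch, assuming a local single-broker Kafka on localhost:9092 and a topic named events, that creates the ingestion topic and publishes a test event a Flink job could then consume.

    # Create the ingestion topic (broker address, topic name, and sizing are assumptions).
    kafka-topics.sh --create \
      --bootstrap-server localhost:9092 \
      --topic events \
      --partitions 3 \
      --replication-factor 1

    # Publish a sample JSON event for the downstream Flink job to process.
    echo '{"event_id": 1, "type": "page_view", "ts": "2025-04-16T12:00:00Z"}' | \
      kafka-console-producer.sh --bootstrap-server localhost:9092 --topic events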
Using large language models and AWS Bedrock to orchestrate an ETL pipeline