Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; Kafka for Stream processing
Updated Feb 19, 2025 - Python
A simple demo showing how to use Ably and FastAPI to route messages into Kafka for stream processing
Current 2022 Confluent Keynote Demo covering Stream Designer, Stream Catalog, and Stream Sharing.
For recreational use. Just a playground of Kafka+Spark+MQTT+ksqlDB+others
Interactive ksqlDB command line client with autocompletion and syntax highlighting written in Python
Free and simple way to interact with ksqlDB using UI
Pythonic KSQL REST API - Next Gen.
An example of a Kappa architecture solution for transaction fraud detection using Apache Kafka and Python
Kafka Connect and ksqlDB with Oracle
Real-time Coinbase market data streaming pipeline with visualizations. Much appreciation to the DataTalks.Club Data Engineering Zoomcamp: https://github.com/DataTalksClub/data-engineering-zoomcamp
An app that keeps track of YouTube videos and sends a notification to a Telegram bot when anyone comments on them
Kubernetes demo
Real-time fraud analysis using Kafka Streams
Streaming event pipeline built around Apache Kafka and its ecosystem, simulating real-time data streaming
This project demonstrates a modern ETL (Extract, Transform, Load) streaming pipeline using various open-source technologies.