Apache Flink (Pyflink) and Related Projects
-
Updated
Dec 3, 2025 - Python
Apache Flink (Pyflink) and Related Projects
Self-contained demo using PyFlink with Gensim+spaCy to find topics in the Flink User Mailing List. All you need is Docker! 🐳
Primary Recommender System: online[matching|ranking...](Flask|Vue) - nearline[model serving|real-time service](Flink|tensorflow serving|redis) - offline[feature engine|model training](Spark|Hdfs(Hbase)|tf)
Sahibinden.com Data Engineering Technical Case Study
A Makeshift data infrastructure setup for datafirstjobs.com.
Engaging, interactive visualizations crafted with Streamlit, seamlessly powered by Apache Flink in batch mode to reveal deep insights from data.
Python Practices 🐙 offers step-by-step Python exercises, real-world examples, and best coding standards from fundamentals to SOLID OOP for practical, scalable learning and growth.
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
Implementation of the PLStream locally on macOS.
Modern Flink 2.x + Kafka 4.x + Iceberg lakehouse demos with SQL, PyFlink, and Java.
PyFlink data stream processing utilities 🐿
a complete streaming data pipeline that ingests simulated IoT sensor data, processes it and displays in real time through a simple website
Big Data Stack
Add a description, image, and links to the pyflink topic page so that developers can more easily learn about it.
To associate your repository with the pyflink topic, visit your repo's landing page and select "manage topics."