Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
-
Updated
Jul 3, 2025 - Python
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Dataset Batch(offline) Reinforcement Learning for recommender system
📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transformation, storage, monitoring, and AI/ML serving with CI/CD automation using Terraform & GitHub Actions.
Jupyter Integration for Flink SQL via Ververica Platform
Apache Flink (Pyflink) and Related Projects
Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & more
This repository contains the ingredients for the Digital Twin Concept of Industry Fusion.
🚀 Traffic Sentinel: A scalable IoT system using Fog nodes and Apache Flink to process 📷 IP camera streams, powered by YOLO for intelligent 🚗 traffic monitoring on highways. 🛣️
Discover Flink clusters on Hadoop YARN for Prometheus
Snapshot manager for Amazon Kinesis Data Analytics for Apache Flink helps the users to generate a snapshot on a periodic basis.
An end-to-end application for abstractive document summarization on top of TensorFlow, Flink-AI-Extended and Flink ML pipeline framework.
Apache Paimon Python The Python implementation of Apache Paimon.
Build, deploy, and orchestrate event-driven agents natively on Apache Flink® and Apache Kafka®
Scalable real-time fraud detection platform built on modern Data Lakehouse architecture
Add a description, image, and links to the flink topic page so that developers can more easily learn about it.
To associate your repository with the flink topic, visit your repo's landing page and select "manage topics."