This project builds a real-time data pipeline that ingests, processes, and stores data using Apache Kafka, Apache Spark, and MySQL. It simulates a data stream, processes the stream in real time, and saves the results for analysis. Apache Airflow automates the workflow, and the project demonstrates data engineering and real-time data processing skills.
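To make the ingest, process, and store stages concrete, here is a minimal standard-library sketch of the same flow. It is not the project's code: an in-memory `queue.Queue` stands in for a Kafka topic, a plain Python aggregation stands in for the Spark job, and SQLite stands in for MySQL. The event schema (`sensor_id`, `value`), the `sensor_stats` table, and all function names are assumptions made up for illustration.

```python
import queue
import random
import sqlite3
from collections import defaultdict


def simulate_events(n, seed=0):
    """Yield n fake readings. The schema is hypothetical, not the project's."""
    rng = random.Random(seed)
    for i in range(n):
        yield {"sensor_id": f"s{i % 3}", "value": round(rng.uniform(0, 100), 2)}


def produce(events, topic):
    """Ingest stage: push events onto the queue (stand-in for a Kafka topic)."""
    for event in events:
        topic.put(event)
    topic.put(None)  # sentinel marking the end of the simulated stream


def process(topic):
    """Processing stage: per-sensor count and mean (stand-in for a Spark job)."""
    totals = defaultdict(lambda: [0, 0.0])  # sensor_id -> [count, sum]
    while True:
        event = topic.get()
        if event is None:
            break
        entry = totals[event["sensor_id"]]
        entry[0] += 1
        entry[1] += event["value"]
    return {sid: (count, total / count) for sid, (count, total) in totals.items()}


def store(conn, stats):
    """Storage stage: upsert the aggregates (SQLite stands in for MySQL)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sensor_stats "
        "(sensor_id TEXT PRIMARY KEY, n INTEGER, avg_value REAL)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO sensor_stats VALUES (?, ?, ?)",
        [(sid, count, avg) for sid, (count, avg) in stats.items()],
    )
    conn.commit()


def run_pipeline(n_events=9, conn=None):
    """Run the three stages in order and return the stored rows."""
    if conn is None:
        conn = sqlite3.connect(":memory:")
    topic = queue.Queue()
    produce(simulate_events(n_events), topic)
    store(conn, process(topic))
    return conn.execute(
        "SELECT sensor_id, n, avg_value FROM sensor_stats ORDER BY sensor_id"
    ).fetchall()


if __name__ == "__main__":
    for row in run_pipeline():
        print(row)
```

In a real deployment each function would map to a separate component, with a scheduler such as Airflow triggering the stages rather than a single `run_pipeline` call.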
Updated Dec 17, 2024