A simple Spark-powered ETL framework that just works 🍺
Updated Oct 2, 2025 - Scala
Extensible streaming ingestion pipeline on top of Apache Spark
Pipeline Pattern implementation in Scala
Various stream and batch data-processing demos with Apache Spark in Scala 🚀
Capabilities of StanfordNLP and OpenNLP on Spark
Real-time text classification based on Kafka and Spark.
🍦 Serve doddle-model in a pipeline implemented with Apache Beam
Spark ML dashboard for plugging in and tweaking model parameters to verify classification results on sample test data in real time
Experimental stream processing pipeline with anomaly detection.
Recommendation system for the MovieLens database using Apache Spark
A simple pipeline to transform data within Azure Data Factory using Azure Databricks. Although it is written in Scala, the same can be replicated in Python.
GameTuner BigQuery Loader is an application that loads enriched events into BigQuery