Stars
The official home of the Presto distributed SQL query engine for big data
Apache Druid: a high performance real-time analytics database.
Flink CDC is a streaming data integration tool
Apache Pinot - A realtime distributed OLAP datastore
Upserts, Deletes And Incremental Processing on Big Data.
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
An extensible distributed system for reliable nearline data streaming at scale
Simple example for Spark Streaming over a Kafka topic
Code Samples for my Ververica Webinar "99 Ways to Enrich Streaming Data with Apache Flink"
Personal Book Library Web Project
This repository contains a Kafka Connect sink connector for copying data from Apache Kafka into IBM MQ.
Kafka Connect Elastic sink connector with just-in-time index/delete behaviour.
IoT MQTT sensor stream capture for Apache NiFi
Storm Trident API State Management Example