Stars
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
ClickHouse® is a real-time analytics DBMS
Uniffle is a high performance, general purpose Remote Shuffle Service.
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Fork of tagtraum industries' GCViewer. Tagtraum stopped development in 2008, I aim to improve support for Sun's / Oracle's java 1.6+ garbage collector logs (including G1 collector)
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
A UI dashboard that allows CRUD operations on Zookeeper.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Tomcat clustering redis session manager java client.