Stars
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
A composable and fully extensible C++ execution engine library for data management systems.
The native Rust implementation for Apache Hudi, with C++ & Python API bindings.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Upserts, Deletes And Incremental Processing on Big Data.