Skip to content
View eric9204's full-sized avatar

Block or report eric9204

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 11,872 3,333 Updated Sep 16, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,996 1,226 Updated Sep 16, 2025

The Internals of Spark SQL

475 136 Updated Sep 12, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,889 1,421 Updated Sep 16, 2025

The native Rust implementation for Apache Hudi, with C++ & Python API bindings.

Rust 251 51 Updated Sep 16, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 44,271 6,374 Updated Sep 16, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 149,854 30,413 Updated Sep 16, 2025

Apache Flink

Java 25,263 13,781 Updated Sep 15, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,935 2,439 Updated Sep 16, 2025