- San Jose, CA
-
spark Public
Forked from apache/sparkMirror of Apache Spark
-
iceberg Public
Forked from apache/icebergApache Iceberg
-
arrow-datafusion-comet Public
Forked from apache/datafusion-cometApache Arrow DataFusion Comet Spark Accelerator
Rust Apache License 2.0 UpdatedSep 23, 2024 -
arrow-datafusion Public
Forked from apache/datafusionApache Arrow DataFusion SQL Query Engine
Rust Apache License 2.0 UpdatedApr 8, 2024 -
arrow-rs Public
Forked from apache/arrow-rsOfficial Rust implementation of Apache Arrow
Rust Apache License 2.0 UpdatedAug 5, 2023 -
trino Public
Forked from trinodb/trinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Java Apache License 2.0 UpdatedDec 6, 2022 -
parquet-mr Public
Forked from apache/parquet-javaApache Parquet
Java Apache License 2.0 UpdatedJun 18, 2022 -
spark-website Public
Forked from apache/spark-websiteApache Spark Website
Apache License 2.0 UpdatedJan 28, 2022 -
hudi Public
Forked from apache/hudiUpserts, Deletes And Incremental Processing on Big Data.
Java Apache License 2.0 UpdatedNov 10, 2021 -
hyperspace Public
Forked from microsoft/hyperspaceAn open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Scala Apache License 2.0 UpdatedJul 23, 2021 -
jvm-profiler Public
Forked from uber-common/jvm-profilerJVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
Java Other UpdatedJun 4, 2021 -
orc Public
Forked from apache/orcApache ORC - the smallest, fastest columnar storage for Hadoop workloads
HTML Apache License 2.0 UpdatedMay 26, 2021 -
parquet-format Public
Forked from apache/parquet-formatApache Parquet
Java Apache License 2.0 UpdatedApr 22, 2021 -
delta Public
Forked from delta-io/deltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala Apache License 2.0 UpdatedApr 15, 2021 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedDec 26, 2019 -
scikit-learn Public
Forked from scikit-learn/scikit-learnscikit-learn: machine learning in Python
Python Other UpdatedDec 26, 2019 -
presto Public
Forked from prestodb/prestoDistributed SQL query engine for big data
Java Apache License 2.0 UpdatedApr 18, 2018 -
spark-examples Public
Forked from dportabella/spark-examplesofficial spark examples adapted for sbt
Scala Apache License 2.0 UpdatedJul 18, 2016 -
spark-redshift Public
Forked from databricks/spark-redshiftSpark and Redshift integration
Scala Apache License 2.0 UpdatedSep 23, 2015