-
incubator-celeborn Public
Forked from apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
-
spark Public
Forked from apache/spark
Mirror of Apache Spark
-
celeborn-website Public
Forked from apache/celeborn-website
Apache Celeborn Site
Shell Apache License 2.0 Updated Apr 29, 2024 -
incubator-uniffle Public
Forked from apache/uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
Java Apache License 2.0 Updated Oct 30, 2023 -
-
iceberg Public
Forked from apache/iceberg
Apache Iceberg
Java Apache License 2.0 Updated Sep 22, 2023 -
ec2-selector-cli Public
A CLI tool to select EC2 instances based on filters
Rust Apache License 2.0 Updated Apr 23, 2023 -
velox-intel Public
Forked from oap-project/velox
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
C++ Apache License 2.0 Updated Feb 24, 2023 -
frameless Public
Forked from typelevel/frameless
Expressive types for Spark.
Scala Apache License 2.0 Updated Feb 21, 2023 -
-
gazelle_plugin Public
Forked from oap-project/gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Scala Apache License 2.0 Updated Aug 21, 2022 -
incubator-sedona Public
Forked from apache/sedona
A cluster computing framework for processing large-scale geospatial data
Java Apache License 2.0 Updated Aug 10, 2022 -
terraform-aws-eks-node-group Public
Forked from cloudposse/terraform-aws-eks-node-group
Terraform module to provision a fully managed AWS EKS Node Group
HCL Apache License 2.0 Updated Mar 2, 2022 -
arrow-datafusion Public
Forked from apache/datafusion
Apache Arrow DataFusion and Ballista query engines
Rust Apache License 2.0 Updated Aug 20, 2021 -
delta Public
Forked from delta-io/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala Apache License 2.0 Updated Apr 19, 2021 -
how-query-engines-work Public
Forked from andygrove/how-query-engines-work
This is the companion repository for the book How Query Engines Work.
Kotlin Apache License 2.0 Updated Apr 10, 2021 -
spark-sql-macros Public
Forked from hbutani/spark-sql-macros
Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom code gets compiled to equivalent Catalyst Expressions at macro …
Scala Apache License 2.0 Updated Mar 17, 2021 -
spark-lineage Public
Forked from thesquelched/spark-lineage
Spark SQL listener to record lineage information
Scala Apache License 2.0 Updated Jan 24, 2021 -
xgboost Public
Forked from dmlc/xgboost
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
C++ Apache License 2.0 Updated Jan 4, 2021 -
cockroachdb_playground Public
Some programs to play around with CockroachDB
Python Apache License 2.0 Updated Jan 3, 2021 -
cockroachdb-todo-apps Public
Forked from cockroachdb/cockroachdb-todo-apps
CockroachDB To-Do Apps
Python Apache License 2.0 Updated Nov 1, 2020 -
noisepage Public
Forked from cmu-db/noisepage
Self-Driving Database Management System from Carnegie Mellon University
C++ MIT License Updated Oct 22, 2020 -
rabit Public
Forked from dmlc/rabit
Reliable Allreduce and Broadcast Interface for distributed machine learning
C++ BSD 3-Clause "New" or "Revised" License Updated Dec 21, 2019 -
xgboost4j-spark-scalability Public
A benchmark to test the scalability of xgboost4j-spark and related projects
-
morpheus Public
Forked from opencypher/morpheus
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Scala Apache License 2.0 Updated May 9, 2019 -
-
BigDL Public
Forked from intel/ipex-llm
BigDL: Distributed Deep Learning Library for Apache Spark
Scala Apache License 2.0 Updated Apr 26, 2019 -
analytics-zoo Public
Forked from intel/BigDL
Distributed Tensorflow, Keras and BigDL on Apache Spark
Jupyter Notebook Apache License 2.0 Updated Apr 25, 2019 -
github-markdown-toc Public
Forked from ekalinin/github-markdown-toc
Easy TOC creation for GitHub README.md
Shell MIT License Updated Apr 19, 2019 -
dmlc-core Public
Forked from dmlc/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
C++ Other Updated Mar 24, 2019