- Los Angeles, CA
-
arrow-rs Public
Forked from apache/arrow-rsOfficial Rust implementation of Apache Arrow
Rust Apache License 2.0 UpdatedMar 14, 2023 -
docker-stacks Public
Forked from jupyter/docker-stacksReady-to-run Docker images containing Jupyter applications
Python Other UpdatedMar 10, 2023 -
nessie Public
Forked from projectnessie/nessieNessie provides Git-like capabilities for your Data Lake
Java Apache License 2.0 UpdatedMar 6, 2023 -
iceberg Public template
Forked from apache/icebergApache Iceberg (Incubating)
Java Apache License 2.0 UpdatedMar 5, 2023 -
orc Public
Forked from apache/orcApache ORC - the smallest, fastest columnar storage for Hadoop workloads
HTML Apache License 2.0 UpdatedJan 16, 2023 -
avro Public
Forked from apache/avroApache Avro is a data serialization system.
Java Apache License 2.0 UpdatedJan 15, 2023 -
docker-spark-iceberg Public
Forked from databricks/docker-spark-iceberg -
delta Public
Forked from delta-io/deltaAn open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Scala Apache License 2.0 UpdatedSep 14, 2022 -
parquet-mr Public
Forked from apache/parquet-javaApache Parquet
Java Apache License 2.0 UpdatedSep 13, 2022 -
-
scalingpythonml Public
Forked from scalingpythonml/scalingpythonmlScaling Python Machine Learning
Jupyter Notebook Apache License 2.0 UpdatedAug 22, 2022 -
k8s-device-plugin Public
Forked from NVIDIA/k8s-device-pluginNVIDIA device plugin for Kubernetes
Go Apache License 2.0 UpdatedAug 22, 2022 -
message-backend-ray Public
Forked from PigsCanFlyLabs/message-backend-rayA Ray port of the message backend
Python Apache License 2.0 UpdatedAug 22, 2022 -
arctic Public
Forked from apache/amoroArctic is a streaming lake warehouse service open sourced by NetEase
Java Apache License 2.0 UpdatedAug 16, 2022 -
-
-
graviton2-workshop Public
Forked from aws-samples/graviton-workshopPython MIT No Attribution UpdatedAug 3, 2022 -
trino Public
Forked from trinodb/trinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Java Apache License 2.0 UpdatedAug 1, 2022 -
mermaid Public
Forked from mermaid-js/mermaidGeneration of diagram and flowchart from text in a similar manner as markdown
JavaScript MIT License UpdatedJul 21, 2022 -
ngods-stocks Public
Forked from zsvoboda/ngods-stocksNew Generation Opensource Data Stack Demo
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedJul 14, 2022 -
jnr-ffi Public
Forked from jnr/jnr-ffiJava Abstracted Foreign Function Layer
Java Other UpdatedJul 1, 2022 -
superset Public
Forked from apache/supersetApache Superset is a Data Visualization and Data Exploration Platform
TypeScript Apache License 2.0 UpdatedJun 24, 2022 -
-
spark-cassandra-connector Public
Forked from datastax/spark-cassandra-connectorDataStax Spark Cassandra Connector
Scala Apache License 2.0 UpdatedJun 22, 2022 -
ozone Public
Forked from apache/ozoneScalable, redundant, and distributed object store for Apache Hadoop
Java Apache License 2.0 UpdatedJun 17, 2022 -
dbt-spark Public
Forked from dbt-labs/dbt-sparkdbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
Python Apache License 2.0 UpdatedJun 16, 2022 -
python-zstandard Public
Forked from indygreg/python-zstandardPython bindings to the Zstandard (zstd) compression library
C BSD 3-Clause "New" or "Revised" License UpdatedJun 13, 2022 -
querybook Public
Forked from pinterest/querybookQuerybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
TypeScript Apache License 2.0 UpdatedMay 30, 2022 -
presto Public
Forked from prestodb/prestoThe official home of the Presto distributed SQL query engine for big data
Java Apache License 2.0 UpdatedMay 29, 2022 -
academy Public
Forked from anyscale/academyRay tutorials from Anyscale
Jupyter Notebook Apache License 2.0 UpdatedMay 29, 2022