- Albany, NY
- caseynbrown.com
Pinned
-
airflow
Public, forked from apache/airflow
Airflow is a system to programmatically author, schedule and monitor data pipelines.
Python
-
ethereum-etl
Public, forked from blockchain-etl/ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in…
Python
-
delta-io/kafka-delta-ingest
Public
A highly efficient daemon for streaming data from Kafka into Delta Lake
-
Spark Parquet
import java.io.Serializable;

import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.function.Function;
import org.apache.spark.sql.Dataset;