Stars
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Apache Spark - A unified analytics engine for large-scale data processing
Generate and visualize data lineage from query history
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
A standard infrastructure environment for Kubernetes
An end-to-end GoodReads data pipeline for building a data lake, data warehouse, and analytics platform.
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin