A collection of AI skills for working with Dagster
-
Updated
Jun 1, 2026 - Python
A collection of AI skills for working with Dagster
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
Build and ship production ML pipelines faster: a pipeline library with an optional self-hosted visual layer for modular, reproducible workflows, local testing, and experiment tracking.
Data-aware orchestration with dagster, dbt, and airbyte
A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran
Get started with Dagster ASAP
A simple pipeline infrastructure with ETL pipeline contained in a Docker environment on Apache Airflow for orchestration and Postgres for data warehousing
Introduction to using and scaling dagster
Develop a real-time data ingestion pipeline using Kafka and Spark. Collect minute-level stock data from Yahoo Finance, ingest it into Kafka, and process it with Spark Streaming, storing the results in Cassandra. Orchestrated the workflow using Airflow deployed on Docker.
EHR pipeline that simulates MIMIC-IV patient data streams, performs advanced feature engineering and clinical severity scoring using machine learning (Random Forest Classifier), and prepares structured outputs for scalable downstream analytics
Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.
Build an ELT pipeline with dagster and dbt to schedule loading HDB resale transactions in Singapore into Google BigQuery data warehouse, then create Power BI dashboard to enhance insight exploration.
Data orchestration repo with Docker deployment
Prefect - Data orchestration tool practice & learning
Cloud-agnostic Airflow MLOps sandbox combining parallelized data pipelines with ML engineering tooling (MinIO, MLflow, Qdrant, RAPIDS) for end-to-end experimentation and observability.
A poor-man's data lake fill with ducks
📡 Modern Data Orchestration: Multi-layer ETL pipeline (Raw→Business) using Dagster for cable modem signal analytics.
Trying out Data Engineering
Add a description, image, and links to the data-orchestration topic page so that developers can more easily learn about it.
To associate your repository with the data-orchestration topic, visit your repo's landing page and select "manage topics."