Skip to content

Simple dockerized DAG of scraping date, and creating/inserting into tables in postgres via Airflow orchestration.

Notifications You must be signed in to change notification settings

nssensalo/airflow-postgres-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gradiant line

Purpose

To orchestrate a DAG on Airflow as well as dockerizing airflow, postgres, and pgadmin. The data is scraped from AWS and is simply a table of current Data Engineering books. The DAG is scheduled to run once a day. This project will be expanded upon with more complexity in DAG tasks, including modeling with dbt, creation of aggregated tables and marts, and SCD 2 type upserting.

Run

Install Docker

run: docker compose up

view airflow: localhost:8080

view pgadmin:localhost:5050

username:admin

password:admn

View DAG diagram:

gradiant line

Query table in pgAdmin:

gradiant line

Watch for run status:

gradiant line

About

Simple dockerized DAG of scraping date, and creating/inserting into tables in postgres via Airflow orchestration.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published