Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Existing Airflow DAG and dbt model cleanup for every data source #264

Open
jrlegrand opened this issue Mar 7, 2024 · 0 comments
Open

Existing Airflow DAG and dbt model cleanup for every data source #264

jrlegrand opened this issue Mar 7, 2024 · 0 comments

Comments

@jrlegrand
Copy link
Member

Problem Statement

We need a wholesale review of our current Airflow DAGs and dbt setup for each data source from beginning to end.

Criteria for Success

Every checkbox in the DAGs and Work section of our Roadmap is checked and complete.

Airflow Standardized

  • Abstraction
    • Use airflow operator to create dag and tasks
    • Common tasks used to centralize logic
  • Loaded
    • Sources loaded to data lake

Dbt Buildout

  • Transformations in staging models
  • Combinations in intermediate models
  • Final table in prod schema mart model

Data Documented

  • Source data documented
  • Data source reference linked
  • Any additional tables documented

Additional Information

Roadmap: https://docs.google.com/document/d/1LTWPbgTdGrlRq2qi5KAyj68rtyGJfYRB_luUIN_1WFo/edit#heading=h.p5fi4rrf2kkp

Go down to the DAGs and Work section at the bottom.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

1 participant