Skip to content

[Feature] DbtCompileOperator + allow partial_parse.msgpack to be read from S3/GCS/Azure Blob Storage. #870

Open
@dwreeves

Description

Our CICD uses a different config than what Cosmos does, and the difference in the profile is close to unavoidable, which makes it so partial parsing doesn't actually work:

Unable to do partial parsing because config vars, config profile, or config target have changed
Unable to do partial parsing because profile has changed
Unable to do partial parsing because env vars used in profiles.yml have changed

I think the only way you can reasonably get this to work is to have a DbtCompileOperator, and then read the partial_parse.msgpack from S3.

The output directory should be templated, ideally, so that you can use ti.xcoms_pull() or otherwise avoid any "clashing" across multiple simultaneous DAG runs. (Then at the end of each DAG run the user can do e.g. S3DeleteObjectsOperator() to clean things up at their leisure.)

Metadata

Assignees

No one assigned

    Labels

    area:parsingRelated to parsing DAG/DBT improvement, issues, or fixesdbt:compilePrimarily related to dbt compile command or functionalitydo-not-staleRelated to stale job and dosubotexecution:localRelated to Local execution environmentparsing:customRelated to custom parsing, like custom DAG parsing, custom DBT parsing, etctriage-neededItems need to be reviewed / assigned to milestone

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions