Aggregate ingestion errors over reingestion

## Problem

If a DAG has an expected ingestion error, we can use `skip_ingestion_errors` to cause it to skip over them during ingestion and report them _once in aggregate_ at the end of ingestion.

There is no equivalent configuration for skipping errors during **reingestion**. Very often when there is an error in a reingestion workflow, it occurs for multiple ingestion days, and a separate Slack notification is received for _each one_. This can have the effect of flooding the Slack channel and obscuring other alerts. 

We should instead aggregate errors that occur over the ingestion days and report them in a single Slack notification at the end of reingestion.

## Description

A preliminary idea: we could pass some context in to the ProviderDataIngester so that it knows when it is running as part of a reingestion workflow. Then, if some Airflow configuration variable is set, we report errors via XCOM instead of sending the Slack notification. In `report_load_completion`, we aggregate and report.


## Implementation

- [ ] 🙋 I would be interested in implementing this feature.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aggregate ingestion errors over reingestion #1379

stacimc
openedon Oct 25, 2022

Problem

Description

Implementation

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Aggregate ingestion errors over reingestion #1379

Description

stacimcopenedon Oct 25, 2022

Problem

Description

Implementation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

stacimc
openedon Oct 25, 2022