
Always provide a relevant TI context in Dag callback #61274

Open

Asquator wants to merge 16 commits into apache:main from Asquator:bugfix/improve-dag-callback-context

Conversation


@Asquator Asquator commented Jan 31, 2026

This PR strives to make sense of the TI contexts passed to Dag callbacks. Rather than always passing the last task lexicographically, the new logic passes the last "relevant" task instance to the Dag's failure callback.

Behavior outline:

  1. On regular failure, pass the latest finished task out of the ones that could have contributed to the failure.
  2. On timeout, pass the task that started last out of the unfinished tasks.
  3. On "tasks deadlocked", pass the latest finished task.

Was generative AI tooling used to co-author this PR?
  • Yes (please specify the tool below)
    deepwiki.com was used to find the relevant logic in the repo, and an embedded chat was used to make a skeleton for TestDagRunGetFirstTiCausingFailure.

  • Read the Pull Request Guidelines for more information. Note: commit author/co-author name and email in commits become permanently public when merged.
  • For fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
  • When adding a dependency, check compliance with the ASF 3rd Party License Policy.
  • For significant user-facing changes create newsfragment: {pr_number}.significant.rst or {issue_number}.significant.rst, in airflow-core/newsfragments.

@Asquator Asquator requested review from XD-DENG and ashb as code owners January 31, 2026 00:33
@boring-cyborg boring-cyborg bot added the area:Scheduler including HA (high availability) scheduler label Jan 31, 2026
@Asquator Asquator changed the title Always provide a relevant task instance in Dag callback Always provide a relevant TI context in Dag callback Jan 31, 2026
.where(TI.dag_id == dag_run.dag_id)
.where(TI.run_id == dag_run.run_id)
.where(TI.state.in_(State.unfinished))
.where(TI.state.in_(State.unfinished) | (TI.state.is_(None)))
Author

@Asquator Asquator Jan 31, 2026

I thought that tasks with a null state should also always be marked as skipped (the in_ clause won't fetch them because of SQL NULL semantics).

Contributor

None is in unfinished states, so there is no need to change anything here

Author

From what I tried, the in_ clause doesn't catch the None value, which is in accordance with SQL semantics for NULL values. See also: https://stackoverflow.com/questions/6362112/in-clause-with-null-or-is-null
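
To illustrate with a self-contained SQLAlchemy snippet (table and values are invented for the demo; this is not Airflow code):

from sqlalchemy import Column, Integer, String, create_engine, select
from sqlalchemy.orm import Session, declarative_base

Base = declarative_base()

class DemoTask(Base):
    __tablename__ = "demo_task"
    id = Column(Integer, primary_key=True)
    state = Column(String, nullable=True)

engine = create_engine("sqlite://")
Base.metadata.create_all(engine)

with Session(engine) as session:
    session.add_all([DemoTask(state="running"), DemoTask(state=None)])
    session.commit()

    # IN never matches NULL rows, even if None were included in the Python list,
    # because "state IN (...)" evaluates to NULL (not TRUE) for a NULL state.
    via_in = session.scalars(
        select(DemoTask).where(DemoTask.state.in_(["running", "queued"]))
    ).all()

    # An explicit IS NULL predicate is needed to pick up the NULL-state row.
    via_in_or_null = session.scalars(
        select(DemoTask).where(DemoTask.state.in_(["running", "queued"]) | DemoTask.state.is_(None))
    ).all()

    print(len(via_in), len(via_in_or_null))  # 1 2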

@Asquator Asquator marked this pull request as draft February 4, 2026 19:43
@Asquator Asquator marked this pull request as ready for review February 4, 2026 20:26
Author

Asquator commented Feb 4, 2026

I want to get some feedback on the behavior: whether it's the desired one, or whether anything can be made better.

I will update the docs to explain the behavior of passed TIs to dag callbacks.

Contributor

@Nataneljpwd Nataneljpwd left a comment

Looks good. I have left a few comments on the changes; overall, great find!

.where(TI.state.in_(State.unfinished))
.where(TI.state.in_(State.unfinished) | (TI.state.is_(None)))
).all()
last_unfinished_ti = (
Contributor

What if it was a mapped task? Or what if there were concurrently running tasks? How do you decide?

Since the end date is not the only factor, the start date may be a better option here. Or maybe even return a few tasks if they were running concurrently, and check the dependencies of the task rather than relying solely on the end date.

Author

Well, start_date is exactly the field I'm using here. On timeout, we assume there are unfinished tasks, which means they have no end_date yet.
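
As a rough sketch of what that selection looks like (unfinished_tis is a hypothetical list of the run's unfinished TaskInstances, not the PR's actual variable):

# Unfinished TIs have no end_date yet, so pick the one that started most recently;
# TIs that never started (start_date is None) are skipped.
started_tis = [ti for ti in unfinished_tis if ti.start_date is not None]
last_started_ti = max(started_tis, key=lambda ti: ti.start_date, default=None)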


if not failed_leaf_tis:
    return None

Contributor

Why is the "if" necessary?
Maybe we can just return an empty array at the end if the check yields no results?

Author

It's a shortcut to avoid the logic below.

In the end, I use the min function, which doesn't operate on empty collections. The default argument catches that, and yet there's no reason to scan the DAG if for some reason we have no failed tasks. I think I should add a warning there, because this function shouldn't be called on non-failed DAGs.
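
For reference, min() raises a ValueError on an empty sequence unless a default is supplied, which is what the early return (and the default=None) guard against:

empty = []
# min(empty, key=...) would raise: ValueError: min() arg is an empty sequence
safe = min(empty, key=lambda ti: ti.end_date, default=None)
print(safe)  # None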

Comment on lines 1441 to 1453
# Collect all task IDs on failure paths
failure_path_task_ids = set()
for failed_leaf in failed_leaf_tis:
    leaf_task = dag.get_task(failed_leaf.task_id)
    upstream_ids = leaf_task.get_flat_relative_ids(upstream=True)
    failure_path_task_ids.update(upstream_ids)
    failure_path_task_ids.add(failed_leaf.task_id)

# Find failed tasks on possible failure paths
failed_on_paths = [
    ti for ti in tis
    if ti.task_id in failure_path_task_ids and ti.state == State.FAILED
]
Contributor

This part confuses me a little: why do we get ALL task instances (tis), just to get all of the previous task instances of the failed tasks (failure_path_task_ids), only to end up with the failed task instances again?

if ti.task_id in failure_path_task_ids and ti.state == State.FAILED
]

return min(failed_on_paths, key=lambda ti: ti.end_date, default=None)
Contributor

Why shouldn't we return multiple tasks if a few of them were running concurrently?

And why filter by end_date and not start_date?

Author

@Asquator Asquator Feb 6, 2026

Why shouldn't we return multiple tasks if a few of them were running concurrently?

Unfortunately, the current Context has to provide at least one TI under the "ti" key, even in DAG failure callbacks. I don't know whether that's good or bad (probably bad), but I decided to adhere to this rule for backward compatibility. I think the context user can still access the other tasks in the DAG, can't he?

And why filter by end_date and not start_date?

A good point. Logically, I wanted to return the task that ended last and could have caused the DAG to fail. On second thought, maybe it would be wiser to filter by start_date, since many tasks will be failed manually after the DAG fails.
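
For context on the "ti" key discussion above, a dag-level failure callback receives a single TaskInstance under "ti" but can still reach the rest of the run through the dag_run object; a rough sketch (callback body is illustrative and not part of this PR):

def on_dag_failure(context):
    ti = context["ti"]            # the single TI chosen by the scheduler
    dag_run = context["dag_run"]  # the rest of the run is still reachable
    for other_ti in dag_run.get_task_instances():
        print(other_ti.task_id, other_ti.state)

# Hypothetical wiring: DAG(..., on_failure_callback=on_dag_failure)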

)
elif dag.has_on_failure_callback:
    last_finished_ti: TI | None = (
        max(info.finished_tis, key=lambda ti: ti.end_date, default=None)
Contributor

Why do we make this check over and over again?
Can't we do it once?

Author

@Asquator Asquator Feb 7, 2026

We do this check once for a finished DAG. What do you mean?

@provide_session
def get_last_ti(self, dag: SerializedDAG, session: Session = NEW_SESSION) -> TI | None:
"""Get Last TI from the dagrun to build and pass Execution context object from server to then run callbacks."""
def get_first_ti_causing_failure(self, dag: SerializedDAG, session: Session = NEW_SESSION) -> TI | None:
Contributor

Minor nit, but maybe the name of the method could be changed to get_first_ti_causing_dagrun_failure.

# This ensures that we will only use the accessible TI
# context for the callback.

failed_leaf_tis = [
Contributor

So "leaf" here means the leaf task of a given dagrun? If so, maybe a better name would be last_failed_tasks.


Labels

area:Scheduler including HA (high availability) scheduler

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DAG on_failure_callback uses wrong context

2 participants