Restore ability to mark task groups as success/failed in UI #60161

arjav1528 · 2026-01-06T11:56:04Z

Description

This PR adds two new React components that restore the task group marking functionality:

MarkGroupTaskInstanceAsButton - A dropdown button component that allows users to select a state (success/failed) and opens a dialog for confirmation
MarkGroupTaskInstanceAsDialog - A dialog component that provides:
- Options to include past/future/upstream/downstream task instances
- Dry-run preview showing which task instances will be affected
- Ability to add notes to the state change
- Bulk update using the task instance bulk API for efficiency

The button is integrated into the Group Task Instance page header, alongside the existing Clear button.

Solves #60121

Closes: #56103

bohdan-pd · 2026-01-06T12:31:14Z

@arjav1528 Thank you for working on that! 👍

arjav1528 · 2026-01-06T18:56:29Z

> @arjav1528 Thank you for working on that! 👍

Is the UI OK, or do you want me to refactor it?

bbovenzi

Thanks for taking this on! Do you mind doing more manual testing to find where MarkAs isn't quite working? I think there are some issues with the API that we need to tackle before we can merge any UI changes.

...low-core/src/airflow/ui/src/components/MarkAs/TaskInstance/MarkGroupTaskInstanceAsDialog.tsx

providers/snowflake/tests/unit/snowflake/hooks/test_snowflake_sql_api.py

bbovenzi

I tried using this locally and still had issues. Could you both manually test this out and also add a FastAPI test for dry runs?

arjav1528 · 2026-01-07T19:50:01Z

> I tried using this locally and still had issues. Could you both manually test this out and also add fastapi test for dry runs?

Sorry about the trouble. I initially couldn’t reproduce the issue locally, but I have now reproduced and fixed it. You can run it locally now.

…ooks and update request body validation

…o the service layer

…ate update logic and removing listener triggers

…ion for improved clarity and reuse in task instance patching

… ID, including dry run functionality and improved state management

…sk_instances.py Co-authored-by: Jason(Zhe-You) Liu <68415893+jason810496@users.noreply.github.com>

… affected instances and streamline updates for both single task instances and task groups

… instances in patching process

…points, enhancing dry run functionality and validation logic

…sk_instances.py Co-authored-by: Jason(Zhe-You) Liu <68415893+jason810496@users.noreply.github.com>

…llection of unique task instances in patching logic

… based on test group and downgrade SQLAlchemy flag

…tances to ensure correct handling of failed states

…r improved clarity and consistency

…up_id over identifier, improving error handling and validation flow

arjav1528 · 2026-01-23T20:17:47Z

@jason810496 could you please review the PR and merge it if it is good to go?

jason810496

Hi @arjav1528, thank you for your patience and for the update.

I recommend reviewing the earlier comments, as some of the same issues have not yet been fully resolved. Thanks!

jason810496 · 2026-01-24T14:31:16Z

.github/workflows/run-unit-tests.yml

+      # Default is 60 minutes; Min SQLAlchemy providers DB runs can exceed that slightly.
+      TOTAL_TEST_TIMEOUT: "${{ (inputs.test-group == 'providers' && inputs.downgrade-sqlalchemy == 'true') && '3900' || '3600' }}"
    if: inputs.test-group == 'core' || inputs.skip-providers-tests != 'true'


I think this is a bad rebase. The change doesn't seem to be related to the PR.

This still exists and needs another rebase.

airflow-core/src/airflow/api_fastapi/core_api/routes/public/task_instances.py

…ce function to simplify code and improve readability

…fier for task group, enhancing clarity in API requests

… for task group, improving API request structure

…bulk update logic to remove dry_run parameter for improved clarity

… for consistency across test groups

arjav1528 · 2026-01-24T19:24:13Z

Thanks for the reviews @jason810496 @pierrejeambrun @bbovenzi! I've addressed all the feedback:

Backend fixes:

✅ Removed the unreachable duplicate return statement in patch_task_instance
✅ Created a separate bulk_task_instances_dry_run endpoint instead of using a dry_run query parameter - this avoids the union return type (BulkResponse | TaskInstanceCollectionResponse) that was causing client-side confusion
✅ Removed the unused _validate_patch_request_body function (dead code)
✅ Reverted the unrelated CI timeout change in run-unit-tests.yml
Frontend fixes:

✅ Fixed usePatchTaskGroup.ts to call the correct endpoint {identifier}?task_group_id=... instead of the non-existent {task_group_id}
✅ Fixed usePatchTaskGroupDryRun.ts to call dry_run?task_group_id=... instead of dry_run

The UI was previously calling endpoints that didn't exist in the backend, which would have caused all task group mark-as operations to fail with 404 errors.
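
For illustration, here is a rough sketch of the corrected URL shape, with the task group passed as a query parameter instead of a path segment. The helper name `buildTaskGroupUrl`, the base path, and the sample values are assumptions for this example, not code from the PR.

```typescript
// Hypothetical helper (not from the PR): builds a URL where the task group
// is sent as a `task_group_id` query parameter rather than a path segment.
const buildTaskGroupUrl = (
  basePath: string,
  identifier: string,
  taskGroupId: string,
): string => {
  // URLSearchParams handles escaping of the query value.
  const query = new URLSearchParams({ task_group_id: taskGroupId });

  return `${basePath}/${encodeURIComponent(identifier)}?${query.toString()}`;
};

// The base path and identifiers below are placeholders for illustration only.
const patchUrl = buildTaskGroupUrl("/taskInstances", "my_task", "group_a");
const dryRunUrl = buildTaskGroupUrl("/taskInstances", "dry_run", "group_a");
```

Both hooks share the same shape in this sketch; only the path segment before the query string differs.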

Ready for another review when you have time!

…y and consistency in API requests

guan404ming

Many backend functions are still incomplete. How about keeping only the backend changes in this PR to make it easier to review and implement? Thanks!

guan404ming · 2026-01-30T11:57:27Z

airflow-core/src/airflow/ui/src/queries/useBulkUpdateTaskInstances.test.ts

I think this mostly tests mock wiring rather than real logic. Maybe we could cover this with an e2e test instead of unit tests. We could handle that in a follow-up.

guan404ming · 2026-01-30T11:57:35Z

airflow-core/src/airflow/ui/src/queries/useBulkUpdateTaskInstancesDryRun.test.ts

guan404ming · 2026-01-30T11:58:43Z

airflow-core/src/airflow/ui/src/queries/usePatchTaskGroup.ts

+    }
+  };
+
+  // Use direct API call until OpenAPI types are generated


I am not really sure why we need to call it directly here.

guan404ming · 2026-01-30T11:59:26Z

.github/workflows/run-unit-tests.yml

+      # Default is 60 minutes; Min SQLAlchemy providers DB runs can exceed that slightly.
+      TOTAL_TEST_TIMEOUT: "${{ (inputs.test-group == 'providers' && inputs.downgrade-sqlalchemy == 'true') && '3900' || '3600' }}"
    if: inputs.test-group == 'core' || inputs.skip-providers-tests != 'true'


This still exists and needs another rebase.

pierrejeambrun

Overall direction looks good. A few comments/suggestions, thanks for the PR.

(feel free to directly resolve comments you have addressed so we can easily identify where work remains)

pierrejeambrun · 2026-01-30T14:04:30Z

airflow-core/src/airflow/api_fastapi/core_api/routes/public/task_instances.py

+    service = BulkTaskInstanceService(
        session=session, request=request, dag_id=dag_id, dag_run_id=dag_run_id, dag_bag=dag_bag, user=user
-    ).handle_request()
+    )
+    return service.handle_request()


This change doesn't seem related and should be removed.

pierrejeambrun · 2026-01-30T14:04:50Z

airflow-core/src/airflow/api_fastapi/core_api/routes/public/task_instances.py

+@task_instances_router.patch(
+    task_instances_prefix + "/dry_run",
+    dependencies=[Depends(requires_access_dag(method="PUT", access_entity=DagAccessEntity.TASK_INSTANCE))],
+    operation_id="bulk_task_instances_dry_run",
+)
+def bulk_task_instances_dry_run(
+    request: BulkBody[BulkTaskInstanceBody],
+    session: SessionDep,
+    dag_id: str,
+    dag_bag: DagBagDep,
+    dag_run_id: str,
+    user: GetUserDep,
+) -> TaskInstanceCollectionResponse:
+    """Bulk update task instances dry run - returns affected task instances without making changes."""
+    service = BulkTaskInstanceService(
+        session=session, request=request, dag_id=dag_id, dag_run_id=dag_run_id, dag_bag=dag_bag, user=user
+    )
+    return service.handle_request_dry_run()


This change doesn't seem related and should be removed.

pierrejeambrun · 2026-01-30T14:49:39Z

...low-core/src/airflow/ui/src/components/MarkAs/TaskInstance/MarkGroupTaskInstanceAsDialog.tsx

+  const downstream = selectedOptions.includes("downstream");
+
+  // eslint-disable-next-line unicorn/no-null -- DAGRunResponse["note"] type requires null, not undefined
+  const [note, setNote] = useState<string | null>(null);


Initialise with "" and replace by null at the backend call so you don't need that eslint-disable-next-line unicorn/no-null.

Check other uses of [note, setNote] in the code base; we never have to ignore this rule.
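
A minimal sketch of that pattern, assuming a hypothetical helper named `toRequestNote` (the name and sample note are illustrative, not from the code base). Whether the conversion site needs any lint handling of its own depends on the project's ESLint configuration.

```typescript
// Hypothetical helper (name is an assumption): the form state stays a plain
// string, and an empty note becomes null only when the request body is built.
const toRequestNote = (note: string): string | null =>
  note === "" ? null : note;

// Component state would then start as an empty string, e.g. useState("").
const emptyNote = toRequestNote("");
const filledNote = toRequestNote("Marked as success after manual check");
```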

pierrejeambrun · 2026-01-30T14:55:25Z