Conversation

@ephraimbuddy
Contributor

@ephraimbuddy ephraimbuddy commented Aug 13, 2024

This PR separates FAB migration from Airflow Core migration and provides a way for apps to integrate into Airflow and run their migrations.
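In practice, the split could look something like the following for operators; note that these command names are illustrative assumptions based on the direction of this PR, not confirmed by this thread:

```shell
# Hypothetical upgrade flow once FAB migrations are decoupled from core
# (command names below are assumptions for illustration only):
#
#   airflow db migrate        # run Airflow core metadata migrations only
#   airflow fab-db migrate    # run the FAB provider's migrations separately
```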

Member

@potiuk potiuk left a comment

NICE

@potiuk potiuk requested a review from vincbeck August 14, 2024 00:09
Contributor

@vincbeck vincbeck left a comment

🤯 Fantastic! I wish I could have done it while working on AIP-56! Thank you!

@ephraimbuddy ephraimbuddy force-pushed the separate-fab-db-from-airflow branch 4 times, most recently from 2ab293d to b54d98c Compare August 15, 2024 09:10
@ephraimbuddy ephraimbuddy marked this pull request as ready for review August 15, 2024 11:01
@ephraimbuddy ephraimbuddy force-pushed the separate-fab-db-from-airflow branch from c9210c6 to 504c078 Compare August 15, 2024 15:04
@ephraimbuddy
Contributor Author

Not sure why this test is failing in CI but doesn't fail locally: https://github.com/apache/airflow/actions/runs/10406470435/job/28819881902?pr=41437#step:7:5512

@potiuk
Member

potiuk commented Aug 16, 2024

Likely a side effect of other tests that was somewhat masked or avoided before the change - where some setup/teardown removed the side effect.

You can reproduce the set of tests run in CI with breeze testing db-tests --test-type CLI locally (for example, for CLI tests) - that should run the tests in the same sequence as in each of the parallel CI runs, and then they should reproducibly fail as well. It could also be caused by a new version of dependencies (look at the generate-constraints output of your build), but that's rather unlikely - the canary build https://github.com/apache/airflow/actions/runs/10420774982/ just got green and updated constraints without any test failures (however, you can always rebase and see if it fails in the same way).

Generally, these kinds of side effects are best investigated by a bit of guessing and bisecting - running a smaller and smaller subset of tests until you find the one that is the culprit. At least that's what I did in the past.

You should start by looking at the pytest command that was run in the original test type - unfold the red failing test type and you will see:

  Starting the tests with those pytest arguments: tests/cli --verbosity=0 --strict-markers --durations=100 --maxfail=50 --color=yes --junitxml=/files/test_result-cli-postgres.xml --timeouts-order moi --setup-timeout=60 --execution-timeout=60 --teardown-timeout=60 --disable-warnings -rfEX --run-db-tests-only --ignore=tests/system --ignore=tests/integration --warning-output-path=/files/warnings-cli-postgres.txt --ignore=helm_tests --with-db-init --no-cov
  
  ============================= test session starts ==============================
  platform linux -- Python 3.8.19, pytest-8.3.2, pluggy-1.5.0
  rootdir: /opt/airflow
  configfile: pyproject.toml
  plugins: icdiff-0.9, timeouts-1.2.1, instafail-0.5.0, custom-exit-code-0.3.0, rerunfailures-14.0, asyncio-0.23.8, time-machine-2.15.0, anyio-4.4.0, requests-mock-1.12.1, cov-5.0.0, mock-3.14.0, xdist-3.6.1
  asyncio: mode=strict
  setup timeout: 60.0s, execution timeout: 60.0s, teardown timeout: 60.0s
  collected 406 items
  
  tests/cli/commands/test_celery_command.py ..........                     [  2%]
  tests/cli/commands/test_cheat_sheet_command.py s                         [  2%]
  tests/cli/commands/test_config_command.py ssssssssssssssssss             [  7%]
  tests/cli/commands/test_connection_command.py .......................... [ 13%]
  .....................                                                    [ 18%]
  tests/cli/commands/test_dag_command.py ................................. [ 26%]
  ....................                                                     [ 31%]
  tests/cli/commands/test_dag_processor_command.py .                       [ 32%]
  tests/cli/commands/test_db_command.py .................................. [ 40%]
  .....................................                                    [ 49%]
  tests/cli/commands/test_info_command.py sssssssss..s                     [ 52%]
  tests/cli/commands/test_internal_api_command.py ssss...                  [ 54%]
  tests/cli/commands/test_jobs_command.py ......                           [ 55%]
  tests/cli/commands/test_kerberos_command.py ....                         [ 56%]
  tests/cli/commands/test_kubernetes_command.py ..........                 [ 59%]
  tests/cli/commands/test_legacy_commands.py sss                           [ 59%]
  tests/cli/commands/test_plugins_command.py ...                           [ 60%]
  tests/cli/commands/test_pool_command.py ...........                      [ 63%]
  tests/cli/commands/test_rotate_fernet_key_command.py ..                  [ 63%]
  tests/cli/commands/test_scheduler_command.py ...................         [ 68%]
  tests/cli/commands/test_standalone_command.py ssssssssssssss             [ 71%]
  tests/cli/commands/test_task_command.py ................................ [ 79%]
  .F............                                                           [ 83%]
  tests/cli/commands/test_triggerer_command.py ..                          [ 83%]
  tests/cli/commands/test_variable_command.py ...........                  [ 86%]
  tests/cli/commands/test_version_command.py s                             [ 86%]
  tests/cli/commands/test_webserver_command.py sssssssssss....             [ 90%]
  tests/cli/test_cli_parser.py ..................................s....     [100%]

If your test succeeds when run separately but fails when run as tests/cli, then a side effect is almost certainly the root cause. You can then guess which test is producing the side effect and run only that test together with the failing one to confirm your guess. Or attempt to bisect it:

In this case you might convert the single command:

  • pytest --run-db-tests-only tests/cli (that should fail locally for you as well)

into (looking at the output):

  • pytest --run-db-tests-only tests/cli/commands/test_celery_command.py tests/cli/commands/test_cheat_sheet_command.py ... tests/cli/commands/test_task_command.py

Then you can remove half of the modules from the list and run it again (you will then see whether the side effect comes from the removed half or the remaining half). Continue down that path - even to the single test that causes the side effect. Fixing it is then usually trivial: add the missing setup/teardown, or change the test so that it patches and restores any state.
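The halving step described above can be sketched as a small shell helper (hypothetical, for illustration; in a real bisection round you would feed its output to the actual pytest invocation shown earlier):

```shell
#!/bin/sh
# Hypothetical bisection helper: print the first half of a
# space-separated list of test modules, so each round re-runs
# half of the suspects together with the failing test.
first_half() {
    set -- $1
    n=$(( $# / 2 ))
    i=0
    out=""
    for m in "$@"; do
        i=$(( i + 1 ))
        if [ "$i" -le "$n" ]; then
            out="$out $m"
        fi
    done
    # strip the leading space before printing
    echo "${out# }"
}

modules="test_celery_command.py test_config_command.py test_dag_command.py test_db_command.py"
half=$(first_half "$modules")
echo "$half"
# One bisection round would then be, e.g.:
#   pytest --run-db-tests-only $half tests/cli/commands/test_task_command.py
```

If the failure reproduces with that half, keep halving it; otherwise switch to the other half and repeat until one module remains.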

It's slow and tedious, yes, but this is the way I've successfully traced root causes of similar issues in the past, and I have no idea how to do it faster.

@ephraimbuddy
Contributor Author

Thanks @potiuk for always helping. I'll look into these and fix them 🙏

@ephraimbuddy ephraimbuddy force-pushed the separate-fab-db-from-airflow branch 4 times, most recently from 1ebb982 to 2c47ab4 Compare August 19, 2024 15:45
@ephraimbuddy ephraimbuddy force-pushed the separate-fab-db-from-airflow branch 2 times, most recently from a5527b0 to 708b15e Compare August 20, 2024 08:21
@eladkal
Contributor

eladkal commented Aug 20, 2024

Wait.. what does this actually mean for users who are bumping providers only?
We have docs that explain the upgrade procedure: https://airflow.apache.org/docs/apache-airflow/stable/installation/upgrading.html#upgrading-airflow-to-a-newer-version

If migrations can now also run from bumping a provider only, that changes the upgrade procedure users should take.

@ephraimbuddy
Contributor Author

Wait.. what does this actually mean for users who are bumping providers only? We have docs that explain the upgrade procedure: https://airflow.apache.org/docs/apache-airflow/stable/installation/upgrading.html#upgrading-airflow-to-a-newer-version

If migrations can now also run from bumping a provider only, that changes the upgrade procedure users should take.

I will describe the upgrade procedure in my next PR, when I add the upgrade command for the FAB provider. It should be smooth.

@eladkal
Contributor

eladkal commented Aug 20, 2024

I will describe the upgrade procedure in my next PR, when I add the upgrade command for the FAB provider. It should be smooth.

I am worried here.
We may be dismissing the impact too lightly.
Did I miss a mailing list thread on this topic?

Here we are introducing something really new - bumping a provider version which also runs DB migrations. That is not a small change to how Airflow operates.

@ephraimbuddy ephraimbuddy force-pushed the separate-fab-db-from-airflow branch from 24facde to 167ecb6 Compare August 25, 2024 17:53
@ephraimbuddy ephraimbuddy merged commit 59dc981 into apache:main Aug 25, 2024
@ephraimbuddy ephraimbuddy deleted the separate-fab-db-from-airflow branch August 25, 2024 19:26