mpi4py: run the spawn and dynamic process tests #12421

jsquyres · 2024-03-21T01:23:10Z

They're currently disabled by default upstream in mpi4py because Open MPI is failing these tests. Explicitly enable them here in Open MPI's github action for mpi4py so that we have to fix them.

See mpi4py/mpi4py#479 for some details.

FYI @bosilca @dalcinl @hppritcha

Once this merges, we'll bring this over to v5.0.x.

dalcinl · 2024-03-21T08:01:24Z

Once this merges, we'll bring this over to v5.0.x.

@jsquyres Are you sure? Isn't better to leave the release branch as it now, such that any mpi4py failures would signal an new, unknown regression in the release branch?

jsquyres · 2024-03-21T11:53:55Z

I thought about this overnight. I changed up this PR a bit, and split the Github Action into 4 parts:

build: do everything to build, configure, and install Open MPI and mpi4py
run: run all the mpi4py tests with its defaults. As of March 2024, this disables the spawn and dynamic tests, which means that the entire block of tests should pass.
run_spawn: run all the mpi4py tests, including the spawn tests. As of March 2024, we know some of these tests are failing.
run_dynamic: run all the mpi4py tests, including the dynamic tests. As of March 2024, we know some of these tests are failing.

Keeping the spawn and dynamics tests separate are useful because those are different failures. So let's run them separately.

This gives us a clear green checkmark if all the mpi4py tests -- excluding spawn and dynamic tests -- pass. Then it separately runs the spawn and dynamic tests, which we currently expect to fail.

Looks like I don't quite have the syntax correct yet for this in the github action -- let me go fool with this over in my fork and not spam everyone here with a bunch of force pushes until I get this right.

dalcinl · 2024-03-21T12:32:06Z

Looks like I don't quite have the syntax correct yet for this in the github action

Feel free to ping me about any doubts you have about GHA, I've been using the thing for quite a while.

jsquyres · 2024-03-21T20:00:57Z

@bosilca @hppritcha I revamped the github action quite a bit since you approved it. Please re-review.

Here's what the output looks like now in the CI on the PR:

And here's what it looks like if you click through to the github action result itself:

.github/workflows/ompi_mpi4py.yaml

jsquyres · 2024-03-26T16:13:57Z

General consensus on the call today:

Do not merge this PR as-is; there is (legit) fear of normalizing CI failures.
Instead, make the running of the spawn/dynamic tests contingent upon the presence of a github label or special token in a the PR body or comment or something.

That way, the tests are still there and able to be run, but a developer has to do something to run them.

.github/workflows/ompi_mpi4py.yaml

Split the mpi4py Github Action into 4 parts: 1. build: do everything to build, configure, and install Open MPI and mpi4py 2. run: run all the mpi4py tests with its defaults. As of March 2024, this disables the spawn and dynamic tests, which means that the entire block of tests should pass. 3. run_spawn: run all the mpi4py tests, including the spawn tests, but only if the "mpi4py" label is set on the PR. As of March 2024, we know some of these tests are failing. 4. run_dynamic: run all the mpi4py tests, including the dynamic tests, but only if the "mpi4py" label is set on the PR. As of March 2024, we know some of these tests are failing. The spawn and dynamic failures are different, so we split them up and run them separately. For steps 2, 3, and 4, we utilize a reusable Github workflow so that we don't have to duplicate the code. Signed-off-by: Jeff Squyres <jeff@squyres.com>

jsquyres · 2024-03-31T00:17:16Z

Ok, I think I got it now. Here's what a run looks like if the mpi4py-all label is not on the PR:

And here's what a run looks like if the mpi4py-all label is on the PR:

lrbison · 2024-04-02T15:02:29Z

.github/workflows/ompi_mpi4py.yaml

+      # This parameter is required, so send a meaningless
+      # environment variable name that will not affect the tests at
+      # all (i.e., the tests will be run with default values).
+      env_name: MAKE_TODAY_AN_OMPI_DAY


jsquyres requested review from bosilca and hppritcha March 21, 2024 01:23

github-actions bot added the Target: main label Mar 21, 2024

hppritcha approved these changes Mar 21, 2024

View reviewed changes

bosilca approved these changes Mar 21, 2024

View reviewed changes

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch 3 times, most recently from 58710e4 to 0fbbebd Compare March 21, 2024 11:43

jsquyres marked this pull request as draft March 21, 2024 11:54

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch 2 times, most recently from aa2129d to 5d05c49 Compare March 21, 2024 19:37

jsquyres marked this pull request as ready for review March 21, 2024 19:53

jsquyres requested review from hppritcha and bosilca March 21, 2024 19:53

bosilca approved these changes Mar 22, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Outdated Show resolved Hide resolved

dalcinl reviewed Mar 23, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Outdated Show resolved Hide resolved

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch from 5d05c49 to e96f5f7 Compare March 23, 2024 11:30

wenduwan approved these changes Mar 25, 2024

View reviewed changes

hppritcha approved these changes Mar 26, 2024

View reviewed changes

jsquyres marked this pull request as draft March 26, 2024 15:29

dalcinl reviewed Mar 27, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Show resolved Hide resolved

dalcinl reviewed Mar 27, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Show resolved Hide resolved

dalcinl reviewed Mar 27, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Outdated Show resolved Hide resolved

dalcinl reviewed Mar 27, 2024

View reviewed changes

.github/workflows/ompi_mpi4py.yaml Outdated Show resolved Hide resolved

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch from e96f5f7 to 635a663 Compare March 29, 2024 10:37

jsquyres added the mpi4py-all Run the optional mpi4py CI tests label Mar 29, 2024

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch from 635a663 to 027009a Compare March 30, 2024 23:32

jsquyres removed the mpi4py-all Run the optional mpi4py CI tests label Mar 30, 2024

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch from 027009a to e255ffc Compare March 30, 2024 23:45

jsquyres added the mpi4py-all Run the optional mpi4py CI tests label Mar 30, 2024

jsquyres force-pushed the pr/mpi4py-run-the-spawn-tests branch from e255ffc to deb4bda Compare March 30, 2024 23:54

jsquyres marked this pull request as ready for review March 31, 2024 00:17

lrbison approved these changes Apr 2, 2024

View reviewed changes

dalcinl approved these changes Apr 2, 2024

View reviewed changes

bosilca approved these changes Apr 2, 2024

View reviewed changes

jsquyres merged commit bebfc41 into open-mpi:main Apr 2, 2024

jsquyres deleted the pr/mpi4py-run-the-spawn-tests branch April 2, 2024 19:08

jsquyres mentioned this pull request Apr 2, 2024

v5.0.x: mpi4py: run the spawn and dynamic process tests #12450

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mpi4py: run the spawn and dynamic process tests #12421

mpi4py: run the spawn and dynamic process tests #12421

Uh oh!

jsquyres commented Mar 21, 2024

Uh oh!

dalcinl commented Mar 21, 2024

Uh oh!

jsquyres commented Mar 21, 2024 •

edited

Loading

Uh oh!

dalcinl commented Mar 21, 2024

Uh oh!

jsquyres commented Mar 21, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

jsquyres commented Mar 26, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jsquyres commented Mar 31, 2024

Uh oh!

lrbison Apr 2, 2024

Uh oh!

Uh oh!

mpi4py: run the spawn and dynamic process tests #12421

mpi4py: run the spawn and dynamic process tests #12421

Uh oh!

Conversation

jsquyres commented Mar 21, 2024

Uh oh!

dalcinl commented Mar 21, 2024

Uh oh!

jsquyres commented Mar 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dalcinl commented Mar 21, 2024

Uh oh!

jsquyres commented Mar 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jsquyres commented Mar 26, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jsquyres commented Mar 31, 2024

Uh oh!

lrbison Apr 2, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jsquyres commented Mar 21, 2024 •

edited

Loading

jsquyres commented Mar 21, 2024 •

edited

Loading