add patch to fix Open MPI 4.1.5 with PMIx >= 4.2.3 #18833

Merged: 1 commit, Sep 20, 2023

bartoldeman
Contributor

I needed this to run any MPI application, whether through srun with PMIx or through plain mpirun. Without it, I ran into OOB/TCP communication errors even for "mpirun hostname" across two nodes (see open-mpi/ompi#11729 for another user with the same problem). This patch is taken from open-mpi/ompi#11472; with it, both srun and mpirun run flawlessly without further ado.
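
For readers unfamiliar with easyconfigs, here is a minimal sketch of how a patch like this is typically registered. Easyconfigs use Python syntax, and `patches` is the standard easyconfig parameter for this. The patch file name below is a hypothetical placeholder, since the actual name is not given in this thread.

```python
# Sketch of the relevant part of an OpenMPI 4.1.5 easyconfig.
# Easyconfig files are plain Python assignments read by EasyBuild.
name = 'OpenMPI'
version = '4.1.5'

# HYPOTHETICAL file name: the real patch name is not stated in this thread.
pmix_fix_patch = '%s-%s_hypothetical-pmix-fix.patch' % (name, version)

# EasyBuild applies each file listed in `patches` to the unpacked sources
# before the configure step runs.
patches = [pmix_fix_patch]
```

In a real easyconfig the patch would normally also get a matching entry in `checksums`; that is omitted here because the actual checksum is not known from this conversation.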

@boegel
Member

boegel commented Sep 19, 2023

@boegelbot please test @ generoso

@boegelbot
Collaborator

@boegel: Request for testing this PR well received on login1

PR test command 'EB_PR=18833 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs /opt/software/slurm/bin/sbatch --job-name test_PR_18833 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 11744

Test results coming soon (I hope)...

- notification for comment with ID 1725636581 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
cns2 - Linux Rocky Linux 8.5, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/boegelbot/a1eb4c484ff6246416c6d269f9ef6f52 for a full test report.

@SebastianAchilles
Member

@boegelbot please test @ jsc-zen2

@boegelbot
Collaborator

@SebastianAchilles: Request for testing this PR well received on jsczen2l1.int.jsc-zen2.easybuild-test.cluster

PR test command 'EB_PR=18833 EB_ARGS= EB_REPO=easybuild-easyconfigs /opt/software/slurm/bin/sbatch --mem-per-cpu=4000M --job-name test_PR_18833 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen2.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 3392

Test results coming soon (I hope)...

- notification for comment with ID 1726158115 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
jsczen2g1.int.jsc-zen2.easybuild-test.cluster - Linux Rocky Linux 8.5, x86_64, AMD EPYC 7742 64-Core Processor (zen2), Python 3.6.8
See https://gist.github.com/boegelbot/4e3654c3fc8feff12e01d85cd679de25 for a full test report.

@boegel
Member

boegel commented Sep 19, 2023

Test report by @boegel
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
node3141.skitty.os - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz (skylake_avx512), Python 3.6.8
See https://gist.github.com/boegel/f1b417bf9ea85f5d27b4934f8abf55a8 for a full test report.

@boegel changed the title from "Add patch fixing Open MPI 4.1.5 with PMIx >= 4.2.3" to "add patch to fix Open MPI 4.1.5 with PMIx >= 4.2.3" on Sep 20, 2023
@boegel
Member

boegel commented Sep 20, 2023

Going in, thanks @bartoldeman!
