OpenMPI/4.1.5-GCC-12.3.0 (foss/2023a) cannot communicate across nodes #18914
Comments
Is this built using the patched version in #18833?
I don't think so; can I check that somehow? It was built last week, but I think it was built using a
EDIT: I'll try building from that patch in my private folder, overriding the system package.
The easyconfig and patches used in the build are in the
Checking my build shows I will need to rebuild, as that patch is not included:
Thank you very much, @branfosj. The patch is not included, so we'll rebuild. I'll report back here and close the issue if it helps.
Hi,
We cannot get OpenMPI/4.1.5-GCC-12.3.0 (foss/2023a) to work. It works fine on a single compute node, but across two or more nodes it fails with the error
mpi4py hello-world script.
OMPI_MCA_btl='^openib,ofi' on the OmniPath nodes.
-mca pml_base_verbose 10 -mca mtl_base_verbose 10 does not give any extra info; the crash happens before the usual verbose output is printed.
OMPI_MCA_btl, OMPI_MCA_mtl and OMPI_MCA_pml resulted in the same crash regardless of the values, which is consistent with the crash happening before the verbose output from selecting these is printed.

CC: @OleHolmNielsen
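
For anyone who wants to repeat the test, the settings mentioned above can be collected into one launch command. This is only a rough sketch, not the exact invocation used in the report: the script name `hello.py`, the rank count, and the helper `build_repro` are hypothetical, and node or scheduler options are left out. Only the `OMPI_MCA_btl` value and the two verbose flags are taken from the report.

```python
import os
import shlex


def build_repro(script="hello.py", ranks=2, verbose=True):
    """Return (env, argv) for a test run with the settings from the report.

    `script` and `ranks` are placeholders (assumptions), not values from the issue.
    """
    env = dict(os.environ)
    # Exclude the openib and ofi BTLs, as tried on the OmniPath nodes.
    env["OMPI_MCA_btl"] = "^openib,ofi"

    argv = ["mpirun", "-n", str(ranks)]
    if verbose:
        # Verbose output for PML and MTL component selection.
        argv += ["-mca", "pml_base_verbose", "10",
                 "-mca", "mtl_base_verbose", "10"]
    argv += ["python", script]
    return env, argv


if __name__ == "__main__":
    env, argv = build_repro()
    # Print a copy-pasteable shell line instead of launching anything.
    print("OMPI_MCA_btl=" + shlex.quote(env["OMPI_MCA_btl"]), shlex.join(argv))
```

Passing the returned `env` and `argv` to `subprocess.run(argv, env=env)` inside a multi-node allocation would start the job; the sketch only prints the command so it can be inspected first.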