Skip to content

btl/ofi: fixes for multi mpi init/fini scenarios #13020

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jan 8, 2025

Conversation

hppritcha
Copy link
Member

@hppritcha hppritcha commented Jan 6, 2025

The OFI BTL was caching a context -in the case of normal OFI EPs - across multiple sessions init/finalize, resulting in segfaults.

Also, there were some symbols being exported which should not have been.

Related to #13019

The OFI BTL was caching a context -in the case of normal OFI EPs - which needed across multiple
sessions init/finalize and resulting in segfaults.

Also, there were some symbols being exported which should not have been.

Related to open-mpi#13019

Signed-off-by: Howard Pritchard <hppritcha@gmail.com>
Turns out that when Open MPI is configured with --enable-mca-dso
and is using the OFI MTL/BTL/common, a problem is brought out
with the patcher framework the second time through closing the
bml and hence btl frameworks.

See issue open-mpi#13021.

This patch fixes this problem.

Signed-off-by: Howard Pritchard <hppritcha@gmail.com>
@hppritcha hppritcha merged commit 3907c2c into open-mpi:main Jan 8, 2025
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants