Open
Description
Hi, I'm seeing an initialization failure when combining coll/han
with coll/ucc
.
Environment:
Open MPI v5.0.x (#9605bed)
UCC master (#1d808c9)
Execution:
$(which mpirun) --host A:2,B:2 --mca coll libnbc,basic,ucc,han --mca pml ucx osu_allreduce
Leads to:
Error: coll_ucc_module.c:352 - mca_coll_ucc_module_enable() mca_coll_ucc_save_coll_handlers failed
I believe that the issue might stem from coll/ucc assuming that there will be a prior value/implementation present in comm->c_coll->coll_<primitive>
, and errors out if one is not found.
I had also stumbled upon this question at some point and had more-or-less concluded that there are cases where a previous value won't be present -- Is this accurate? Or does the design say that there should always exist a previous value (eg. coll/base
's?)?