Skip to content

UCC initialization fails at mca_coll_ucc_save_coll_handlers(), when under HAN #9885

Open
@gkatev

Description

@gkatev

Hi, I'm seeing an initialization failure when combining coll/han with coll/ucc.

Environment:

Open MPI v5.0.x (#9605bed)
UCC master (#1d808c9)

Execution:

$(which mpirun) --host A:2,B:2 --mca coll libnbc,basic,ucc,han --mca pml ucx osu_allreduce

Leads to:

Error: coll_ucc_module.c:352 - mca_coll_ucc_module_enable() mca_coll_ucc_save_coll_handlers failed

I believe that the issue might stem from coll/ucc assuming that there will be a prior value/implementation present in comm->c_coll->coll_<primitive>, and errors out if one is not found.

I had also stumbled upon this question at some point and had more-or-less concluded that there are cases where a previous value won't be present -- Is this accurate? Or does the design say that there should always exist a previous value (eg. coll/base's?)?

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions