Closed
Description
In some IM discussions with @rhc54, we discovered that opal_show_help()
message aggregation is no longer working (on both master and v2.x). This is a regression compared to v1.10.x.
There's two reasons it isn't working:
a) In orte/util/orte_show_help.c
, the following checks are done to see if we should render the help message locally or relay it up to the HNP:
if (ORTE_PROC_IS_HNP || ORTE_PROC_IS_TOOL ||
orte_standalone_operation ||
NULL == orte_rml.send_buffer_nb ||
NULL == orte_routed.get_route ||
NULL == orte_process_info.my_hnp_uri) {
// do local render
} else {
// relay to HNP
}
orte_rml.send_buffer_nb
and orte_routed.get_route
will now always be NULL in MPI procs.
b) Even if we delete these 2 checks (in two places, BTW), orte_show_help tries to use the RML to relay the help message up to the HNP. But there's no RML.
@rhc54 says it will take a bit of work to fix this.