Description
While testing malak with XIOS in server mode, I encountered an error towards the end of the application that appears to be related to the VarObs and CX Writer. The error is not very helpful, so I tested without the writer filters and the application runs. I suspect this is to do with the communicator group: when running in server mode, the jedi application does not use the MPI_COMM_WORLD comm. Instead it uses a split communicator where some PEs are used by the server.
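To illustrate the kind of mismatch I suspect, here is a toy sketch that uses Python threads in place of MPI ranks. It makes no real MPI, XIOS or OPS calls; `run_collective`, the PE counts and the timeout are all made up for illustration. A barrier sized for the full world stands in for a collective issued on the world communicator, and only the client PEs call it.

```python
import threading


def run_collective(group_size, participants, timeout=0.5):
    """Return True if all participants pass a barrier sized for group_size.

    The barrier stands in for an MPI collective: it only completes when
    every member of the group it was created for takes part.
    """
    barrier = threading.Barrier(group_size)
    results = []
    lock = threading.Lock()

    def rank_body():
        try:
            barrier.wait(timeout=timeout)
            ok = True
        except threading.BrokenBarrierError:
            # In real MPI there is no timeout: the ranks would just hang.
            ok = False
        with lock:
            results.append(ok)

    threads = [threading.Thread(target=rank_body) for _ in range(participants)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return all(results)


# Hypothetical layout: 6 PEs in total, 2 of them taken by the server.
WORLD_SIZE = 6
SERVER_PES = 2
CLIENT_PES = WORLD_SIZE - SERVER_PES

# Collective addressed to the whole world, but only the clients call it.
world_ok = run_collective(WORLD_SIZE, CLIENT_PES)
# Same collective addressed to the split (client-only) group.
split_ok = run_collective(CLIENT_PES, CLIENT_PES)

print("world-sized collective completed:", world_ok)
print("split-group collective completed:", split_ok)
```

If this is what is happening in the writer, the collective sized for the world never completes because the server PEs never join it, whereas the same call on the split group goes through.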
I did a search for MPI_COMM_WORLD and found a few references, the following in particular:
/// \note This filter must only be used with ObsSpaces using the \c MPI_COMM_WORLD communicator,
/// otherwise a deadlock will occur while writing the VarObs file. This is due to a limitation of
/// the \c Ops_WriteVarobs function, which could be removed by replacing \c mpl_comm_world in the
/// call to \c Ops_Mpl_Gatherv by \c mpi_group (for consistency with all other MPI calls in \c
/// Ops_WriteVarobs).
This suggests that some additional work is required if we want to use the XIOS server + varWriter together. I don't imagine that this is too much work. Adding a few people who may know:
@wsmigaj @DavidSimonin @ctgh @mikecooke77 @adammaycock @DJDavies2
If others have any ideas, please comment below.
Thanks, Steve.