Commit f3a5b37

Merge pull request #8628 from jjhursey/v4.0-fix-orterun-man
v4.0.x: Fix/Cleanup the return value documentation for mpirun
2 parents 240e167 + 7885d08 commit f3a5b37

1 file changed (+6, -18 lines changed)

orte/tools/orterun/orterun.1in

Lines changed: 6 additions & 18 deletions
@@ -1787,24 +1787,12 @@ exit status of the primary job. Any non-zero exit status in secondary jobs will
 reported solely in a summary print statement.
 .
 .PP
-By default, OMPI records and notes that MPI processes exited with non-zero termination status.
-This is generally not considered an "abnormal termination" - i.e., OMPI will not abort an MPI
-job if one or more processes return a non-zero status. Instead, the default behavior simply
-reports the number of processes terminating with non-zero status upon completion of the job.
-.PP
-However, in some cases it can be desirable to have the job abort when any process terminates
-with non-zero status. For example, a non-MPI job might detect a bad result from a calculation
-and want to abort, but doesn't want to generate a core file. Or an MPI job might continue past
-a call to MPI_Finalize, but indicate that all processes should abort due to some post-MPI result.
-.PP
-It is not anticipated that this situation will occur frequently. However, in the interest of
-serving the broader community, OMPI now has a means for allowing users to direct that jobs be
-aborted upon any process exiting with non-zero status. Setting the MCA parameter
-"orte_abort_on_non_zero_status" to 1 will cause OMPI to abort all processes once any process
-exits with non-zero status.
-.PP
-Terminations caused in this manner will be reported on the console as an "abnormal termination",
-with the first process to so exit identified along with its exit status.
+By default, the job will abort when any process terminates with non-zero
+status. The MCA parameter "orte_abort_on_non_zero_status" can be set to
+"false" (or "0") to cause OMPI to not abort a job if one or more
+processes return a non-zero status. In that situation the OMPI records
+and notes that processes exited with non-zero termination status to
+report the approprate exit status of \fImpirun\fP (per bullet points above).
 .PP
 .
 .\" **************************
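
As a rough illustration of the behavior the added text describes (this is not part of the commit), the parameter could be turned off for a single run roughly as follows. The application name `./my_app` and the process count are placeholders, and the sketch assumes Open MPI's usual convention of reading MCA parameters from `OMPI_MCA_<parameter>` environment variables or from `--mca <parameter> <value>` on the `mpirun` command line.

```shell
#!/bin/sh
# Hypothetical application and process count; substitute your own.
APP=./my_app
NP=4

# Ask Open MPI not to abort the job when a process exits non-zero.
# Assumed equivalent command-line form:
#   mpirun --mca orte_abort_on_non_zero_status 0 -np 4 ./my_app
export OMPI_MCA_orte_abort_on_non_zero_status=0

if command -v mpirun >/dev/null 2>&1 && [ -x "$APP" ]; then
    mpirun -np "$NP" "$APP"
    # mpirun's own exit status follows the rules in the man page's
    # return-value section.
    echo "mpirun exit status: $?"
else
    echo "mpirun or $APP not found; nothing was launched"
fi
```

With the parameter left at its default, the man page text above says the whole job is aborted as soon as any process terminates with non-zero status.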
