Employ consistent error handling policy and make fatal error conditions fail fast

Triggered by a discussion in the Alice WP3 mailing list about error handling, I propose to employ the following error handling "policy" (more in the sense of a guideline and less in the sense of a strict ruleset, simply to achieve more consistency over randomness) for FairMQ:

1. State pre and postconditions via asserts / comments, violations are code bugs and need fixing, not exceptions
   * [ ] Apply throughout the current FairMQ codebase and collect and document practical examples
   * [x] Decide whether to tie asserts to `NDEBUG` or a separate switch (`gsl`'s runtime asserts are not disabable at all even for release builds)
2. Prefer `noexcept` and strongly typed error codes via return channel - in the future: replace error codes with [static exceptions](http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0709r4.pdf)
   * [ ] Study cases where a `noexcept` interface internally calls `noexcept(false)` interfaces and decide if the "Prefer `noexcept`" can hold as a guideline.
   * [ ] Study cases where we already use C-style int error codes that do not distinguish error codes from return value by type and decide whether it is worth changing them.
   * [ ]  Decide for one of [`<system_error>`](https://en.cppreference.com/w/cpp/header/system_error), [`std::experimental::expected`](http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2018/p1051r0.pdf), [Boost.Outcome](https://www.boost.org/doc/libs/develop/libs/outcome/doc/html/index.html) or similar
   * [ ] Practical examples
3. Throw only unrecoverable (that lead to a terminate) error condition from free and member functions (should basically boil down to `std::bad_alloc` and `std::runtime_error("not implemented")`).
   * [ ] Understand the consequences of a full stack unwind + automatic terminate vs early [`std::terminate`](https://en.cppreference.com/w/cpp/error/terminate), especially in combination with `noexcept` interfaces, with regard to debuggability
4. Throw recoverable error conditions from ctors if class invariant cannot be established (to avoid 2-phase construction). Prefer to throw own exception types (possibly inheriting from a STL base exception type).
   * [ ] Practical examples

The above work-in-progress guidelines try to optimize for
1. debuggability/maintainability and
2. runtime performance (should really mainly be achieved via rule 1 and only on second order via rule 2).

Once the above points are finished:
* [ ] Introduce new section in FairMQ docs
* [ ] Report the results back to Alice to possibly be taken into account when developing further guidelines for Alice O2 projects (@ihrivnac suggested https://github.com/AliceO2Group/CodingGuidelines/master/coding_guidelines.html#Exceptions, @ktf suggested https://github.com/AliceO2Group/AliceO2/blob/dev/Framework/Core/README.md#error-handling)


Cc: @davidrohr, @ihrivnac, @ktf just FYI, this will take a bit to finish, but I guess we are not in a huge hurry here and it should not really block any activities in Alice O2 projects.

References:
* http://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#S-errors
* http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0709r4.pdf
* https://google.github.io/styleguide/cppguide.html#Exceptions
* https://www.youtube.com/watch?v=hNaLf8lYLDo
* https://www.youtube.com/watch?v=koTf7u0v41o

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Employ consistent error handling policy and make fatal error conditions fail fast #371

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Employ consistent error handling policy and make fatal error conditions fail fast #371

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions