Description
We're running SolidQueue as a Puma plugin on a Rails 8 app, as our job processing load is currently quite small.
We recently had an incident where the server running Puma temporarily lost the connection to Postgres. This caused SolidQueue to crash with this message:
PQconsumeInput() FATAL: terminating connection due to administrator command (PG::
ConnectionBad)
server closed the connection unexpectedly
This probably means the server terminated abnormally
before or while processing the request.
and this in turn took down Puma:
Detected Solid Queue has gone away, stopping Puma...
- Gracefully stopping, waiting for requests to finish
I was able to reproduce this locally by shutting down Postgres after starting Rails.
When running Rails without the SolidQueue Puma plugin, if the database goes away, Rails throws an error when it tries to do something with the database, but Puma stays up and the connections recover when the database comes back online.
If I run SolidQueue separately, via bin/jobs
, it also crashes if the database goes away.
Obviously SolidQueue can't be expected to do much without a database, but would it be reasonable for it to behave as Rails does when the db goes offline, i.e. pause its activity and reconnect when the db is available again?
Thanks for all your work on this - SolidQueue has been a fantastic addition to Rails!