-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Outage needs investigation #210
Comments
We also need to set up an auto-alert service like uptimerobot, so we know right away about an outage like this. Just gonna dump some good links here -- @cecilia-donnelly, let's pick one and, as we say in IT, operationalize that sucker: |
Note: Our Mailgun logs don't show any problems. The Apache logs show that apache wasn't able to reach node. @kfogel suggests looking into |
(We have a monitor at uptimerobot.com now, by the way, though that doesn't close this ticket obviously.) |
Oops, and then I hit the wrong button and accidentally closed it anyway, sigh. Re-opening. |
Actually, the |
On [2016-07-07], the production site went down while it was on commit 9ba06df. We need to know what caused the outage, so that we can prevent it from happening again.
I think it might have been something to do with our connection to Mailgun (which we use to send emails), but we'll do more testing using the demo server to find out for sure.
The text was updated successfully, but these errors were encountered: