Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outage needs investigation #210

Open
cecilia-donnelly opened this issue Jul 11, 2016 · 5 comments
Open

Outage needs investigation #210

cecilia-donnelly opened this issue Jul 11, 2016 · 5 comments

Comments

@cecilia-donnelly
Copy link
Contributor

On [2016-07-07], the production site went down while it was on commit 9ba06df. We need to know what caused the outage, so that we can prevent it from happening again.

I think it might have been something to do with our connection to Mailgun (which we use to send emails), but we'll do more testing using the demo server to find out for sure.

@kfogel
Copy link
Member

kfogel commented Jul 12, 2016

We also need to set up an auto-alert service like uptimerobot, so we know right away about an outage like this. Just gonna dump some good links here -- @cecilia-donnelly, let's pick one and, as we say in IT, operationalize that sucker:

@cecilia-donnelly
Copy link
Contributor Author

Note: Our Mailgun logs don't show any problems. The Apache logs show that apache wasn't able to reach node. @kfogel suggests looking into npm_prefork or a similarly named Apache variable.

@kfogel
Copy link
Member

kfogel commented Aug 2, 2016

(We have a monitor at uptimerobot.com now, by the way, though that doesn't close this ticket obviously.)

@kfogel kfogel closed this as completed Aug 2, 2016
@kfogel
Copy link
Member

kfogel commented Aug 2, 2016

Oops, and then I hit the wrong button and accidentally closed it anyway, sigh. Re-opening.

@kfogel kfogel reopened this Aug 2, 2016
@kfogel
Copy link
Member

kfogel commented Feb 16, 2017

Actually, the err.log file does have Mailgun errors. I'm not sure when they're from (see issue #224 about date stamps in logs) but they're there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants