-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docker input panic when decode fails #1052
Comments
@sparrc I'm dealing with problem now in 0.12.1 so in an attempt to patch it I added the error handling code you wrote (36d330f) to the 0.12.1 tag. This does remove the exception and prevent telegraf from crashing but I am still having issues. Telegraf will run fine until I do a deploy of my software then it will hang showing no new entries in the log. (The deploy process kills a running container, the starts a new one with a new tag of the same image.) If I delete the killed container than telegraf recovers and continues sending metrics (before the patch this is where I would see the exception). 2016/05/05 18:56:03 Gathered metrics, (10s interval), from 10 inputs in 3.558962691s You can see in the logs that it hung gathering metrics for 36 minutes. |
Thanks @sparrc, I patched in the code from master using context.WithTimeout and it seems to have corrected the issue. |
This bug was discovered by our cloud team. This can happen if there is a decode error.
The text was updated successfully, but these errors were encountered: