-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
inputs.mqtt_consumer reconnects after network interruption but only yields timeouts thereafter #8722
Comments
This bug affects us, too.
We can't tell if it is caused by any network interruptions. Restarting telegraf helps but telegraf doesn't immediately react to SIGTERM, instead it must be terminated with SIGKILL. The Problem might be related to #8562 which had been merged into 1.17. |
I'm also affected by this bug. I can confirm exact same behaviour from @AndreasUfert Edit: i just hit it again, here is a recent log output:
|
Yes, I also encountered it. only restarting telegraf can solve it
|
To add on to my own initial report at the top of this issue, I can say we've also seen Telegraf fall into this timeout behavior after simply running for an extended period of time. Our issue is reliably reproducible upon a specific network interruption (daily network device reboot). But, we've since noticed that if Telegraf runs for 4 or 5 days even without a network interruption, MQTT metric subscriptions eventually fall into a mode of consistent, repeating timeouts exactly as what happens when a network interruption occurs. |
Relevant telegraf.conf:
Note: The MQTT Broker is AWS IoT Core.
System info:
Telegraf 1.17
Ubuntu 14.04.5
Docker
docker-compose.yaml:
Steps to reproduce:
Expected behavior:
Recovery of MQTT subscriptions after broker reconnection following network interruption.
Actual behavior:
After a connection interruption Telegraf successfully reconnects to the MQTT broker but collection results in timeouts. Only restarting Telegraf fixes the problem.
This problem was reliably reproducible each morning at 6am when a network device automatically rebooted and interrupted the connection to the AWS IoT Core MQTT Broker. Removing the automatic device reboot solved the problem.
Additional info:
The text was updated successfully, but these errors were encountered: