Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failed to ping server fix #42

Merged
merged 2 commits into from
Aug 1, 2019
Merged

Failed to ping server fix #42

merged 2 commits into from
Aug 1, 2019

Conversation

msaizar
Copy link
Contributor

@msaizar msaizar commented Aug 1, 2019

The server will still be in a running state after ping failure, and the autoscaler will keep trying to reprovision it. The stopped time is set, but the state is not updated. This puts it in a stopped state, similar to https://github.com/drone/autoscaler/blob/master/engine/collect.go#L108-L109 and https://github.com/drone/autoscaler/blob/master/engine/reaper.go#L121-L122

The server will still be in a running state after ping failure, and the autoscaler will keep trying to reprovision it. The stopped time is set, but the state is not updated. This puts it in a stopped state, similar to https://github.com/drone/autoscaler/blob/master/engine/collect.go#L108-L109 and https://github.com/drone/autoscaler/blob/master/engine/reaper.go#L121-L122
engine/pinger.go Outdated Show resolved Hide resolved
@bradrydzewski
Copy link
Member

good catch. I posted a minor comment with regards to which status I think we should use, but otherwise should be good to merge.

@msaizar
Copy link
Contributor Author

msaizar commented Aug 1, 2019

Updated. The reaper took care of it eventually I think but I'll look into the GC, I haven't tried it yet, thank you! I also noticed that the pinger is not closing connections. On long running agents, you can see a lot of docker connections from the autoscaler with netstat -an, one for every 10 minutes the agent was up. Haven't figured out a fix yet but I'll open a discourse topic on that

@bradrydzewski
Copy link
Member

bradrydzewski commented Aug 1, 2019

thanks!

The reaper took care of it eventually I think but I'll look into the GC

yes, sorry, I was actually referring to the reaper

I also noticed that the pinger is not closing connections

the Ping function is actually the Docker Client code. So if there is a connection leak it would unfortunately be in the Docker client library. So we would need to send them a patch.

@bradrydzewski bradrydzewski merged commit 89450d6 into drone:master Aug 1, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants