-
Notifications
You must be signed in to change notification settings - Fork 457
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
observation: docker system dial-stdio processes do not die #837
Comments
What are you running on ubuntu-4gb-hel1-mortimer-1? Is it used as the remote builder? |
It is - and a staging server (following the current litany out of Chicago = solid_queue, Kamal, SQLite and “1 container to rule them all”) 😉- btw: I’m in tears regarding the work put into this making so much developer happiness 🥰 So: a huge thank you to all contributors!!CheersWaltherDen 13. jun. 2024 kl. 15.48 skrev Donal McBreen ***@***.***>:
What are you running on ubuntu-4gb-hel1-mortimer-1? Is it used as the remote builder?
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: ***@***.***>
|
I think the For now I'd suggest moving the remote builder to it's own server to avoid affecting your app. |
yup - that’s the ’signature’
Good advice on the “separation of concerns” 😅
Cheers,
Walther
… Den 13. jun. 2024 kl. 18.11 skrev Donal McBreen ***@***.***>:
I think the docker system dial-stdio processes are related to the connections to your remote builder then. We have seen similar problems with ours. Looks maybe a bit like this - https://forums.docker.com/t/docker-continuously-making-unnecessary-ssh-connections-to-remote-servers/136132?
For now I'd suggest moving the remote builder to it's own server to avoid affecting your app.
—
Reply to this email directly, view it on GitHub <#837 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AABPFXK7XA4GWAC2FXZYSVDZHHACHAVCNFSM6AAAAABJGEAZ36VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCNRWGEZDGMBYGE>.
You are receiving this because you authored the thread.
|
disclaimer: If anyone ends up here either b/c the host dies (well it is less dramatic than that but it runs out of memory so it is more like it ends up in a kind of "coma" -- a kamal infused coma you might say) under the weight of dial-stdio or realizing that the client (your CPU - mac/linux/pc) keeps every single ssh connection used by Kamal open 'till hell freezes, or you show them the kitchen door 😉 This is the clean ups: # your client
kill -HUP `ps aux | grep 'ConnectTimeout' | awk '{ print $2}'`
# host
kill $(ps ax | grep 'docker system dial-stdio' | grep Ssl | awk '{print $1}') |
I am having similar problems, although I think there are two separate problems here. I use kamal 2.1.1 to deploy from my MBP to three low cost Ubuntu servers. The one chosen as the remote builder runs out of memory in something like 18-24 hours and has to be restarted, after which the process repeats. If I manage to get onto the server before it runs out of memory there are around 100 Problem 2 is that there are also lots of processes on the laptop from which I deploy, 56 as I write, of the form: |
Update: the problem is This was raised in March in a Docker community forum post. |
Update 2: as we don't strictly need to use a remote builder, we dropped the |
@rogermarlow I fiddled on with the script(s) and now it looks like this (I have one named
notes: |
Perhaps Kamal could stop the builder when its work is done: docker buildx stop kamal-remote-ssh--username-hostname Can try that out in your app with a post-deploy hook. In #!/bin/bash
docker buildx stop kamal-remote-ssh--yourbuilderusername-yourbuilderhostname (Note: assumes Kamal 2 builder naming convention. Adjust for older Kamal 1 builder names like |
not sure what is going on - but my observation is a slowly "degenerating system" as I keep deploying; if this is just me (not knowing to be) high as a kite on ethanol, please apologise me wasting your bandwidth 🙏
Symptoms
Either deploys fail - or takes forever - and service response is measurably below par
Diagnostics
Remediation
I'm barking up the kamal communicates via the
npipe
helped by docker system dial-stdio tree - suspecting the "remote" process not knowing when to exit so hangs around indefinitely - just a (wild) guess 🤷🏻♂️Somehow signaling the process to 'go die' would perhaps solve the matter - in a perfect world not until the deploy has finished (either exit 0 or exit something) but otherwise after each command --
Reproduction
All I do is
kamal env push && kamal deploy
- once/twice pr 2hr slot - effectively demanding a reboot every other daySystem
it's a rental, what can I say 😉
happy user of Hetzner services
and the ruby/rails env is
and finally Kamal is
The text was updated successfully, but these errors were encountered: