-
Notifications
You must be signed in to change notification settings - Fork 3.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wait container unable to start in windows on version >2.12.6 #5376
Comments
Paging our Windows expert @lippertmarkus |
Just tried the most basic workflow (https://argoproj.github.io/argo-workflows/windows/#schedule-hybrid-workflows) on v2.12.7 without a problem. Does that also don't work for you or is it just happening for a more complex workflow? Maybe something with volumes? Are you using the docker executor? Also could you please provide the Windows Build and Docker version? For me the error looks more like a problem with the host/container build version or the setup in general. Here's what I tried with:
|
It may well be a problem of the setup, as it's quite customized.
The difference I see is mostly the OS Image. I'll investigate this part then. |
Question is why does it work in version .6 but not in .7? From what I can see the docker image is built using OS version 1809, but that was the case for argo-workflows .6 as well. |
@mweibel Microsoft also updates the 1809 images with (security) fixes. The old .6 image may not had some of them at the time it was created. So you would also need to compare the whole revision/build number of the two images. |
KB4598230 is the difference, seemingly. Wondering if that really is the issue or not. Either I downgrade our nodes to a similar old version or try building argoexec using a newer version too. |
good question, I'm curious 😄 |
I built the images myself now, needed to upgrade the Go version to 1.13.6 to fix an issue with go + windows.
running those on my setup (using the this means v2.12.6 only worked because of an older windows build version. I'll try building using 1909 as base and report what happens there. If this also doesn't work I'll rebuild my nodes to get newer versions and see how that works... Edit:
with 1909 as base it seems to work (I'll need to verify with actually deploying argo using that version, but seemingly 1809 and 1909 are not compatible, despite microsoft saying it should not matter. Can argo build 1909 based images additionally? |
Rather difficult, GitHub Workflows only provide a 1809 runner. |
Oh, true. This (windows containers) is such a pain (especially if you're used to non windows systems ;)). I guess we'll stick to building our own images for the near future then. |
I can relate. According to Microsoft differences within the revision shouldn't affect container functionality, but this is an example where that statement unfortunately doesn't hold. Do you want to add that to the Limitations (https://argoproj.github.io/argo-workflows/windows/#limitations)? |
fixes argoproj#5376 Signed-off-by: Michael Weibel <michael@helio.exchange>
Summary
Currently running argo workflows version 2.12.0-rc2 and tried to upgrade to 2.12.9 a few days ago.
I noticed the workflows failing due to the wait container. Workflows are being sent the exact same way.
After deploying everything from the working 2.12.0-rc2 I had before up to 2.12.10, I noticed the issue seemed to appear in 2.12.7 (-> 2.12.6 is the last working version).
Error is related to workflows running on windows and can be reproduced by running:
Error message I see:
I'm unsure what could be the issue as not that much changed in 2.12.7 (v2.12.6...v2.12.7, basicly only #4946 seems remotely relevant but I don't think it's the issue).
Has there been some update to the way windows containers are built?
Diagnostics
What Kubernetes provider are you using?
Rancher
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.
The text was updated successfully, but these errors were encountered: