-
-
Notifications
You must be signed in to change notification settings - Fork 314
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
jdk_container left files owned by root #5358
Comments
These tests were added around one and half years ago. As it's dev level may not run frequently. I didn't notice there is this issue. Check recent jdk21 seems no this issue. https://ci.adoptium.net/view/Test_openjdk/job/Test_openjdk21_hs_dev.openjdk_x86-64_linux/ |
We should mark the node offline automatically when there is an error |
related: adoptium#5358 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: adoptium#5358 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: adoptium#5358 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: #5358 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
Is there any other way to clean up the crash files in the test code itself instead of marking it offline @llxia |
llxia is on vacation Related: |
I think we also need to know why this happens. Does it only happen when impl=openj9|ibm as no issue reported with jdk_container running against impl=hotspot. Normally this permission issue happens if you run things as root inside the container while using a mapped volume from the host inside the container. The jdk_container tests map volumes options are like |
Is there any updates on this issue? |
If I had to guess, it happens when a testcase fails and doesn't cleanup after itself, then the workspace can not be deleted. So @AswathySK perhaps check if that is the case and exclude the failing testcases. Lan is not back from vacation and no one is pursuing this issue further at this time. I suggest you dig in to answer some of the questions in this issue if you are interested in a different approach than taking the machine offline. |
Just a note that PR of making the node offine has also been reverted, which might help @AswathySK your investigation? |
@smlambert , when a test case fails it is not able to clean up after since the files created when it crashes are owned by root user. And yes I will do some more investigation on which all test cases we are seeing this issue. |
So my point is, the reason we do not have a cleanup problem for Temurin is that there is not a failing/crashing testcase. So your first task would be to see which testcase is crashing/failing, triage it by gathering any extra data you can, report the issue in the openj9 repo if it doesn't already exist, and exclude the test in the ProblemList files while the issue is being investigated and fixed by the openj9 team. |
related: adoptium#5358 and infrastructure/issues/9874 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: adoptium#5358 and infrastructure/issues/9874 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: adoptium#5358 and infrastructure/issues/9874 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
related: #5358 and infrastructure/issues/9874 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
Does the |
cleanWs() (at groovy level) happens after the jdk_container is exited. jdk_container is a openjdk tests which does not belong to us. |
Has anyone tried removing the files as the jenkins user via any random container as the root user? As much as that could be a hack workaround, it is probably easy to add a step to our cleanup scripts to remove everything inside the workspace dir via a container. cc @AswathySK |
related: adoptium#5358 and infrastructure/issues/9874 Signed-off-by: Lan Xia <Lan_Xia@ca.ibm.com>
jdk_container
left files on the host machine that are owned by root. These files cannot be cleaned by Jenkins job. It causes Jenkins job to fail.@sophia-guo @smlambert do you also see a similar issue at Adoptium Jenkins? Is there a better way to resolve this?
The text was updated successfully, but these errors were encountered: