(Support Case) fix rook check to allow manual fix only when upgrading from Rook #4599
scripts/common/rook.sh
echo ""
echo "$output"
echo ""

# If the health output points at the dashboard or prometheus mgr modules,
# disable both to try to bring Ceph back to a healthy state.
if [[ $output == *"Module 'dashboard' has failed"* ]] || [[ $output == *"Module 'prometheus'"* ]]; then
    echo "Disable modules to try fix status"
    kubectl -n rook-ceph exec deployment/rook-ceph-tools -- ceph mgr module disable prometheus
    kubectl -n rook-ceph exec deployment/rook-ceph-tools -- ceph mgr module disable dashboard
fi
It is the DELTA ONLY
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-d8fc113-openebs-3.7.0-k8s-docker-localpv-2023-06-09T14:35:39Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-d8fc113-openebs-3.6.0-k8s-docker-localpv-2023-06-09T14:35:45Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-d8fc113-openebs-3.3.0-k8s-docker-localpv-2023-06-09T14:36:01Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-d8fc113-openebs-3.5.0-k8s-docker-localpv-2023-06-09T14:36:11Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-d8fc113-openebs-3.4.0-k8s-docker-localpv-2023-06-09T14:36:12Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-fe2cc63-openebs-3.7.0-k8s-docker-localpv-2023-06-09T14:50:21Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-fe2cc63-openebs-3.6.0-k8s-docker-localpv-2023-06-09T14:50:24Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-fe2cc63-openebs-3.5.0-k8s-docker-localpv-2023-06-09T14:50:38Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-fe2cc63-openebs-3.3.0-k8s-docker-localpv-2023-06-09T14:50:54Z
Testgrid Run(s) Executing @ https://testgrid.kurl.sh/run/pr-4599-fe2cc63-openebs-3.4.0-k8s-docker-localpv-2023-06-09T14:50:55Z
@rrpolanco I found a nit: c236023
Could you please re-check?
c236023 to 14a40a7
What this PR does / why we need it:
In certain edge cases, we may encounter issues while migrating away from Rook. Specifically, after we run pvmigrate to migrate the PVCs and then migrate the object store, the system may transition to an unhealthy state. This problem appears to be connected to specific Ceph manager modules (the check looks for dashboard and prometheus).
The proposed workaround ensures a smooth transition during the migration and upgrade processes, ultimately allowing for the successful deletion of Rook. To this end, this PR automates the resolution process by rectifying the Rook Ceph state and allowing the migration to proceed, given that Rook will be removed in the end. It's important to note that this automated fix is only applied during the checks performed when we are in the process of migrating away from Rook and when Rook's removal is the intended outcome.
Because of this, we are duplicating the check so that only the migration process uses the variant that applies the fix.
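One way to keep the fix isolated to the migration path is sketched below. This is a rough illustration, not the code in this PR: the function names `rook_failed_mgr_modules` and `rook_disable_failed_mgr_modules` are hypothetical, and unlike the reviewed snippet (which disables both modules when either is reported, and matches the exact phrase `Module 'dashboard' has failed`), this sketch matches `Module '<name>'` loosely and disables only the modules that appear in the health output.

```shell
#!/usr/bin/env bash
# Hypothetical helpers; names are illustrative and not taken from scripts/common/rook.sh.

# Print the Ceph mgr modules mentioned in the health output, one per line.
# Pure function: it only inspects the string passed as $1.
function rook_failed_mgr_modules() {
    local output="$1"
    local module
    for module in dashboard prometheus; do
        if [[ $output == *"Module '${module}'"* ]]; then
            echo "$module"
        fi
    done
}

# Disable the reported modules. Meant to be called only from the
# "migrating away from Rook" check, never from the general health check.
function rook_disable_failed_mgr_modules() {
    local output="$1"
    local module
    while read -r module; do
        [ -z "$module" ] && continue
        echo "Disabling Ceph mgr module: $module"
        # </dev/null keeps kubectl from consuming the loop's input
        kubectl -n rook-ceph exec deployment/rook-ceph-tools -- \
            ceph mgr module disable "$module" </dev/null
    done < <(rook_failed_mgr_modules "$output")
}
```

Splitting detection from the `kubectl` calls keeps the string matching testable without a cluster, and the general health check can stay read-only.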
Which issue(s) this PR fixes:
Fixes # [sc-79289]
Special notes for your reviewer:
This automated fix is meant to avoid support calls in scenarios where they aren't needed. Note that we recheck the status after applying the workaround, before continuing with the process.
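The recheck after the workaround could look roughly like the sketch below. It assumes a polling approach with a bounded number of attempts; whether the PR polls or rechecks once is not stated here, and the function name `wait_for_ceph_health_ok` is hypothetical. The health command is passed in as arguments so the loop can be exercised without a cluster.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: poll a health command until it reports HEALTH_OK.
# $1: max attempts, $2: delay in seconds, remaining args: command that prints health.
function wait_for_ceph_health_ok() {
    local attempts="$1" delay="$2"
    shift 2
    local i output=""
    for ((i = 1; i <= attempts; i++)); do
        # Capture stdout and stderr; a failing command counts as "not healthy yet"
        output="$("$@" 2>&1)" || true
        if [[ $output == *"HEALTH_OK"* ]]; then
            return 0
        fi
        if ((i < attempts)); then
            sleep "$delay"
        fi
    done
    echo "Ceph is still unhealthy after ${attempts} checks:" >&2
    echo "$output" >&2
    return 1
}
```

A call in the migration path might be `wait_for_ceph_health_ok 30 10 kubectl -n rook-ceph exec deployment/rook-ceph-tools -- ceph health`, where the attempt count and delay are illustrative values.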
Steps to reproduce
Does this PR introduce a user-facing change?
Does this PR require documentation?