-
Notifications
You must be signed in to change notification settings - Fork 147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(recurring-job): avoid intermediate volume detachment during backup #2629
fix(recurring-job): avoid intermediate volume detachment during backup #2629
Conversation
439cf81
to
c1d4266
Compare
c1d4266
to
04e9213
Compare
aa5ea8e
to
08e8564
Compare
longhorn/longhorn-7937 Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>
08e8564
to
e3d4e35
Compare
Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>
e3d4e35
to
5bec953
Compare
Thank you for the idea! I have some feedbacks as:
Given the small benefit and the potential downside, I suggest that we don't go with this approach. However, if the team strongly agrees with this approach, it would be understandable for me. |
@PhanLe1010 , the intention of this PR is not to overtake your approach, but to avoid unnecessary intermediate detachment when recurring job runs backup.
Yes noticed, I am aiming for
The recurring job controller should handle the case (fixed locally but hasn't been pushed). https://github.com/longhorn/longhorn-manager/blob/master/controller/recurring_job_controller.go#L355
Could you elaborate?
I agree with your point here even though this case was specifically for the recurring job. I've closed this PR so we are not confused by its origin. This isn't a fix for longhorn/longhorn#7937; rather, it's an improvement stemming from it. We can reopen this if others think this would be beneficial. |
Hi @c3y1huang , the situation I was mentioning is that while the recurring job attaches the volume to take a backup, if the workload pod is scaled up and the pod is scheduled to a new node, the pod cannot attach the volume because the recurring job is still holding the volume on different node. If backup take 1 hour, the pod will not be able to start for 1 hour. Thank you for the explanation about other points! Yeah, we can see the feedback from others. If the team strongly agrees with this approach, it would be understandable for me. |
Which issue(s) this PR fixes:
Issue longhorn/longhorn#7937
What this PR does / why we need it:
This was initially created to resolve the issue in longhorn/longhorn#7937, and @PhanLe1010 proposed an alternative solution to tackle the problem.
However, Longhorn could still benefit by avoiding the unnecessary volume detachment during the recurring job backup. Currently, the recurring job runs the snapshot followed by the backup, resulting in an intermediate volume detachment.
Special notes for your reviewer:
None
Additional documentation or context