-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(DR): update dr to v3.0.0 rc, refactor schedules, add new snapshotController and snapshotMoveData features #304
feat(DR): update dr to v3.0.0 rc, refactor schedules, add new snapshotController and snapshotMoveData features #304
Conversation
…templates to include snapshot-controller deployment, and enabling volume snapshotting on OnPremises and KFDDistribution providers. Complete refactor of velero schedules settings, simplifying and improving the definition
…apshotMoveData field. Adding also snapshotController installation flag
…, since EKS will install it's own with the aws module
@nutellino maybe the problem is due to the fact that we are backing up all Velero's resources, which does not seem to be recommended: https://github.com/vmware-tanzu/velero/issues/7057#issuecomment-1794110211 I haven't found any specific directive about this in Velero docs, actually. On the other hand, I see that there are some MushHave resources needed to successfully restore a backup: and from what I understand, DataUpload is needed for restoring volumes. I think it would require further investigation to clearly understand what happened, and what we can do |
No I did some more testing and the issue arise when there are more than one backup for the same volume in kopia. I added some more edits with the workaround in the issue description |
Update: If you create a backup without starting from the full schedule, for example with:
All the restore from backups even subsequent ones are working fine. The only issue arise if you do a backup starting from the schedule full @luigidematteis is trying to find the reason behind this |
This PR introduces several enhancements and new features to the Disaster Recovery (DR) module, including improvements to schedule customization, new options for snapshot data movement, and the ability to optionally install a snapshot-controller. It also includes a breaking schema change to improve usability. Detailed explanations and examples are provided below.
Checklist for Testing 🧪
dr
module version with Kopia set as the default uploader.deletionPolicy: Retain
, snapshots can still be restored even if the namespace is deleted.kopia repository connect s3 --bucket=velero --prefix=kopia/<namespacename>/ --endpoint="yourendpoint" --access-key=xxxx --secret-access-key=xxxx --region=xxxx --disable-tls
, using passwordstatic-passw0rd
andkopia snapshot list --all -l
and finallykopia snapshot restore ke7c2317576cfc5336a966ee040d68a3d /path-to-restore-to
can be used to restore data. NOTE: You need kopia cli installedDetails
DR improved configurable schedules:
The schedule configuration has been updated to enhance the usability of schedule customization (note: this is a breaking change):
DR snapshotMoveData options for full schedule:
A new parameter has been introduced in the Velero
full
schedule to enable the snapshotMoveData feature. This feature allows data captured from a snapshot to be copied to the object storage location.Important: Enabling this parameter will cause Velero to upload all data from the snapshotted volumes to S3 using Kopia. While backups are deduplicated, significant storage usage is still expected. To enable this parameter in the full schedule:
General example to enable Volume Snapshotting on rook-ceph (from our storage add-on module):
deletionPolicy: Retain
is important because if the volume snapshot is deleted from the namespace, the cluster-wideVolumeSnapshotContent
CR will be preserved, maintaining the snapshot on the storage that the cluster is using.DR optional snapshot-controller installation:
To leverage VolumeSnapshots on the
OnPremises
andKFDDistribution
providers, a new option on Velero has been added to install thesnapshot-controller
component.Important: Before activating this parameter, make sure that there is no other
snapshot-controller
component deployed in your cluster. By default, this parameter isfalse
.Breaking Changes 💔
A new format for the schedule customization has been introduced to improve usability.