Infinite Fleet Cluster patching loop #313

@anmazzotti

Description

Occasionally, when fleetaddonconfig.spec.cluster.patchResource is true, CAAPF gets stuck in a loop, continuously patching the Fleet Cluster multiple times a second.

2025-05-28T11:04:13.368719Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=853a9720607e2ff8c16d2a046445d80c name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
2025-05-28T11:04:13.405243Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=8e3e9d7d1d3d9d661a05a3f0a2482720 name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
2025-05-28T11:04:13.448025Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=0c0a5092af425cb5530ac2753c206b1f name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
2025-05-28T11:04:13.499983Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=c0c64692fc94acbe2a5c063c7548b35c name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
2025-05-28T11:04:13.549389Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=a8a5df77bf4a1123370b0ac5ef2514c5 name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
2025-05-28T11:04:13.591034Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object object.ref=Cluster.v1beta1.cluster.x-k8s.io/pvala-capz-kubeadm-single.default object.reason=related object updated: Cluster.v1alpha1.fleet.cattle.io/pvala-capz-kubeadm-single.default name="pvala-capz-kubeadm-single" namespace="default" reconcile_id=935bc68bed8f6c10ec68bea091a3e45d name="pvala-capz-kubeadm-single" namespace="default" api_version="fleet.cattle.io/v1alpha1" kind="Cluster"
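
To put a number on "multiple times a second", the log excerpt above can be measured with a small script. This is only a sketch: the helper names `parse_patch_events` and `patches_per_second` are hypothetical, and the sample lines are abbreviated copies of the entries above (timestamp, message, and reconcile_id kept, other fields dropped).

```python
import re
from datetime import datetime

# Abbreviated copies of the log lines above: only the timestamp, the
# "Updated object" message, and the reconcile_id are kept.
SAMPLE = """\
2025-05-28T11:04:13.368719Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=853a9720607e2ff8c16d2a046445d80c
2025-05-28T11:04:13.405243Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=8e3e9d7d1d3d9d661a05a3f0a2482720
2025-05-28T11:04:13.448025Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=0c0a5092af425cb5530ac2753c206b1f
2025-05-28T11:04:13.499983Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=c0c64692fc94acbe2a5c063c7548b35c
2025-05-28T11:04:13.549389Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=a8a5df77bf4a1123370b0ac5ef2514c5
2025-05-28T11:04:13.591034Z  INFO reconciling object:reconcile:patch: controller::controllers::controller: Updated object reconcile_id=935bc68bed8f6c10ec68bea091a3e45d
"""

# Leading UTC timestamp, INFO level, and the hex reconcile_id field.
LINE_RE = re.compile(r"^(\S+Z)\s+INFO\b.*\breconcile_id=([0-9a-f]+)")


def parse_patch_events(lines):
    """Return (timestamp, reconcile_id) pairs for 'Updated object' log lines."""
    events = []
    for line in lines:
        match = LINE_RE.match(line)
        if match and "Updated object" in line:
            ts = datetime.strptime(match.group(1), "%Y-%m-%dT%H:%M:%S.%fZ")
            events.append((ts, match.group(2)))
    return events


def patches_per_second(events):
    """Average patch rate over the window spanned by the first and last event."""
    if len(events) < 2:
        return 0.0
    span = (events[-1][0] - events[0][0]).total_seconds()
    return (len(events) - 1) / span if span > 0 else float("inf")


if __name__ == "__main__":
    events = parse_patch_events(SAMPLE.splitlines())
    print(f"{len(events)} patches, {patches_per_second(events):.1f} per second")
```

Each entry carries a distinct reconcile_id, so every line is a separate reconcile run rather than a repeated log message. The six entries span about 0.22 seconds, which works out to roughly 22 patches per second.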

I did not see any changes in the Cluster resource itself.
The loop stops if CAAPF is disabled for a while (for example, by setting its replicas to 0), even with patchResource: true.

The side effect, other than CPU consumption, is that Fleet continuously creates and deletes the fleet-agent Bundle before it can be applied downstream.

Project status: Done