-
Notifications
You must be signed in to change notification settings - Fork 216
Disable metrics to avoid pod collisions #403
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disable metrics to avoid pod collisions #403
Conversation
Disable metrics to avoid pod collisions. This was enabled by default it https://github.com/kubernetes-sigs/controller-runtime/pull/510/files
/lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: spangenberg The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Is this a short-term fix while we get leader election or some such to avoid metrics collision? Doesn't seem like something we want to turn off completely in the long term. |
@wking we have never yet exposed the controller runtime metrics. You can check the ones we expose here https://github.com/openshift/machine-api-operator/tree/master/pkg/metrics so this is just putting us back to where we were. |
Dunno if it's related or not. /retest |
/retest |
looks like there are no workers in the upgrade job (as in, even mco reports 0 workers) |
Seems we are hitting aws limits
|
/retest |
1 similar comment
/retest |
/test all |
@enxebre: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
Upgrading is failing the same way for both the revert and this, so it’s either a separate problem that this PR doesn’t fix, or a more general problem that is somehow happening due to the RHCoS bump. I’m going to force merge this, and follow up to see if an rhcos issue is at play. |
openshift/installer#2455 still open, and we need openshift/installer#2459 first anyway, so I don't think RHCOS comes into this. |
Disable metrics to avoid pod collisions. This was enabled by default it https://github.com/kubernetes-sigs/controller-runtime/pull/510/files
See https://storage.googleapis.com/origin-ci-test/pr-logs/pull/openshift_cluster-api-provider-aws/261/pull-ci-openshift-cluster-api-provider-aws-master-e2e-aws-operator/727/artifacts/e2e-aws-operator/pods/openshift-machine-api_machine-api-controllers-6dd895f447-zglqq_nodelink-controller.log