Add support for frequent loops when provisioningrequest is encountered in last iteration #7271

Duke0404 · 2024-09-10T13:58:05Z

What type of PR is this?

/kind feature

What this PR does / why we need it:

Created lastProvisioningRequestSeenTime which gets updated whenever a provisioningrequest is encountered in an iteration, which is used when frequent loops is enabled to start next iteration without delay.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Support for frequent loops when ProvisioiningRequest is encountered in last loop.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

cc: @yaroslava-serdiuk @aleksandra-malinowska @kawych

k8s-ci-robot · 2024-09-10T13:58:15Z

Hi @Duke0404. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

aleksandra-malinowska · 2024-09-10T15:54:33Z

/ok-to-test

@kawych will you be able to review?

yaroslava-serdiuk · 2024-09-11T09:39:28Z

@aleksandra-malinowska I can review later today

yaroslava-serdiuk

Have a small comment, otherwise LGTM

cluster-autoscaler/main.go

cluster-autoscaler/config/autoscaling_options.go

cluster-autoscaler/processors/provreq/injector.go

cluster-autoscaler/loop/trigger.go

cluster-autoscaler/main.go

kawych · 2024-09-16T12:23:38Z

/lgtm

cluster-autoscaler/loop/trigger.go

kawych · 2024-09-24T08:16:17Z

cluster-autoscaler/loop/trigger.go

+	t.initialized = true
+}
+
+// provisioningRequestWasProcessed is used to check if provisioningRequestProcessTimeGetter is not nil and a provisioning request was processed in the last iteration


nit: pls remove the comments here and below. Comments are not required for private function and these functions are short and self-explanatory enough to not require extra insight.

Removed comment here but kept the comment on triggerNextIteration, because the behaviour of the function is not entirely self-explanatory imo.

cluster-autoscaler/loop/trigger.go

kawych · 2024-09-24T12:11:50Z

/lgtm

cluster-autoscaler/loop/trigger.go

aleksandra-malinowska · 2024-09-24T13:11:40Z

/cc @x13n can you approve? I see you reviewed #6589, this expands on it by adding recently processing a ProvisioningRequest as trigger.

kawych · 2024-09-26T09:38:57Z

/lgtm

cluster-autoscaler/loop/trigger.go

x13n · 2024-10-11T13:53:52Z

cluster-autoscaler/loop/trigger.go

-			klog.Infof("Autoscaler loop triggered immediately after a productive iteration")
-		}
-		return
+		t.triggerNextIteration("Autoscaler loop triggered immediately after a productive iteration")


Isn't an iteration that processed a provisioning request also "productive"?

Not necessarily, because a ProvisioningRequest can be marked as failed and we will still trigger the next loop immidiately

Sure, though my point is it maybe makes sense to be a bit more explicit about the reason, as "productive" can mean different things. Autoscaler loop triggered immediately after scale up/Autoscaler loop triggered immediately after scale down?

Added separate logs for scale up and scaled own.

x13n · 2024-10-11T13:57:50Z

cluster-autoscaler/loop/trigger.go

 	}
 }

+// Initialize initializes the LoopTrigger object by providing a pointer to the UnschedulablePodObserver
+func (t *LoopTrigger) Initialize(podObserver *UnschedulablePodObserver) {


What is the benefit of splitting initialization into 2 phases? This comes with additional complexity like the need to suddenly handle errors when waiting.

The aim was to make minimal changes to the args and return values of the buildAutoscaler function.

The trigger can only be initialized within the buildAutoscaler function as the injector is present there. @yaroslava-serdiuk felt that creating the injector in the run function and passing that to buildAutoscaler was not good because the injector is only a relevant component for CA if the user has ProvisioningRequests enabled and thus should not be a part of the buildAutoscaler args.

The podObserver can only be created within the run function as it requires the background context of the function. Therefore, having an initialize method which serves as a setter for the podObserver was deemed as the best solution.

Thanks for sharing the background!

I think the context can safely be created before the call to buildAutoscaler and passed there - you can then remove two-phase init and actually simplify podObserver creation a bit too by reusing autoscaling options available there.

Modified accordingly.

cluster-autoscaler/main.go

cluster-autoscaler/loop/trigger.go

…d in last iteration

x13n · 2024-10-15T09:55:15Z

/lgtm
/approve

k8s-ci-robot · 2024-10-15T09:55:25Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: Duke0404, x13n

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~cluster-autoscaler/OWNERS~~ [x13n]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Sep 10, 2024

k8s-ci-robot requested review from BigDarkClown and x13n September 10, 2024 13:58

k8s-ci-robot added area/cluster-autoscaler cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Sep 10, 2024

k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Sep 10, 2024

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Sep 10, 2024

yaroslava-serdiuk reviewed Sep 11, 2024

View reviewed changes

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

kawych suggested changes Sep 12, 2024

View reviewed changes

cluster-autoscaler/config/autoscaling_options.go Outdated Show resolved Hide resolved

Duke0404 force-pushed the freqloops branch 2 times, most recently from 2f629ac to 169b99c Compare September 12, 2024 17:12

k8s-ci-robot added the area/provider/aws Issues or PRs related to aws provider label Sep 12, 2024

Duke0404 force-pushed the freqloops branch from 169b99c to 7aaa4b1 Compare September 12, 2024 17:20

kawych reviewed Sep 13, 2024

View reviewed changes

Duke0404 force-pushed the freqloops branch 2 times, most recently from 7fe3b40 to fc3ca6b Compare September 15, 2024 22:07

yaroslava-serdiuk reviewed Sep 16, 2024

View reviewed changes

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

k8s-ci-robot assigned kawych Sep 16, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 16, 2024

Duke0404 force-pushed the freqloops branch from fc3ca6b to 3798a40 Compare September 16, 2024 21:17

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed lgtm "Looks good to me", indicates that a PR is ready to be merged. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Sep 16, 2024

Duke0404 force-pushed the freqloops branch from 3798a40 to 53115f8 Compare September 16, 2024 21:24

kawych reviewed Sep 23, 2024

View reviewed changes

cluster-autoscaler/loop/trigger.go Outdated Show resolved Hide resolved

cluster-autoscaler/loop/trigger.go Outdated Show resolved Hide resolved

Duke0404 force-pushed the freqloops branch 2 times, most recently from 9ba7e09 to 9cfa863 Compare September 23, 2024 14:58

kawych reviewed Sep 24, 2024

View reviewed changes

Duke0404 force-pushed the freqloops branch from 9cfa863 to 2b8f56c Compare September 24, 2024 09:19

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 24, 2024

aleksandra-malinowska reviewed Sep 24, 2024

View reviewed changes

cluster-autoscaler/loop/trigger.go Outdated Show resolved Hide resolved

Duke0404 force-pushed the freqloops branch from 2b8f56c to 7e2f1a7 Compare September 24, 2024 13:03

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 24, 2024

Duke0404 force-pushed the freqloops branch from 7e2f1a7 to a50cee0 Compare September 25, 2024 06:20

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 26, 2024

x13n requested changes Oct 11, 2024

View reviewed changes

Duke0404 force-pushed the freqloops branch from a50cee0 to ba92758 Compare October 12, 2024 14:14

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 12, 2024

Duke0404 requested a review from x13n October 12, 2024 17:08

Duke0404 force-pushed the freqloops branch from ba92758 to ec4cf79 Compare October 15, 2024 02:10

x13n reviewed Oct 15, 2024

View reviewed changes

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

cluster-autoscaler/loop/trigger.go Outdated Show resolved Hide resolved

Add support for frequent loops when provisioningrequest is encountere…

0a64fb0

…d in last iteration

Duke0404 force-pushed the freqloops branch from ec4cf79 to 0a64fb0 Compare October 15, 2024 09:43

k8s-ci-robot assigned x13n Oct 15, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 15, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 15, 2024

k8s-ci-robot merged commit bb94d27 into kubernetes:master Oct 15, 2024
6 checks passed

Duke0404 mentioned this pull request Oct 18, 2024

Revert "Add support for frequent loops when provisioningrequest is encountered in last iteration" #7410

Merged

Duke0404 mentioned this pull request Oct 26, 2024

Add support for frequent loops when provisioningrequest is encountered in last iteration #7418

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for frequent loops when provisioningrequest is encountered in last iteration #7271

Add support for frequent loops when provisioningrequest is encountered in last iteration #7271

Duke0404 commented Sep 10, 2024

k8s-ci-robot commented Sep 10, 2024

aleksandra-malinowska commented Sep 10, 2024

yaroslava-serdiuk commented Sep 11, 2024

yaroslava-serdiuk left a comment

kawych commented Sep 16, 2024

kawych Sep 24, 2024

Duke0404 Sep 24, 2024

kawych commented Sep 24, 2024

aleksandra-malinowska commented Sep 24, 2024

kawych commented Sep 26, 2024

x13n Oct 11, 2024

Duke0404 Oct 11, 2024

x13n Oct 14, 2024

Duke0404 Oct 15, 2024

x13n Oct 11, 2024

Duke0404 Oct 12, 2024

x13n Oct 14, 2024

Duke0404 Oct 15, 2024

x13n commented Oct 15, 2024

k8s-ci-robot commented Oct 15, 2024

Add support for frequent loops when provisioningrequest is encountered in last iteration #7271

Add support for frequent loops when provisioningrequest is encountered in last iteration #7271

Conversation

Duke0404 commented Sep 10, 2024

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

k8s-ci-robot commented Sep 10, 2024

aleksandra-malinowska commented Sep 10, 2024

yaroslava-serdiuk commented Sep 11, 2024

yaroslava-serdiuk left a comment

Choose a reason for hiding this comment

kawych commented Sep 16, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kawych commented Sep 24, 2024

aleksandra-malinowska commented Sep 24, 2024

kawych commented Sep 26, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

x13n commented Oct 15, 2024

k8s-ci-robot commented Oct 15, 2024