Cluster autoscaler scales up too much #43709
Comments
@mwielgus Please provide more information.
@bgrant0607 After a scale-up, we create "virtual" nodes inside CA so that we don't scale up again for the same pods. Once the node arrives, its virtual placeholder is deleted. And, if I'm correct, we have a bug here: a node with node condition ready=true but whose network is not yet ready is considered fully started but broken. Its virtual placeholder is gone, so we trigger a scale-up again. Seconds later the network comes up and the second scale-up turns out to be unnecessary. I'm currently testing a 10-line fix for this issue. After the fix is merged, I would like to bump the CA version in 1.6, as keeping the old code would impact all current CA users on GCE and GKE (and possibly on other cloud providers too).
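The placeholder behaviour described in the comment above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the Cluster Autoscaler code (which is Go). The `Node` type, both `is_node_starting_*` functions, and `nodes_to_add` are hypothetical names, and the "fixed" check is only an assumption about the general shape of the fix, not the merged change.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class Node:
    """Hypothetical, simplified view of a node's conditions."""
    name: str
    ready: bool                # node condition ready=true
    network_unavailable: bool  # network has not been set up yet


def is_node_starting_buggy(node: Node) -> bool:
    # Behaviour described in the thread: only readiness is checked, so a
    # ready node without a network counts as started (but broken).
    return not node.ready


def is_node_starting_fixed(node: Node) -> bool:
    # Assumed shape of the fix: a node whose network is not up yet is
    # still treated as starting, so its placeholder is kept.
    return not node.ready or node.network_unavailable


def nodes_to_add(pending_pods: int, pods_per_node: int,
                 nodes: Iterable[Node],
                 is_starting: Callable[[Node], bool]) -> int:
    """Return how many extra nodes a scale-up would request."""
    # Every node that is still starting keeps its virtual placeholder,
    # which reserves capacity for the pods that triggered the scale-up.
    placeholders = sum(1 for n in nodes if is_starting(n))
    uncovered = max(0, pending_pods - placeholders * pods_per_node)
    return -(-uncovered // pods_per_node)  # ceiling division


# A node that is ready but still waiting for its network.
node = Node("n1", ready=True, network_unavailable=True)
extra_buggy = nodes_to_add(10, 10, [node], is_node_starting_buggy)
extra_fixed = nodes_to_add(10, 10, [node], is_node_starting_fixed)
```

With the readiness-only check, the placeholder is dropped and a second node is requested (`extra_buggy` is 1). With the assumed network-aware check, the placeholder still covers the pending pods and no further scale-up is requested (`extra_fixed` is 0).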
Is this a regression in cluster autoscaler for 1.6, or did the same behavior exist in 1.5?
It is a regression. It doesn't always happen (it depends on network setup timing), but when it does it is quite confusing.
Automatic merge from submit-queue
Cluster-autoscaler: Fix isNodeStarting
Fix for: kubernetes/kubernetes#43709
cc: @MaciekPytel @fgrzadkowski
Automatic merge from submit-queue
Bump cluster autoscaler to 0.5.1
Fixes: #43709
**Release note**:
```release-note
With Cluster Autoscaler 0.5 the cluster will be autoscaled even if there are some unready or broken nodes. Moreover the status of CA is exposed in kube-system/cluster-autoscaler-status config map.
```
On some occasions Cluster Autoscaler may scale up the cluster twice, which makes the cluster larger than needed. The size returns to normal soon, but the user is forced to pay for the unneeded node for 10 min or 1 h, depending on the cloud provider.
cc: @MaciekPytel @fgrzadkowski @ethernetdan @enisoc