Add KEP for volume scheduling limits #942
Conversation
##### Alpha -> Beta Graduation

N/A (`AttachVolumeLimit` feature is already beta).
Since CSI migration is targeting Beta in next quarter, are you saying attach limit migration will be Beta by next quarter too?
`VolumeAttachLimit` feature is already beta in 1.14. We're changing the implementation underneath, hoping that we can keep it still beta.
Force-pushed from c8eb762 to eeeb7b1: Passed initial review round, now the KEP is complete.
- `ResourceName` is limited to 63 characters. We must prefix `ResourceName` with a unique string (such as `attachable-volumes-csi-<driver name>`) so it cannot collide with existing resources like `cpu` or `memory`. But `<driver name>` itself can be up to 63 characters long, so we ended up using SHA sums of the driver name to keep the `ResourceName` unique, which is not user-readable.
- CSI driver cannot share its limits with an in-tree volume plugin, e.g. when running pods with AWS EBS in-tree volumes and the `ebs.csi.aws.com` CSI driver on the same node.
- `node.status` size increases with each installed CSI driver. Node objects are big enough already.
Unsure if this is an issue anymore once the Node heartbeat has been moved to its own object.
Space-wise, it's a similar amount of space since we have to store an equivalent amount in the CSINode object.
Removed the bullet.
### Non-Goals

- Heterogeneous clusters, i.e. clusters where access to storage is limited only to some nodes. Existing `PV.spec.nodeAffinity` handling, not modified by this KEP, will filter out nodes that don't have access to the storage, so predicates changed in this KEP don't need to worry about storage topology and can be simpler.
Another idea was to schedule pods to nodes based on `CSINode`, which indicates what drivers are installed on the node, and maybe even health status in the future.
Added as non-goal.
* Kubelet will create the `CSINode` instance during initial node registration together with the `Node` object.
* Limits of each in-tree volume plugin will be added to `CSINode.status.allocatable`.
* Limit for in-tree volumes will be added by kubelet during `CSINode` creation. The name of the corresponding CSI driver will be used as the key in `CSINode.status.allocatable` and it will be discovered using the [CSI translation library](https://github.com/kubernetes/kubernetes/tree/master/staging/src/k8s.io/csi-translation-lib). If the library does not support migration of an in-tree volume plugin, the volume plugin has no limit.
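The in-tree-to-CSI name mapping the bullets above rely on can be sketched as follows. This is a minimal illustration, not the translation library's actual API: the map below is a hand-written subset, and `csiDriverName` is a hypothetical helper.

```go
package main

import "fmt"

// inTreeToCSI is an illustrative subset of the in-tree plugin -> CSI driver
// name mapping provided by the CSI translation library. Kubelet uses the CSI
// driver name as the key in CSINode allocatable even for in-tree volumes.
var inTreeToCSI = map[string]string{
	"kubernetes.io/aws-ebs": "ebs.csi.aws.com",
	"kubernetes.io/gce-pd":  "pd.csi.storage.gke.io",
}

// csiDriverName returns the CSI driver name for an in-tree plugin. The second
// result is false when the plugin is not migratable, in which case no limit
// is recorded for it in CSINode.
func csiDriverName(inTreePlugin string) (string, bool) {
	name, ok := inTreeToCSI[inTreePlugin]
	return name, ok
}

func main() {
	fmt.Println(csiDriverName("kubernetes.io/aws-ebs")) // ebs.csi.aws.com true
	_, migratable := csiDriverName("kubernetes.io/azure-disk")
	fmt.Println(migratable) // false: not in this illustrative subset
}
```

A plugin missing from the mapping simply gets no entry in `CSINode.status.allocatable`, which the scheduler interprets as "no limit".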
Do all in-tree plugins today that report limits also have a translation layer?
Azure is not listed as "migratable" in staging/src/k8s.io/csi-translation-lib/translate.go.
@andyzhangx, do you plan to add Azure migration support to 1.15 as alpha?
@jsafrane thanks for letting me know, I have filed the two issues below to add support in 1.15:
kubernetes/kubernetes#76684
kubernetes/kubernetes#76685
##### Removing a deprecated flag

- Announce deprecation and support policy of the existing flag
Can you be specific on what flag is being deprecated here? And when are we going to deprecate it?
I think this is a typo? There are no flags exposed by volume limit feature IIRC.
Whole section 'Removing a deprecated flag' is taken from KEP template. We don't add any flags, removed.
##### Beta -> GA Graduation

It must graduate together with CSI migration.
Do we want to consider adding a cache before GA to improve predicate performance?
IMO, we need to have the implementation fast enough even in beta. It may turn out to be impossible to speed up the code enough after beta to reach GA requirements.
I may have missed it, but what do we foresee being the slow part? The scheduler accessing the CSINode API object?
If I am not wrong, I think @msau42 meant caching of volumes in use on a node. Currently the number of volumes in use on a node is computed by iterating through running/in-flight pods and adding up the number of volumes they are using.
This can be cached, and the cache can be kept in sync with the decisions the scheduler makes. Caching will potentially save some CPU cycles but IMO should not be targeted for this release. It can get fairly tricky pretty quickly.
So for beta, our goal is same performance as the current implementation. Before we go to GA, do we want to have a goal of improving performance by caching these in-use counts per node?
added a line.
* CSI driver is used both for in-tree and CSI PVs (if the driver is installed).
* Kubelet creates `CSINode` during node registration with no limit for the volume plugin / CSI driver.
* When a CSI driver is registered, kubelet gets its volume limit through CSI and updates `CSINode` with the new limit.
* During the period when no CSI driver is registered and `CSINode` exists, there is "no limit" for the CSI driver and the scheduler can put pods there! See non-goals!
Or can we do something like set limit to 0 so that no pods can be scheduled there and wait for the CSI driver to be installed?
If we have no limit then more pods could get scheduled but will get stuck in attach and require user intervention to retry
Regardless when migration is on and driver is not installed yet, pods won't be able to come up, so maybe it's better to not schedule the pod to that node.
This was fixed. After the deprecation period, we will stop scheduling pods to a node that does not have the driver in `CSINode.Spec.Drivers`, or if the `CSINode` object itself is missing.
| `Volumes` | Description |
| --------- | ------------ |
| key is missing in `CSINode.status.allocatable` | there is no limit of volumes on the node* |
| 0 | plugin / CSI driver exists and has zero limit, i.e. can attach no volumes |
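The semantics in the table above can be sketched as a small helper. This is a hypothetical sketch, not the actual scheduler predicate (`volumeFits` and its parameters are illustrative names):

```go
package main

import "fmt"

// volumeFits reports whether one more volume of the given driver fits on a
// node, following the table above: a missing key means "no limit", and a
// zero limit means the driver can attach no volumes. allocatable maps
// driver name -> volume limit; inUse is the count of that driver's volumes
// already on the node.
func volumeFits(allocatable map[string]int32, driver string, inUse int32) bool {
	limit, ok := allocatable[driver]
	if !ok {
		return true // key missing: no limit on this node
	}
	return inUse < limit // limit 0: nothing ever fits
}

func main() {
	alloc := map[string]int32{"ebs.csi.aws.com": 39, "broken.example.com": 0}
	fmt.Println(volumeFits(alloc, "ebs.csi.aws.com", 38))        // true
	fmt.Println(volumeFits(alloc, "broken.example.com", 0))      // false
	fmt.Println(volumeFits(alloc, "pd.csi.storage.gke.io", 100)) // true: missing key
}
```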
Is it possible to set 0? CSI spec says that a 0 value is undefined and CO can interpret it however it wants. For backwards compatibility I would expect 0 value from CSI to mean unlimited
Similar question comes up if in the future we add capacity limits. How to tell the difference between 0 and undefined? Do we need the fields to be pointers?
While discussing the issue of stopping pods from getting scheduled on a node if the driver is not installed on it, I thought it might be best to handle this as a separate issue. We can indeed set the value to 0 when the driver is not installed, but this applies only to drivers that support migration.
Since we won't be creating a `CSINode` object for in-tree drivers which don't support migration, the solution of setting the volume limit to 0 might not be good enough. We may have to go one step further and prevent scheduling of volumes if the `CSINode` object for a driver does not exist on a node. If yes, that sounds like something that is out of scope for the volume limit work.
I'm only concerned about drivers that are being migrated. If we set the limit to "unlimited", then during this (hopefully) brief moment, Pods that are using in-tree drivers can fail attaching, and require manual intervention to fix. One of the goals of migration is that users shouldn't notice that we switched.
> Similar question comes up if in the future we add capacity limits. How to tell the difference between 0 and undefined?

If a driver is installed, it is in `CSINode.Spec.Drivers`. Then the driver either has its limit in `CSINode.Status` or it's missing there, and that means there is no limit.
IMO, scheduling based on availability of a driver on a node is a separate issue and a separate predicate. It should be fairly trivial to implement.
It could be a separate issue but we also need to decide if that should block migration ga or not. @davidz627
Scheduling based on availability of a driver on a node is a blocker for migration GA since not having that would be a regression in volume behavior.
This was addressed by @jsafrane in a recent commit. Scheduler will not schedule pods to a node if it does not have a driver entry in the `CSINode` object.
// allocatable is a list of volume limits for each volume plugin and CSI driver on the node.
// +patchMergeKey=name
// +patchStrategy=merge
Allocatable []VolumeLimits `json:"allocatable" patchStrategy:"merge" patchMergeKey:"name" protobuf:"bytes,1,rep,name=allocatable"`
Should this be a `CSINodeDriverStatus` to mirror the spec more closely?
Is it odd that a status for a driver may exist before the spec? Although in past design iterations for migration we had confusing behavior where the driver spec would be partially filled in before the driver was installed.
I think we want to avoid having a partially filled spec. That leaves the style in this design or having a status without a spec as the only options.
For the current design that means if we ever add a "real status field" we have to be ok with the pretty weird schema of:

```yaml
apiVersion: storage.k8s.io/v1beta1
kind: CSINode
metadata:
  name: ip-172-18-4-112.ec2.internal
spec:
status:
  drivers:
  - name: ebs.csi.aws.com
    fieldA: valueA
  allocatable:
  # AWS node can attach max. 40 volumes, 1 is reserved for the system
  - name: ebs.csi.aws.com
    volumes: 39
```

where drivers are duplicated between the allocatable and drivers fields.
What is the API guidance on having a `Status` without a corresponding `Spec`? Is that even weirder? @liggitt
I am not sure why having a `Status` field without a corresponding `Spec` field is weird. The `Node` object is a good example, and the `CSINode` object in fact mirrors properties of a node.
We may end up with two arrays with the same keys and that would indeed be weird. We could do:

```yaml
status:
  drivers:
  - name: ebs.csi.aws.com
    fieldXYZ: x
    allocatable:
      volumes: 39
      size: "64 TB"
```

But that would complicate sharing of limits between drivers, if we ever implement it.
In a meeting we decided to move `allocatable` into `CSINode.spec.drivers[xyz]`, because it does not change over time.
mostly looks good to me! One important comment on the CSINode Status design though
Thanks, @jsafrane!
- Users can use PVs both with in-tree volume plugins and CSI, and they will share their limits. There is only one scheduler predicate that handles both kinds of volumes.

- Existing predicates for in-tree volumes `MaxEBSVolumeCount`, `MaxGCEPDVolumeCount`, `MaxAzureDiskVolumeCount` and `MaxCinderVolumeCount` are removed (with a deprecation period).
- When both a deprecated in-tree predicate and the CSI predicate are enabled, only one of them does useful work and the other is a NOOP to save CPU.
Shouldn't this be considered an invalid config? Do we need to support it in the transition phase for backward compatibility? Also, later in the doc it is mentioned that CSI predicate will be used in this case.
> Shouldn't this be considered an invalid config?

Ideally yes, but `MaxEBSVolumeCount`, `MaxGCEPDVolumeCount`, `MaxAzureDiskVolumeCount`, `MaxCinderVolumeCount` and `MaxCSIVolumeCountPred` are all existing predicates and right now they can all be enabled together. I can make `Max<cloud>VolumeCount` and `MaxCSIVolumeCountPred` mutually exclusive and it would simplify the implementation a lot. Is that allowed? IMO it could break clusters that have all the predicates enabled.

> Also, later in the doc it is mentioned that CSI predicate will be used in this case.

Clarified a bit ("`MaxCSIVolumeCountPred` does useful work...").
| 0 | plugin / CSI driver exists and has zero limit, i.e. can attach no volumes |
| X>0 | plugin / CSI driver exists and can attach X volumes (where X > 0) |
| X<0 | negative values are blocked by validation |
| Driver is missing in `CSINode.status.allocatable` | there is no limit of volumes on the node* |
When does this happen, and what does it mean that a "driver" does not exist in `CSINode.status.allocatable`?
> what does it mean that "driver" does not exist in `CSINode.status.allocatable`

`CSINode.status.allocatable` is a `map[DriverName]VolumeLimits`. If a driver is missing there, it can mean two things:

- Driver is not installed on the node (or it is installed and informers are slow to propagate it to all components).
- Driver is installed on the node and has no limits.

In both cases, the scheduler expects that there is no limit.
Differentiation between these two cases is discussed here: #942 (comment)
// allocatable is a list of volume limits for each volume plugin and CSI driver on the node.
// +patchMergeKey=name
// +patchStrategy=merge
Allocatable []VolumeLimits `json:"allocatable" patchStrategy:"merge" patchMergeKey:"name" protobuf:"bytes,1,rep,name=allocatable"`
Can't this be a `map[string]int32` that maps a CSI driver name to its volume limit?
We want to add total volume size later. We can add either two maps: `map[string]int32` for the count of volumes and `map[string]Quantity` for the total size. Or we can have maps of structs as it is now. Both will work; not sure what the preference is here. Also see discussion at #942 (comment)
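The trade-off between a plain `map[string]int32` and a list of structs can be sketched in a few lines. This is an illustrative sketch only; the field names mirror the snippet quoted above but the helper is hypothetical:

```go
package main

import "fmt"

// VolumeLimits mirrors the struct form discussed above (field names are
// illustrative). Keeping a struct per driver lets new limits, such as a
// total capacity, be added later without introducing a second parallel map.
type VolumeLimits struct {
	Name    string // CSI driver name, used as the merge key
	Volumes int32  // max number of attachable volumes
	// A Size field for total capacity could be added here later.
}

// toMap indexes the API list by driver name, which is how a scheduler-side
// consumer might look up a driver's limits.
func toMap(limits []VolumeLimits) map[string]VolumeLimits {
	m := make(map[string]VolumeLimits, len(limits))
	for _, l := range limits {
		m[l.Name] = l
	}
	return m
}

func main() {
	limits := []VolumeLimits{{Name: "ebs.csi.aws.com", Volumes: 39}}
	fmt.Println(toMap(limits)["ebs.csi.aws.com"].Volumes) // 39
}
```

With two parallel maps, every new kind of limit would add another map keyed by the same driver names; the struct form keeps all of a driver's limits in one place.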
/label api-review
* Especially, `kubelet --kube-reserved` or `--system-reserved` cannot be used to "reserve" volumes for kubelet or the OS. It is not possible with the existing kubelet and this KEP does not change it. We expect that CSI drivers will have configuration options / cmdline arguments to reserve some volumes and they will report their limit already reduced by that reserved amount.
* Scheduler will respect `Node.status.allocatable` and `Node.status.capacity` for CSI volumes if the `CSINode` object is not available or is missing an entry in `CSINode.spec.drivers[xyz].allocatable` during a deprecation period, but kubelet will stop populating `Node.status.allocatable` and `Node.status.capacity` for CSI volumes.
* After the deprecation period for CSI volumes, limits coming from `Node.status.allocatable` and `Node.status.capacity` will be completely ignored by the scheduler.
* After the deprecation period, the scheduler won't schedule any pods that use CSI volumes to a node with a missing `CSINode` instance or a missing driver in `CSINode.Spec.Drivers`. It is expected that this happens only during node registration, when `Node` exists and `CSINode` doesn't, and it self-heals quickly.
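The deprecation-period fallback described above can be sketched as a small helper. This is a hypothetical sketch of the behavior, not the actual scheduler code; `effectiveLimit` and its parameters are illustrative names:

```go
package main

import "fmt"

// effectiveLimit returns the volume limit the scheduler should use for a CSI
// driver during the deprecation period: prefer the CSINode entry and fall
// back to the legacy Node.status.allocatable value. The bool result is false
// when neither source has an entry, meaning "no limit".
func effectiveLimit(csiNodeLimits, nodeLimits map[string]int32, driver string) (int32, bool) {
	if limit, ok := csiNodeLimits[driver]; ok {
		return limit, true
	}
	if limit, ok := nodeLimits[driver]; ok {
		return limit, true // legacy path, ignored after the deprecation period
	}
	return 0, false
}

func main() {
	csi := map[string]int32{"ebs.csi.aws.com": 39}
	legacy := map[string]int32{"ebs.csi.aws.com": 40, "pd.csi.storage.gke.io": 16}
	fmt.Println(effectiveLimit(csi, legacy, "ebs.csi.aws.com"))       // 39 true
	fmt.Println(effectiveLimit(csi, legacy, "pd.csi.storage.gke.io")) // 16 true
}
```

After the deprecation period, the second lookup would simply be removed, leaving `CSINode` as the only source of limits.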
IMO, the whole sentence "It is expected that it happens only during node registration when `Node` exists and `CSINode` doesn't and it self-heals quickly" is obsolete. There is no self-heal, because there are no limits in the `CSINode` created at kubelet startup. The scheduler needs to wait for a driver to get installed there.
removed.
Overall this seems OK. Is it "better" in CSINode or just simpler? E.g. if we added a volume-specific status block with volume-specific allocatable to node, would we prefer that for any reason?
- Maximum capacity per node. Some cloud environments limit both the number of attached volumes (covered in this KEP) and the total capacity of attached volumes (not covered in this KEP). For example, this KEP will ensure that the scheduler puts max. 128 GCE PD volumes on a [typical GCE node](https://cloud.google.com/compute/docs/machine-types#predefined_machine_types), but it won't ensure that the total capacity of the volumes is less than 64 TB.

- Volume limits do not yet integrate with the cluster autoscaler if all nodes in the cluster are running at maximum volume limits.
@mwielgus In case you have not seen this one
## Proposal

* Track volume limits for CSI drivers in `CSINode` objects instead of `Node`. The limit in `CSINode` is used instead of the limit coming from the `Node` object whenever it is available for the same in-tree volume type. This means the scheduler will always try to translate an in-tree driver name to a CSI driver name whenever the `CSINode` object has the same in-tree volume type (even if migration is off).
Does scheduler already consider CSINode anywhere? Would be worth saying whether this exists or is net-new
Scheduler does not consider the `CSINode` object anywhere currently. I can perhaps clarify it some more.
Fixed.
@thockin it is a bit simpler to use
@thockin okay, addressed open comments. PTAL. :-)
editor: TBD
creation-date: 2019-04-08
last-updated: 2019-04-08
status: provisional
This should be changed to implementable.
/hold for changing the status bit
update the spec to not create CSINode object by default for in-tree volume types
@msau42 updated as implementable. Also updated reviewers and approvers to be more accurate. Added myself as one of the authors. Can you please re-lgtm?
/lgtm
/hold cancel
[APPROVALNOTIFIER] This PR is APPROVED Approval requirements bypassed by manually added approval. This pull-request has been approved by: jsafrane, msau42, thockin The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
This KEP changes existing scheduler predicates for volume limits from using `Node.status.allocatable` to `CSINode.spec.drivers["xyz"]` for CSI drivers and in-tree volumes (in some cases).

Feature: #554