
Conversation

@JohnStrunk
Member

Describe what this PR does
This PR adds a design for StorageClass parameter fields.

Is there anything that requires special attention?
My goal for this PR is to agree on where we want to end up with the fields in the StorageClass. Since the StorageClass object is the main interface for cluster admins to define the types of storage that should be provided to users, the parameters that we choose to expose and how we do it will significantly affect the admin's UX.

Related issues:
none.

Signed-off-by: John Strunk <jstrunk@redhat.com>
@JohnStrunk JohnStrunk requested a review from humblec September 11, 2018 23:10
@centos-ci
Collaborator

Can one of the admins verify this patch?

@humblec
Contributor

humblec commented Sep 12, 2018

Thanks @JohnStrunk for this. At the very minimum I would like to pass
glusterURL, glusterUser, and glusterSecret via the SC; we can definitely add many more, and that is the plan. I can capture the others later.

@humblec
Contributor

humblec commented Sep 12, 2018

I can work on this code implementation. Assigning to myself.
/assign @humblec

@JohnStrunk
Member Author

@humblec I'm removing you from the PR so you can pick up the related issues for implementation (which is what I think you meant)... Probably #46

@JohnStrunk
Member Author

@centos-ci ok to test

Member

@amarts amarts left a comment

LGTM

From glusterfs, we mainly need 'volume type' and 'volume options', and since the example covers both of these, it should be good to get in for now!

@JohnStrunk JohnStrunk removed this from the GCS-alpha1 milestone Oct 3, 2018
@JohnStrunk
Member Author

Based on @humblec's feedback, I have rechecked the allowed syntax, and my proposed parameters format isn't going to work, as the field must be a plain map<string,string>. I'll revise this to flatten the fields.
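
For illustration, a flattened version of parameters like maxBrickSize and capacityReservation would look roughly like this (same values as in the doc, just plain string keys and values as required by map<string,string>):

parameters:
  # All values must be plain strings.
  maxBrickSize: "100Gi"
  capacityReservation: "1.0"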

Contributor

@jarrpa jarrpa left a comment

Left a few questions and comments, with more waiting until the layout gets flattened.

A general comment: I see that there are a number of parameters from the current (in-tree) provisioner that are not represented in this proposal. I haven't compared them with scrutiny, but have they been consciously omitted, possibly due to no longer being relevant?

parameters:
  # Maximum size of an individual brick. Volumes that exceed this will be
  # created as distributed-* volumes.
  maxBrickSize: 100Gi
Contributor

Is this actually a feature of Gluster I'm not aware of? Or is this logic yet-to-be-implemented in the CSI driver?

Member Author

This will interact w/ GD2 IVP to limit brick size so that recovery and capacity balancing are easier. I thought Heketi had something similar, though not used quite as intended here... Perhaps I'm just thinking of vol resize.

Contributor

Ah, okay. Might be something that could be enforced within the driver, for Heketi anyway.

  # PV capacity * capacityReservation will be reserved for use by this
  # volume. The reservation should account for snaps, clones, compression,
  # and deduplication. Float >= 0
  capacityReservation: "1.0"
Contributor

This name does not really reflect its function. Since we're explicitly using LVM, what about something like thinpoolFactor?

Member Author

I'm trying to make the function reflect the name. 😁
The idea really is to "reserve" this brick capacity and not allocate it for other volumes. This is more than just setting the thinp size. It needs to account for VDO as well.
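
To make the arithmetic concrete, a purely illustrative example (the "2.0" value is made up):

# 10Gi PV requested with capacityReservation "2.0":
#   reserved brick capacity = 10Gi * 2.0 = 20Gi
# The extra 10Gi is held back for snaps, clones, and VDO overhead rather
# than being handed out to other volumes.
capacityReservation: "2.0"
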
My big struggle here is that it's pretty straightforward for snapshots, but I'm not sure what to do about the "factor" when cloning. If I have an initial volume w/ reservation=10x and I clone it, presumably, the 10x accounted for the clone... What should the reservation parameter of the clone be? And does the answer change if I'm cloning a single image multiple times vs. cloning a clone of a clone of a...?

Due to the uncertainty here, I'm tempted to drop it for now, but w/o some sort of reservation, clone and snap are pretty useless.

Side note: We're also evaluating removal of LVM due to performance & scale issues.

@JohnStrunk
Member Author

A general comment: I see that there are a number of parameters from the current (in-tree) provisioner that are not represented in this proposal. I haven't compared them with scrutiny, but have they been consciously omitted, possibly due to no longer being relevant?

I was working from: https://kubernetes.io/docs/concepts/storage/storage-classes/#glusterfs

  • No longer needed: resturl, restauthenabled, restuser, restuserkey, secretName, secretNamespace, clusterid. These all move to the CSI driver configuration.
  • volumetype is one I'm trying to split out to make it more extensible and less cryptic, and to allow us to bring in arbiter, thin arbiter, and halo (see the sketch after this list)
  • That leaves gidMin & gidMax which I haven't had time to track down. I find it odd that we're the only driver with those. If anyone has links that discuss these, their importance, and the alternatives, I'd appreciate it.
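
As a rough sketch of the split (the key names below are placeholders for discussion, not final):

# in-tree today:
#   volumetype: "replicate:3"
# split into explicit, flat keys, e.g.:
replication: "3"
arbiterType: "thin"   # placeholder for selecting arbiter / thin arbiter
haloEnabled: "true"   # placeholder for halo replication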

@jarrpa
Contributor

jarrpa commented Oct 25, 2018

A general comment: I see that there are a number of parameters from the current (in-tree) provisioner that are not represented in this proposal. I haven't compared them with scrutiny, but have they been consciously omitted, possibly due to no longer being relevant?

I was working from: https://kubernetes.io/docs/concepts/storage/storage-classes/#glusterfs
* No longer needed: resturl, restauthenabled, restuser, restuserkey, secretName, secretNamespace, clusterid. These all move to the CSI driver configuration.

This will not be true if we plan on ever supporting more than one Gluster cluster per CSI driver. As we currently promote deploying at least two Gluster clusters in a "full" OpenShift installation, I would think it'd be ridiculous to have two copies of the same driver running.

* volumetype is one I'm trying to split out to make more extensible, less cryptic, and allow us to bring in arbiter, thin arbiter, halo

That would definitely be appreciated!

* That leaves gidMin & gidMax which I haven't had time to track down. I find it odd that we're the only driver with those. If anyone has links that discuss these, their importance, and the alternatives, I'd appreciate it.

I've always been confused by this, as well. I'd also like to know more about why this is.

@JohnStrunk
Member Author

This will not be true if we plan on ever supporting more than one Gluster cluster per CSI driver. As we currently promote deploying at least two Gluster clusters in a "full" OpenShift installation, I would think it'd be ridiculous to have two copies of the same driver running.

I don't have any plans to support a single driver accessing multiple gluster clusters. The reasons are:

  • The version of the CSI driver must be coordinated w/ the server versions, and adding multiple clusters to the mix complicates upgrade sequencing
  • The operator will be deploying the CSI driver (or triggering it if we use the CSI operator). While we could put in checks to avoid multiple deployments and skip removal if there are remaining clusters, this seems like added complexity (for dev and test) over having a single set of procedures that work the same way every time.
  • By moving this contact & authentication config to the CSI config (set up via the operator), we avoid burdening the admin with it and avoid a potential source of misconfiguration that would have to be tracked down. It just works out of the box, properly authenticated and secured.
  • The resources consumed by the CSI driver are small. It's a rather minimal golang executable w/ low CPU and memory requirements. The gains in idiot-proofing seem worth a couple of extra pods.

@Madhu-1
Member

Madhu-1 commented Oct 26, 2018

as this needs some more discussion, moving out of GCS/0.2

@Madhu-1 Madhu-1 removed the GCS/0.2 label Oct 26, 2018
@jarrpa
Contributor

jarrpa commented Oct 26, 2018

I don't have any plans to support a single driver accessing multiple gluster clusters. The reasons are:

* The version of the CSI driver must be coordinated w/ the server versions, and adding multiple clusters to the mix complicates upgrade sequencing

The only coordination is FUSE client version >= Server version. The Operator already controls both, such that all Gluster clusters should have the same version / container images. On Operator update, if it comes with a new default image for the server containers, we already have plans to disallow further Operator updates until the full Gluster system reaches quiescence.

* The operator will be deploying the CSI driver (or triggering it if we use the CSI operator). While we could put in checks to avoid multiple deployments and skip removal if there are remaining clusters, this seems like added complexity (for dev and test) over having a single set of procedures that work the same way every time.

Complexity worth the feature, in my opinion. :)

* By moving this contact & authentication config to the CSI config (set up via the operator), we avoid burdening the admin with it and avoid a potential source of misconfiguration that would have to be tracked down. It just works out of the box, properly authenticated and secured.

We could still move that to the Operator by having it take care of configuring the StorageClasses with the appropriate credentials automatically.

* The resources consumed by the CSI driver are small. It's a rather minimal golang executable w/ low CPU and memory requirements. The gains in idiot-poofing seem worth a couple extra pods.

While the CPU and memory requirements are small, we do still have to consider that the pods themselves count toward the maximum number of pods per node. This means (with the current example deployments) that every Gluster cluster adds at least two controller pods to the Kube/OCP cluster and at least one node pod on (typically) EVERY schedulable node in the Kube/OCP cluster.

Signed-off-by: John Strunk <jstrunk@redhat.com>
@jarrpa
Contributor

jarrpa commented Oct 29, 2018

The current PR LGTM. I still have my concerns about continuing to rely on a single cluster per driver model, but that can be addressed in another PR.

@JohnStrunk
Member Author

Any other reviews?

@humblec
Contributor

humblec commented Nov 2, 2018

@JohnStrunk, as in comment #30 (comment), I agree with @jarrpa. IMO, we should support multiple clusters with a single CSI driver. To that end, I would like to have a glusterCluster param in the SC. I still don't see what's wrong with having this param as an optional one. With it, the admin also has the possibility of bringing up different SCs for different clusters, say one for test and another for dev, using the same CSI driver. We can't assume the operator is the ONLY, or a mandatory, way to run the CSI driver; the gluster CSI driver should be capable of running on its own and be flexible in what it supports. A driver init() is always required if we allow this setting only via a deployment ENV var.
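
Something like this, for illustration (the provisioner name and endpoint below are made up; only the glusterCluster key is the actual proposal):

apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: glusterfs-dev
provisioner: org.gluster.glusterfs          # illustrative driver name
parameters:
  glusterCluster: "https://gd2-dev.example.com:24007"   # optional; falls back to the driver's own config when omitted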

@obnoxxx thoughts?

@humblec
Contributor

humblec commented Nov 2, 2018

@JohnStrunk one other high-level question: how are you planning to match these allowed SC params to GD2 volumecreate fields?

@JohnStrunk
Member Author

We can have optional fields for specifying cluster details, but that's not how GCS is going to be deployed. If someone wants to do it all manually (e.g., deploy CSI w/o the rest of the stack), then sure, specifying via the SC is fine. Upgrade, versioning, and authentication are on them.

@JohnStrunk one other high level question, how are you planning to match these allowed SC params to GD2 volumecreate fields ?

Likely, some of these options cannot currently be expressed to GD2 IVP. Implementation here will need to be coordinated w/ the GD2 team. I assume you're referring to things such as maxBrickSize, capacityReservation, and the zone fields. IVP will need to be enhanced to implement these options.

@humblec
Contributor

humblec commented Nov 7, 2018

Likely, some of these options cannot currently be expressed to GD2 IVP. Implementation here will need to be coordinated w/ the GD2 team. I assume you're referring to things such as maxBrickSize, capacityReservation, and the zone fields. IVP will need to be enhanced to implement these options.

@JohnStrunk Most of the params have issues coordinating with the GD2 API. For example, if we parse arbitertype as in this design and in this code:
humblec@aa1b771

How do we pass arbitertype to GD2?

@JohnStrunk
Member Author

It appears thin arbiter is designated via a flag. See: gluster/glusterd2#702
If IVP doesn't expose a sufficient interface, please open an issue against gd2.

@JohnStrunk
Member Author

In keeping w/ #154:

  • the language around arbiters needs to be updated such that arbiterZones only applies to normal arbiter volumes, not thin arbiter
  • the arbiterPath option needs to be documented (rough sketch below)
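
A rough sketch of what the documented form might look like (values and the arbiterPath semantics below are illustrative only, pending the doc update):

arbiterType: "arbiter"        # or "thin"
arbiterZones: "zone2,zone3"   # only honored for normal arbiter volumes, not thin arbiter
arbiterPath: "/mnt/arbiter"   # illustrative; exact meaning to be documented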

@joejulian joejulian closed this Mar 30, 2023