proposal: Change application identifier to allow app names longer than 63 chars #6425

jannfis · 2021-06-08T08:28:27Z

This is a proposal to change the way application resources are identified within Argo CD to allow application names longer than 63 characters.

Signed-off-by: jannfis jann@mistrust.net

Note on DCO:

If the DCO action in the integration test fails, one or more of your commits are not signed off. Please click on the Details link next to the DCO action for instructions on how to resolve this.

Checklist:

Signed-off-by: jannfis <jann@mistrust.net>

codecov · 2021-06-08T08:38:40Z

Codecov Report

Merging #6425 (1fa7b42) into master (12c95d6) will increase coverage by 0.02%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #6425      +/-   ##
==========================================
+ Coverage   41.01%   41.03%   +0.02%     
==========================================
  Files         152      158       +6     
  Lines       20088    21299    +1211     
==========================================
+ Hits         8239     8740     +501     
- Misses      10716    11306     +590     
- Partials     1133     1253     +120

Impacted Files	Coverage Δ
util/db/repository.go	`40.00% <0.00%> (-22.74%)`	⬇️
util/cache/cache.go	`56.96% <0.00%> (-13.26%)`	⬇️
util/git/workaround.go	`46.87% <0.00%> (-6.07%)`	⬇️
util/webhook/webhook.go	`62.36% <0.00%> (-3.73%)`	⬇️
server/account/account.go	`65.89% <0.00%> (-3.28%)`	⬇️
reposerver/cache/cache.go	`64.51% <0.00%> (-2.93%)`	⬇️
server/project/project.go	`52.69% <0.00%> (-2.78%)`	⬇️
util/git/git.go	`75.75% <0.00%> (-2.37%)`	⬇️
util/db/cluster.go	`60.00% <0.00%> (-2.31%)`	⬇️
util/jwt/jwt.go	`60.00% <0.00%> (-1.91%)`	⬇️
... and 71 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 12c95d6...1fa7b42. Read the comment docs.

Signed-off-by: jannfis <jann@mistrust.net>

jgwest · 2021-06-08T12:37:21Z

docs/proposals/application-name-identifier.md

+
+On a rollback to a previous Argo CD version, this change would be reverted
+and the resource would look like the first shown example above.
+


How would Argo CD know whether a resource's app.kubernetes.io/instance was applied using the old system or the new system (the one described in this proposal)?

For example:

A cluster is running with the old version of Argo CD

All the application resources are labelled using the old system

Argo CD is upgraded

The new Argo CD looks at all the existing app.kubernetes.io/instance fields of the existing resources, discovers that they don't match any expected hash (and thus don't belong to any Application), and ignores them (recreating new Application resources).

I think we need the ability to look at the app.kubernetes.io/instance field of a resource and know which system it was created with.

One option we could use is look at the app.kubernetes.io/instance value and determine if it is a 32 character hexadecimal value, but this is not foolproof (users may be using Applications with names that are 32-byte hexadecimal values).

That's a good point you raise there.

I haven't yet verified, but I think it would be similar to the behavior if you rename an application, e.g. by deleting an Application and re-creating it with a different name, but same parameters (source + destination).

Argo CD would just go and overwrite what's in app.kubernetes.io/instance - at least as long as another application with the previous name does not exist, in which case Argo CD would issue a Shared Resource Warning and prevent the overwrite.

I think if an Argo CD instance encounters a resource with a app.kubernetes.io/instance value that it doesn't recognize (eg it does not correspond to a hash of a known application name+install id value), then Argo CD must assume that that resource is owned by another Argo CD instance (the goal of 'allow multiple Argo CD instances to manage resources on the same cluster').

So, presuming that Argo CD encounters a resource it thinks is owned by someone else, and it has the same name/namespace as a resouce managed by an Application managed by the current Argo CD instance, do we:

Take ownership of it with the current Argo CD instance

Leave it alone

Probably 2), and report an error ('error: Argo CD Application would overwrite a resource that may be owned by another Argo CD instance').

It might be worthwhile to document the algorithm that Argo CD will use to move existing resources from the old style to the new style, in the upgrade scenario.

I'm guessing it would be:

If we encounter a resource that contains a app.kubernetes.io/instance that is not a 32/64 character hexadecimal value, AND it matches the name field of an existing Argo CD Application, then convert it to the new system (ensure the app.kubernetes.io/instance field is updated)

As you said, this might already be handled by the rename-style behaviour.

Another option would be to delete the resource, and hope that Argo CD recreates it.

If we encounter a resource that contains a app.kubernetes.io/instance that is a 32/64 character hexadecimal value, leave it alone. (Assume it's a valid instance id from another Argo CD instance).

Unless it has the same Kubernetes resource name as a resource that this Argo CD instance is trying to create, in which case report the above error ('error: would overwrite a resource that may be owned...')

Else, either delete or leave it alone (orphan it).
(in very rare cases this might orphan or delete old resources, but it's probably fine.)

jgwest · 2021-06-08T12:40:54Z

docs/proposals/application-name-identifier.md

+  hash algorithms on Stack Exchange that suggests that `FNV-1a` might be more
+  than sufficient for our use-cases:
+  https://softwareengineering.stackexchange.com/a/145633
+


Re: hash choice -

If we're going to go with SHA-based algorithm, I recommend SHA-256, as SHA-1 is considered broken. It doesn't matter for this use case (we're not using the cryptographic properties of a hash, as you note later), but we will constantly get people telling us it's broken and asking us when we will move to another algorithm 😄 .

Also, correct me if I'm wrong, but I don't think hash performance is an issue: for each Application, we only need to calculate this value once, and then store it in memory (eg memoize the result of the hash function for each application).

You have a good argument there. I favored the SHA-1 variant for producing the shorter hash values, but I think you are right in that we may want to consider usage of SHA-256 instead. Although we do not need to be secure in cryptographic terms here, probably SHA-1 will become utterly deprecated soon enough (maybe up to the point of being removed from any standard libraries?), and even Git will move away from it. So yeah, it leaves a "bad aftertaste" kind of thing :)

Thanks for bringing it up, I will change it in the proposal accordingly.

Looks like we've got two similar discussion threads in this PR. Copying comments from another thread here.

As I understand there are trying to solve three issues:

allow app with long names

support two argocd managing apps in the same cluster

avoid argocd confusion that happened when something else adds app.kubernetes.io/instance label

Also, I think additional requirements are: try to keep it human-friendly and try avoid breaking changes.

To solve I want to propose:

use annotations instead of labels for tracking to solve long names issue

use three annotations (e.g. argo-cd.argoproj.io/application, argo-cd.argoproj.io/resourceKey, ``argo-cd.argoproj.io/installation` ) that includes app name, resource key and argocd installation id

allow the user to switch to annotation-based tracking when necessary using new app.spec.tracking field. (this comment tries to justify why new way of tracking should be optional)

So main changes in my proposal:

use annotations + human-readable strings instead of hash + labels.

new way of tracking should be explicitly enabled per app

@jannfis , @jgwest what do you think?

I think we should not provide two different methods for tracking applications, as it will complicate things greatly and probably will cause some confusion among users.

What speaks against using the label for tracking and the annotations for human-readable identifiers together? So the annotations will only serve informational purposes, and the label is used to select and identify resources.

To solve

avoid argocd confusion that happened when something else adds app.kubernetes.io/instance label

I would rather propose to change the default app.kubernetes.io/instance label to something more sensible, Argo CD specific. E.g. argocd.argoproj.io/instance - I think the Argo CD Helm chart already sets this by default.

IMHO, I think we shouldn't sacrifice the label in order to enable longer-than-63-chars application names.

I'm trying to avoid painful migration for end-users: if Argo CD changes the tracking method then all managed resources suddenly became out-of-sync.

However, I agree that two ways if tracking is not great. In order to make it easier to migrate, we should support using "legacy" way of tracking for 1~2 releases.

docs/proposals/application-name-identifier.md

Signed-off-by: jannfis <jann@mistrust.net>

kshamajain99 · 2021-06-08T17:19:22Z

docs/proposals/application-name-identifier.md

+of the Argo CD installation into the final hash value, e.g.
+
+```
+value = hashFunc(installationIdentifier + "." + applicationName)


My suggestion would be that hashValue can be applicationName+"."+resourceKey. So that we know what the name of original resourceName in case when some kubernetes operator clone the resource.

By that we can also aim to solve another usecase of ignoreExtraneous resources outside target namespace

#6324

I really like the idea to bring in resourceKey somewhere. But I think if we put it into the application instance label, we will end up with a different value on each resource. This makes it utterly hard to use the label as a selector, e.g. to select all resources belonging to a given application by running kubectl get somekind -l app.kubernetes.io/instance=<somevalue>

Maybe it would make more sense to add it as an additional annotation on the resource? E.g. argo-cd.argoproj.io/resourceKey instead of trying to condense it.

But at the end, this will be a design decision: Do we want to support above use-case, e.g. using the label as a selector for resources managed by a given application, or not. I clearly see the benefits of using the label as a selector, but I don't have any strong opinions on it. We should just be aware of the consequences.

I agree that it is very convenient to be compatible with kubectl get -l app.kubernetes.io/instance=<somevalue> . This works most of the time except few edge cases:

app name is too long

something else add app.kubernetes.io/instance label and confuses argo cd. This is the most annoying issue and we don't have a very good resolution.

So I think it would be great if keep using app.kubernetes.io/instance label by default and allow to switch to new way when necessary.

@jessesuen had a good point about trying to keep app mark human readable. How about we add new app-level field resourceTracking that might be label or annotation. If user select annotation that Argo CD should apply app.kubernetes.io/instance annotation where annotation value is <appName>-<resouceName> ?

This would allow us to upgrade smoothly and takes care about both edge cases. WDYT?

The argo-cd.argoproj.io/resourceKey annotation instead of embedding resource key into app.kubernetes.io/instance works too

Signed-off-by: Alexander Matyushentsev <amatyushentsev@gmail.com>

alexmt · 2021-09-07T18:45:20Z

We brainstormed about this change with @jannfis and @jessesuen offline. Created the PR to @jannfis 's fork to capture the proposed changes: jannfis#17

docs: update proposal to use annotations with human friendly values

alexmt

LGTM

…n 63 chars (argoproj#6425) proposal: Change application identifier to allow app names longer than 63 chars (argoproj#6425) Signed-off-by: jannfis <jann@mistrust.net> Signed-off-by: viktorplakida <plakyda1@gmail.com>

jannfis added 2 commits June 8, 2021 08:20

proposal: Change application identifier

e1b3c99

Signed-off-by: jannfis <jann@mistrust.net>

Changes

02372f9

Signed-off-by: jannfis <jann@mistrust.net>

jannfis requested review from alexmt, jessesuen, jgwest, kshamajain99, mayzhang2000 and sbose78 June 8, 2021 08:29

jannfis changed the title ~~proposal: Change application identifier~~ proposal: Change application identifier to allow app names longer than 63 chars Jun 8, 2021

Changes

9a7ae62

Signed-off-by: jannfis <jann@mistrust.net>

jgwest reviewed Jun 8, 2021

View reviewed changes

docs/proposals/application-name-identifier.md Show resolved Hide resolved

Update terminology

1959766

Signed-off-by: jannfis <jann@mistrust.net>

kshamajain99 reviewed Jun 8, 2021

View reviewed changes

alexmt mentioned this pull request Jun 17, 2021

Compare option to ignore extraneous resources only in other namespaces #6324

Closed

This was referenced Jun 21, 2021

application resource can be applied in any namespace #3474

Closed

feat: (PoC) Applications outside argocd namespace #6537

Closed

docs: update proposal to use annotations with human friendly values

99076af

Signed-off-by: Alexander Matyushentsev <amatyushentsev@gmail.com>

Merge pull request #17 from alexmt/proposal/application-name-identifier

1fa7b42

docs: update proposal to use annotations with human friendly values

alexmt approved these changes Sep 10, 2021

View reviewed changes

alexmt merged commit d823efe into argoproj:master Sep 13, 2021

pasha-codefresh mentioned this pull request Sep 17, 2021

feat: rename app identifier #7251

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proposal: Change application identifier to allow app names longer than 63 chars #6425

proposal: Change application identifier to allow app names longer than 63 chars #6425

jannfis commented Jun 8, 2021 •

edited

Loading

codecov bot commented Jun 8, 2021 •

edited

Loading

jgwest Jun 8, 2021

jannfis Jun 8, 2021

jgwest Jun 8, 2021

jgwest Jun 8, 2021

jgwest Jun 8, 2021 •

edited

Loading

jannfis Jun 8, 2021

alexmt Jun 9, 2021 •

edited

Loading

jannfis Jun 22, 2021

alexmt Jun 28, 2021

kshamajain99 Jun 8, 2021 •

edited

Loading

jannfis Jun 8, 2021

alexmt Jun 8, 2021

alexmt Jun 8, 2021

alexmt commented Sep 7, 2021

alexmt left a comment


		On a rollback to a previous Argo CD version, this change would be reverted
		and the resource would look like the first shown example above.

proposal: Change application identifier to allow app names longer than 63 chars #6425

proposal: Change application identifier to allow app names longer than 63 chars #6425

Conversation

jannfis commented Jun 8, 2021 • edited Loading

codecov bot commented Jun 8, 2021 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jgwest Jun 8, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexmt Jun 9, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kshamajain99 Jun 8, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexmt commented Sep 7, 2021

alexmt left a comment

Choose a reason for hiding this comment

jannfis commented Jun 8, 2021 •

edited

Loading

codecov bot commented Jun 8, 2021 •

edited

Loading

jgwest Jun 8, 2021 •

edited

Loading

alexmt Jun 9, 2021 •

edited

Loading

kshamajain99 Jun 8, 2021 •

edited

Loading