Guidance for filling (or consider removing?) service.instance.id #1034

anuraaga · 2020-09-29T07:25:42Z

Resource defines service.instance.id as a unique ID among horizontally scaled services

https://github.com/open-telemetry/opentelemetry-specification/tree/master/specification/resource/semantic_conventions#service

But we have many similar IDs in other places
host.id
k8s.pod.name or uid
container.id
faas.instance

Should one of these be copied into service.instance.id when it's available? One issue is ordering, host.id is the instance ID for non-container workloads but not container workloads, for example.

Or can we just remove service.instance.id since we would expect an ID of horizontal scaling to be present in another convention?

The text was updated successfully, but these errors were encountered:

Oberon00 · 2020-09-29T07:52:35Z

A host can host multiple services. A POD might host multiple (related) services. A container might host multiple services (even though that's unusual). I can't think of a scenario wher FaaS would have more than one service instance.

I agree though that most of the time, except for host, everything should map 1:1 to service.instance.id, so that would be reasonable choices.

However, since only the triplet service.(namespace, name, id) is supposed to be unique, I think that we should not auto-fill service.instance.id by default with anything -- it seems to be predestined for an "inherent unique ID" as the spec puts it. I don't think we should remove it, it is fine for that use case.

I think it may make a lot of sense to add a telemetry.instance_id to be able to group resources that came from the very same initialization of OpenTelemetry, i.e. something that is usable as primary key for resources. Note that the service namespace/name/id triplet is not usable for that as it stays (or should stay at least) the same over restarts of the same service instance.

anuraaga · 2020-09-29T08:11:24Z

is not usable for that as it stays (or should stay at least) the same over restarts of the same service instance.

Hmm - in that case should we consider removing the recommendation to "generate a random Version 1 or Version 4 RFC 4122 UUID"? I don't think it can ever be possible for the UUID to be the same over restarts. I guess a user needs to set it appropriately in that case.

Oberon00 · 2020-09-29T08:15:18Z

Spec says:

It is preferable for the ID to be persistent and stay the same for the lifetime of the service instance, however it is acceptable that the ID is ephemeral and changes during important lifetime events for the service (e.g. service restarts).

And also later:

services aiming for reproducible UUIDs may also use Version 5, see RFC 4122 for more recommendations

It would be cool if the spec could also write the "nicknames" for these versions, e.g. V 5 requires a name and namespace as input, e.g. Python's implementation has this:

uuid.uuid5(namespace, name)
Generate a UUID based on the SHA-1 hash of a namespace identifier (which is a UUID) and a name (which is a string).

anuraaga · 2020-09-30T08:37:15Z

I was thinking a bit more on this - @Oberon00 just wondering, how would a backend use this value with the current definition? Isn't it a bit too ambiguous? If the value is always a useful, persistent identifier for the instance among a group of them, that's useful, but if we also allow a fallback to a random UUID V1 / V5 then isn't the attribute too unreliable to do anything meaningful?

Oberon00 · 2020-09-30T08:44:04Z

I don't think the current definition is suitable for using it much generically. It probably requires user-side knowledge to interpret. Still, grouping by instance ID groups at least groups all the spans by one running OpenTelemetry SDK instance together.

andrewhsu · 2020-10-09T16:03:23Z

from issue triage mtg today, setting as p3 for editorial change if it comes in before ga

Oberon00 · 2020-12-11T17:20:58Z

Since service.instance.id is "required" in the service resource, this issue does not interact well with #1269 "Define the fallback case for service.name": If the SDK sets a value for service.name automatically, service.instance.id may be missing, resulting in an invalid resource produced by default. See #1269 (comment)

carlosalberto · 2020-12-15T01:03:13Z

Changed priority to P2 to have this issue as a follow up.

Oberon00 · 2020-12-15T12:38:30Z

Repeating my comment from #1269 (comment)

IMHO this attribute is poorly defined right now as it may or may not be the same across service restarts, which IMHO can make quite a difference. It would be easiest if it MUST be the different for each restart, that way it could be used as primary key for all resources (not only service.*) sent by the same telemetry instance. On the other hand, maybe such an attribute would better be named telemetry.sdk.instance.id.

(BTW, why instance.id instead of instance_id?)

To sum it up, I think we should split this attribute up into (names are just ideas):

service.deployment_id: An unique ID of that particular deployed service instance. Stays the same across restarts, but only one service with that deployment_id can ever be running at the same time (if there are, they are parallel instances running with the exact same configuration+filesystem+host). Such an ID will often not be available. If it is, the ID is typically created when installing/deploying the service and stored on the filesystem.
telemetry.sdk.instance_id: A unique ID generated at startup by the SDK. Can be used as primary key for resources.

carlosalberto · 2020-12-15T18:46:47Z

To sum it up, I think we should split this attribute up into (names are just ideas)

Thanks @Oberon00, I think this makes sense. Not sure on the names themselves but the idea of having two attributes (one optional, set by the user, and another one generated per-restart by the SDK) seems like the way to go here.

aabmass · 2021-09-16T20:24:20Z

@Oberon00 @anuraaga any updates on this issue? Looking at what we have in the Python SDK right now, the default behavior violates the requirement that "service.namespace,service.name,service.instance.id triplet MUST be globally unique".

We are prototyping metrics and this presents the problem that multiple instances of a service are likely to violate the single-writer requirement because of the identical resources. I was considering doing the same as open-telemetry/opentelemetry-java#1726 to solve this, but it doesn't seem like we are settled on a solution.

Oberon00 · 2021-09-17T08:05:44Z

I don't really have much stake in this issue (anymore). I think it would be best if you create a PR since you seem to have an actual use case.

svrnm · 2021-11-12T13:50:52Z

I stumbled across this issue recently as well, so I would be curious to get this conversation re-started.

A few thoughts:

Reg. service.deployment_id. That's an interesting addition. However, I am wondering if this should be part of Deployment, i.e. deployment.id or similar? Or would it make more sense to add deployment.environment to service, i.e. service.deployment.environment and service.deployment.id?

Rg. telemetry.sdk.instance_id: would it be possible to reload/replace the SDK at runtime? Let's say I use an auto instrumentation agent and make a hot update? Would that id change in that case?

mtwo · 2024-07-09T20:39:11Z

This appears to be closed by open-telemetry/semantic-conventions#312. Please re-open if this isn't correct!

anuraaga added the spec:resource Related to the specification/resource directory label Sep 29, 2020

Oberon00 added the area:semantic-conventions Related to semantic conventions label Sep 29, 2020

andrewhsu added the release:allowed-for-ga Editorial changes that can still be added before GA since they don't require action by SIGs label Sep 29, 2020

andrewhsu added the priority:p3 Lowest priority level label Oct 9, 2020

Oberon00 mentioned this issue Dec 11, 2020

Define the fallback case for service.name #1269

Merged

carlosalberto added priority:p2 Medium priority level and removed priority:p3 Lowest priority level labels Dec 15, 2020

jmacd mentioned this issue Dec 18, 2020

Metrics: Requirements for safe attribute removal #1297

Open

jmacd mentioned this issue Feb 9, 2021

Adding resource attributes post-creation (e.g. via auto-discovery) #1298

Open

Oberon00 mentioned this issue Mar 3, 2021

Stabilizing service.name #1497

Closed

Oberon00 mentioned this issue May 14, 2021

Add OTEL_SERVICE_NAME environment variable #1677

Merged

anuraaga mentioned this issue Aug 26, 2021

Fill service.instance.id with UUID by default open-telemetry/opentelemetry-java#1726

Closed

aabmass mentioned this issue Sep 16, 2021

Generate a service.instance.id resource attribute if it is not present open-telemetry/opentelemetry-python#2113

Open

This was referenced Dec 6, 2021

Add telemetry.source attribute namespace #2192

Closed

Introduce Mandatory Unique Identifier For Telemetry Sources open-telemetry/oteps#194

Closed

Oberon00 mentioned this issue Jun 28, 2022

Ephemeral Resource Attributes open-telemetry/oteps#208

Closed

jack-berg mentioned this issue Jan 23, 2023

Should language SDK generate service.instance.id? #3136

Closed

mtwo closed this as completed Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guidance for filling (or consider removing?) service.instance.id #1034

Guidance for filling (or consider removing?) service.instance.id #1034

anuraaga commented Sep 29, 2020

Oberon00 commented Sep 29, 2020 •

edited

Loading

anuraaga commented Sep 29, 2020

Oberon00 commented Sep 29, 2020 •

edited

Loading

anuraaga commented Sep 30, 2020

Oberon00 commented Sep 30, 2020

andrewhsu commented Oct 9, 2020

Oberon00 commented Dec 11, 2020 •

edited

Loading

carlosalberto commented Dec 15, 2020

Oberon00 commented Dec 15, 2020 •

edited

Loading

carlosalberto commented Dec 15, 2020

aabmass commented Sep 16, 2021

Oberon00 commented Sep 17, 2021 •

edited

Loading

svrnm commented Nov 12, 2021

mtwo commented Jul 9, 2024

Guidance for filling (or consider removing?) service.instance.id #1034

Guidance for filling (or consider removing?) service.instance.id #1034

Comments

anuraaga commented Sep 29, 2020

Oberon00 commented Sep 29, 2020 • edited Loading

anuraaga commented Sep 29, 2020

Oberon00 commented Sep 29, 2020 • edited Loading

anuraaga commented Sep 30, 2020

Oberon00 commented Sep 30, 2020

andrewhsu commented Oct 9, 2020

Oberon00 commented Dec 11, 2020 • edited Loading

carlosalberto commented Dec 15, 2020

Oberon00 commented Dec 15, 2020 • edited Loading

carlosalberto commented Dec 15, 2020

aabmass commented Sep 16, 2021

Oberon00 commented Sep 17, 2021 • edited Loading

svrnm commented Nov 12, 2021

mtwo commented Jul 9, 2024

Oberon00 commented Sep 29, 2020 •

edited

Loading

Oberon00 commented Sep 29, 2020 •

edited

Loading

Oberon00 commented Dec 11, 2020 •

edited

Loading

Oberon00 commented Dec 15, 2020 •

edited

Loading

Oberon00 commented Sep 17, 2021 •

edited

Loading