[entities-wg] Rubric for evaluation of Entity signal designs #4071

jsuereth · 2024-06-06T21:47:57Z

Today during the otel-entities WG, we discussed values we'd use in rubrics to evaluate future OTEPs/Designs on entities. These are a set of principles we'd like to uphold, but can be flexible on. Designs to move forward with entities should list these conditions in pros/cons (at a minimum).

I'm opening this issue to record decisions, and will add follow-on comments for the unaddressed items we still need to decide upon.

Core Principles

Resource detectors (soon to be entity detectors) need to be composable / disjoint
New entities added by extension should not break existing code
- This means if a user takes an action to leverage a new entity, things may change.
- If a user upgrades, e.g. to a new SDK, the out-of-the-box (OOTB) defaults cannot break their existing o11y flow.
Navigational attributes need to exist and can be used to identify an entity, but could be augmented with a UUID or other aspects.
- Having ONLY a UUID for entity identification is not good enough.
- o11y needs to be actionable, e.g. you should be able to execute kubectl get pods <name> for a k8s pod.
- We'll need to work through design issues here - LOTS of discussion and options and nuanced trade-offs.
- Navigational identity should not change unless the entity identity itself changes.
Collector augmentation / enrichment (resource, e.g.) - Should be extensible and not hard-coded. We need a general algorithm not specific rulesets.
- e.g. SDK + Collector both having k8s detection - this should be supported.
- This may lead to additional issues we'll need to address.
Users are expected to provide / prioritize "detectors" and determine which entity is "producing" or most-important for a signal
- Priorities - This is important if there is overlap in information. We should see if we can avoid this situation.
- e.g. Java - discovering service.name. Have a variety of them running in a default order. Realistic to think users want to shift these around.
For an SDK - ALL telemetry should be associated with the same set of entities (resource labels).
- The association of signals relies on using the same entities to navigate those signals
- We need to make sure identity is the same even through multi-observers.
These are some principles we agreed are important and will evaluate in our rubric on design choices.
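
A rough sketch of how the "composable detectors with user-controlled priority" principle could behave, assuming a simple first-wins merge. The detector functions, their return values, and `run_detectors` are hypothetical and exist only for illustration; none of this is an OpenTelemetry API or an agreed design.

```python
from typing import Callable, Dict, List

# Hypothetical types for illustration; not an OpenTelemetry API.
Attributes = Dict[str, str]
Detector = Callable[[], Attributes]


def env_detector() -> Attributes:
    # Stand-in for a detector that reads user-provided configuration.
    return {"service.name": "checkout"}


def manifest_detector() -> Attributes:
    # Stand-in for a detector that inspects the runtime (e.g. a JAR manifest).
    return {"service.name": "unknown_service:java", "service.version": "1.4.2"}


def run_detectors(detectors: List[Detector]) -> Attributes:
    """Merge detector output; earlier detectors take priority on overlap."""
    merged: Attributes = {}
    for detect in detectors:
        for key, value in detect().items():
            # setdefault keeps the value from the higher-priority detector.
            merged.setdefault(key, value)
    return merged


# Default order versus an order the user has shifted around.
default_order = run_detectors([env_detector, manifest_detector])
user_order = run_detectors([manifest_detector, env_detector])
```

With the default order `service.name` comes from `env_detector`; when the user moves `manifest_detector` first, its value wins instead. Detectors that emit disjoint keys compose without any priority decision at all, which is the situation the principles above prefer.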


jsuereth · 2024-06-06T21:48:13Z

Issue 1 - Multi observers

We need the ability to understand if two observers are discussing the same entity.
Should the entity have the same ID or should this situation be detectable?
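
One way the "detectable" option could look, as a minimal sketch: each observer derives a key from the entity type plus its navigational attributes, and two reports refer to the same entity when the keys match. The `identity_key` helper and the choice of `k8s.namespace.name` + `k8s.pod.name` as the identifying set are assumptions made for this example, not a decided design.

```python
from typing import Dict, Tuple

IdentityKey = Tuple[str, Tuple[Tuple[str, str], ...]]


def identity_key(entity_type: str, identifying: Dict[str, str]) -> IdentityKey:
    """Build an order-independent key from navigational attributes only."""
    return (entity_type, tuple(sorted(identifying.items())))


# Two hypothetical observers reporting the same pod, in different key order.
from_sdk = identity_key(
    "k8s.pod", {"k8s.namespace.name": "shop", "k8s.pod.name": "web-0"}
)
from_collector = identity_key(
    "k8s.pod", {"k8s.pod.name": "web-0", "k8s.namespace.name": "shop"}
)
other_pod = identity_key(
    "k8s.pod", {"k8s.namespace.name": "shop", "k8s.pod.name": "web-1"}
)

same = from_sdk == from_collector
different = from_sdk != other_pod
```

This only works if every observer agrees on which attributes are identifying, which is exactly the open question here.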

jsuereth · 2024-06-06T21:49:09Z

Issue 2 - ENV variable for resources

We need some interaction between entity, resource + ENV variable that doesn't break OTEL operator users (and others leveraging ENV variables).

jsuereth · 2024-06-06T21:50:04Z

Issue 3 - Duplicate entity reporting

Should we prevent duplicate entities from being emitted across all possible telemetry sources? Should we have an automatic way for the collector, e.g. to unify duplicate sources of entities and only emit one definitive signal?

jsuereth · 2024-07-18T16:59:20Z

Copying notes from the latest SIG meeting on additional principles:

Issue 1 - Multi observers

Two observers are discussing/reporting the same entity - is this something we permit or consider a bug?

Users will need to be involved in solving multi-observer merge.
We need the solution to allow this, and it's a very important problem to get right.
We should try to solve the ~80% case so that users only need to worry about it in advanced cases.

Issue 2 - ENV variable for resources

We need some interaction between entity, resource + ENV variable that doesn't break OTEL operator users (and others leveraging ENV variables). Ideally the platform can push identity/entity into SDKs via ENV variable.

This is a problem we should solve and include in our solution.
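
For context, a minimal sketch of how a platform-provided value in the existing `OTEL_RESOURCE_ATTRIBUTES` format (comma-separated `key=value` pairs with percent-encoded values) could be read and kept authoritative over SDK detection. The helper names and the "ENV wins" policy in `apply_env_overrides` are assumptions for illustration; how entities should actually interact with this variable is the open question.

```python
from typing import Dict
from urllib.parse import unquote


def parse_resource_attributes(raw: str) -> Dict[str, str]:
    """Parse 'k1=v1,k2=v2' into a dict, skipping malformed pairs."""
    attrs: Dict[str, str] = {}
    for pair in raw.split(","):
        key, sep, value = pair.partition("=")
        if not sep or not key.strip():
            continue  # no '=' or empty key: ignore this pair
        attrs[key.strip()] = unquote(value.strip())
    return attrs


def apply_env_overrides(
    detected: Dict[str, str], env_attrs: Dict[str, str]
) -> Dict[str, str]:
    """Assumed policy: values pushed in via ENV win over detected ones."""
    return {**detected, **env_attrs}


# In a real process the string would come from
# os.environ.get("OTEL_RESOURCE_ATTRIBUTES", "").
env_attrs = parse_resource_attributes(
    "k8s.namespace.name=shop,k8s.pod.name=web-0"
)
detected = {"k8s.pod.name": "localhost", "service.name": "checkout"}
resource = apply_env_overrides(detected, env_attrs)
```

Keeping the ENV-provided values on top is one way to avoid breaking operator users who already rely on the variable today.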

Issue 3 - Duplicate entity reporting

Should we prevent duplicate entities from being emitted across all possible telemetry sources? Should we have an automatic way for the collector, e.g. to unify duplicate sources of entities and only emit one definitive signal?

This is a problem we can't solve entirely in OpenTelemetry
We should provide tools to solve this in OpenTelemetry Collector
We should provide a data model with guidance on how to solve this problem.
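
As an illustration of the kind of tool a collector could offer, here is a sketch that unifies duplicate entity reports into one record per identity. The report shape (`type`, `id`, `attributes`), the `dedupe_entities` function, and the "later report overwrites on conflict" rule are all assumptions for the example, not a proposed data model.

```python
from typing import Dict, List, Tuple

Report = Dict[str, object]


def dedupe_entities(reports: List[Report]) -> List[Report]:
    """Collapse reports that share type + identifying attributes."""
    merged: Dict[Tuple, Report] = {}
    for report in reports:
        ident = dict(report["id"])  # identifying attributes
        key = (report["type"], tuple(sorted(ident.items())))
        if key not in merged:
            merged[key] = {"type": report["type"], "id": ident, "attributes": {}}
        # Assumed conflict rule: descriptive values from later reports win.
        merged[key]["attributes"].update(report.get("attributes", {}))
    return list(merged.values())


reports = [
    {"type": "k8s.pod", "id": {"k8s.pod.name": "web-0"},
     "attributes": {"k8s.node.name": "node-a"}},
    {"type": "k8s.pod", "id": {"k8s.pod.name": "web-0"},
     "attributes": {"k8s.pod.uid": "1234"}},
    {"type": "k8s.pod", "id": {"k8s.pod.name": "web-1"}},
]
unique = dedupe_entities(reports)
```

Here the two `web-0` reports collapse into a single entity carrying both descriptive attributes, while `web-1` stays separate. A real solution would still need guidance on conflicting values, which is where the data model comes in.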

jack-berg · 2024-09-04T15:10:14Z

Captured in open-telemetry/oteps#264.

jsuereth added the spec:miscellaneous label (for issues that don't match any other spec label) Jun 6, 2024

jsuereth changed the title ~~[entities-wg] Rubric for evaluation of designs~~ [entities-wg] Rubric for evaluation of Entity signal designs Jun 6, 2024

jsuereth self-assigned this Jun 6, 2024

jsuereth added the triage:accepted:ready-with-sponsor label (ready to be implemented and has a specification sponsor assigned) Jun 6, 2024

jack-berg closed this as completed Sep 4, 2024
