Delta to Cumulative Processor (metrics) #29300
Comments
I'm sponsoring this component. A few initial thoughts/questions on the design:
I think we could also support nonmonotonic delta to nonmonotonic cumulative without any substantial changes. Or am I missing something?
This seems likely to be problematic, since every data point in a time series would be inaccurate in between the two processors. E.g. a delta timeseries with values 3, 2, 1 becomes cumulative time series 3, 2, 1 temporarily, and then properly 3, 5, 6 only after the second processor. One note on the broader use case for this - the metrics data model specifically describes this functionality and rationale:
Since our data model is explicitly designed for this use case, I think it is important that the Collector provides a solution for it, with sensible limitations.
The intention would be that the output of the first processor would be correct, but there would be a one-to-one correspondence of datapoints. So if we had 1000 deltas/sec, we would transform to 1000 cumulatives/sec. The second processor would kind of just emit the latest value every interval, so we would get 1 datapoint per interval (e.g. every 30 seconds). So, a delta timeseries with values 3, 2, 1 becomes the cumulative time series 3, 5, 6, but the second processor would just output 6 (and then output 6 again at the next publish interval, and keep doing that forever, or until a new datapoint is received, or the timeseries becomes "stale" and stops being tracked). We can already achieve something like the second processor by publishing to the Prometheus exporter and scraping ourselves with a Prometheus receiver. But it's a bit ugly, no?
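A minimal sketch of the accumulation logic described above (an illustration only, not the component's actual code; `streamID` is a placeholder for whatever identifying properties end up being used):

```go
package main

import "fmt"

// streamID stands in for whatever uniquely identifies a metric stream
// (metric name plus attribute set); see the identity sketch further below.
type streamID string

// accumulator keeps a running cumulative total per stream so that each
// incoming delta datapoint can be re-emitted as a cumulative datapoint.
type accumulator struct {
	totals map[streamID]float64
}

func newAccumulator() *accumulator {
	return &accumulator{totals: make(map[streamID]float64)}
}

// consumeDelta adds a monotonic delta to the stream's running total and
// returns the cumulative value to emit: one output point per input point.
func (a *accumulator) consumeDelta(id streamID, delta float64) float64 {
	a.totals[id] += delta
	return a.totals[id]
}

func main() {
	acc := newAccumulator()
	for _, d := range []float64{3, 2, 1} {
		fmt.Println(acc.consumeDelta("http.requests|method=GET", d)) // prints 3, 5, 6
	}
}
```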
Thanks for clarifying, I get the idea now. I actually find this composable design quite attractive. The implementation of each would be simpler and users may find a need for only one or the other (e.g. only use the first in combination with the prometheus exporter, or only use the second for reducing the frequency of counter updates).
Yes, me too :D
This will ensure we design a reusable state-tracking mechanism, which I think is needed for many similar use cases. For example, metric aggregation over dimensions.
@0x006EA1E5, given that we agree on splitting this into two components, do you mind opening two new issues in favor of this one? I will sponsor both.
I'm going to remove the |
Do we really need two new issues, rather than just scoping the current one to delta to cumulative, and one new one which would be whatever we call the other component (the one to periodically output updates)?
I created #29461
@0x006EA1E5, I apologize for the slow/missed responses on this. I still think we need these components and with the new year can be much more responsive, if you wish to renew your efforts on them.
Yes, no worries, hopefully I will get some more time to look at this too 😁
I have some questions regarding the spec for these processors, sorry if this is documented already somewhere.
We have the concept of identifying properties, which as I understand it includes:
The Metrics data model docs mention:
What should we do here?
Anyone have any ideas regarding other things that need to be defined?
Great questions. Focusing on the timestamp edge cases first.
I think the general principle is that we have two behaviors:
Does this make sense or am I oversimplifying it?
It does make sense, but I'm a bit concerned with how the complexity can explode considering how many things can vary 🤔 And how do we make these behaviours configurable without it becoming a mess? One important constraint will be how the typical downstream consumers will behave, for example if we expand the time window's start time, especially the
This is something that is on our radar, and we would like to support it as much as possible. Our use case is to enable remote writing of metrics from the count connector to Thanos.
This certainly could prove to be tricky, but I think it may be worth trying in a simple form and then iterating based on feedback.
Hi @djaglowski and @0x006EA1E5, I'd like to help contribute my time to the implementation of this issue and/or #29461. I can be available to work semi-full time on it. @0x006EA1E5, I see above you're working through many of the edge cases. Do you have a fork already? Would you be willing to work together?
Thanks very much @RichieSams. I think we'll gladly take the help unless @0x006EA1E5 is already working on it or just about to start. I haven't been working on any code for it, just trying to help design it ahead of time. I think it's fine to start development and we'll work through it as necessary. We have a contributing guide which articulates a strategy for splitting a new component into multiple PRs. This helps keep the complexity in each PR at a reasonable level so we can review them.
A duplicate issue was opened for this component and appears to have started development. I think we should consolidate to one issue but we need to reestablish whether we are splitting the component. Currently waiting for feedback from those involved with the other proposal.
It seems #30479 is very close to an exact duplicate of this issue and specifically does not include the functionality proposed in #29461. It has a detailed design doc and a reference implementation already too. Therefore, I will close this issue and suggest that anyone interested take a closer look at #30479 and the associated PRs. @RichieSams, @0x006EA1E5, or anyone else interested in #29461, I think we can parallelize efforts by moving focus to that processor.
The purpose and use-cases of the new component
Convert metric data from monotonic delta to monotonic cumulative.
We can currently convert from cumulative to delta, but not delta to cumulative.
One concrete use case is metrics produced by the count connector as deltas (in fact, a simple stream of monotonic delta 1s, with no start_time_unix_nano to mark the period start; this design decision appears to have been made so that the count connector can be stateless).
Metrics produced by the count connector cannot be exported correctly via the Prometheus exporter due to the missing start_time_unix_nano; the Prometheus exporter appears to consider the data points to represent breaks in the sequence.
Metrics produced by the count connector are also not suitable for export with the Prometheus remote write exporter. Instead, we should send the Prometheus remote write exporter periodic aggregates, as we would if the metrics had been scraped, for example every 30 seconds.
Ideally, we should be able to receive the stream of delta data points from the count connector (with missing start_time_unix_nano), and periodically emit a cumulative datapoint, suitable for reception by the Prometheus remote write exporter.
Note: due to the monotonic nature of the metrics, users should be aware that in a load-balanced configuration different instances will maintain different cumulative totals, and will therefore likely send incorrect, non-monotonic data downstream, unless care is taken to ensure that only one instance is ever responsible for a given metric (for example, by adding an identifying attribute such as the collector's unique instance id).
This processor will need to be stateful to maintain the cumulative total by metric.
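To illustrate the statefulness and the load-balancing caveat, one hypothetical way to derive a stream identity from the metric name and attributes is sketched below; a real implementation would need to follow the identifying properties defined by the metrics data model (resource, scope, unit, and so on). Because the resulting totals live inside a single collector instance, load-balanced instances that each see part of a stream would accumulate separate, diverging totals.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// streamIdentity derives a stable key for a metric stream from its name and
// attributes. Attribute order must not matter, so keys are sorted first.
// A real implementation would also need to include resource, scope, unit,
// etc. per the metrics data model; this sketch keeps it deliberately small.
func streamIdentity(name string, attrs map[string]string) string {
	keys := make([]string, 0, len(attrs))
	for k := range attrs {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	var b strings.Builder
	b.WriteString(name)
	for _, k := range keys {
		fmt.Fprintf(&b, "|%s=%s", k, attrs[k])
	}
	return b.String()
}

func main() {
	id := streamIdentity("http.requests", map[string]string{"method": "GET", "code": "200"})
	fmt.Println(id) // http.requests|code=200|method=GET
}
```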
The use case outlined above could be addressed in a single processor, which maintains the cumulative sum and periodically emits this value. Alternatively, this could be implemented in two processors: a simple delta to cumulative processor which emits a cumulative sum for each delta datapoint, and a periodic aggregator which could also be used for deltas. This issue proposes the creation of a simple delta to cumulative processor which emits a cumulative sum for each delta datapoint.
Another issue has been created to perform periodic aggregation: #29461
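For reference, a rough sketch of the periodic-aggregation idea behind #29461 (again an illustration under assumed names, not a proposed implementation): remember the latest cumulative value per stream and re-emit it on a fixed interval, regardless of how often new datapoints arrive.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// periodicEmitter remembers the latest cumulative value per stream and
// re-emits every tracked value on a fixed interval, independent of how often
// new datapoints arrive upstream.
type periodicEmitter struct {
	mu     sync.Mutex
	latest map[string]float64
}

func newPeriodicEmitter(interval time.Duration, emit func(id string, v float64)) *periodicEmitter {
	p := &periodicEmitter{latest: make(map[string]float64)}
	go func() {
		// The ticker runs for the lifetime of the process in this sketch;
		// a real component would also handle shutdown and staleness.
		for range time.Tick(interval) {
			p.mu.Lock()
			for id, v := range p.latest {
				emit(id, v)
			}
			p.mu.Unlock()
		}
	}()
	return p
}

// update records the most recent cumulative value for a stream.
func (p *periodicEmitter) update(id string, v float64) {
	p.mu.Lock()
	p.latest[id] = v
	p.mu.Unlock()
}

func main() {
	e := newPeriodicEmitter(time.Second, func(id string, v float64) {
		fmt.Printf("emit %s=%v\n", id, v)
	})
	e.update("http.requests", 6)
	time.Sleep(2500 * time.Millisecond) // re-emits 6 once per second until a new value arrives
}
```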
Example configuration for the component
Telemetry data types supported
Metric
Is this a vendor-specific component?
Code Owner(s)
No response
Sponsor (optional)
@djaglowski
Additional context
No response