Ensure republish mechanism creates consistent results

## Problem

I have a check that runs every 5 minutes. When I inspect the underlying data in Mimir, I see an inconsistent number of samples for each probe event. I have created a [spreadsheet](https://docs.google.com/spreadsheets/d/1WG57INHlQ4quxK0DCefH26PqkgZsFnO5Pwetcs9HShA/edit?gid=0#gid=0) to show what I have versus what I would expect.

_If you want to view the raw data and experiment, add yourself to my stack and see it [here](https://ckbedwellksix.grafana-dev.net/explore?schemaVersion=1&panes=%7B%22au8%22%3A%7B%22datasource%22%3A%22grafanacloud-logs%22%2C%22queries%22%3A%5B%7B%22expr%22%3A%22%7Bprobe%3D%7E%5C%22.*%5C%22%2C+instance%3D%5C%22https%3A%2F%2Farabiantents.com%2F%5C%22%2C+job%3D%5C%22Arabian+Spot+Check+Content%5C%22%7D+%7C+logfmt+%7C+__error__+%3D+%5C%22%5C%22+%7C%3D+%60Beginning+check%60%22%2C%22refId%22%3A%22A%22%2C%22datasource%22%3A%7B%22type%22%3A%22loki%22%2C%22uid%22%3A%22grafanacloud-logs%22%7D%2C%22editorMode%22%3A%22code%22%2C%22queryType%22%3A%22range%22%2C%22direction%22%3A%22forward%22%7D%5D%2C%22range%22%3A%7B%22from%22%3A%221741856700000%22%2C%22to%22%3A%221741857986000%22%7D%2C%22panelsState%22%3A%7B%22logs%22%3A%7B%22visualisationType%22%3A%22logs%22%7D%7D%7D%2C%22v9q%22%3A%7B%22datasource%22%3A%22grafanacloud-prom%22%2C%22queries%22%3A%5B%7B%22refId%22%3A%22A%22%2C%22expr%22%3A%22probe_success%7Binstance%3D%5C%22https%3A%2F%2Farabiantents.com%2F%5C%22%2C+job%3D%5C%22Arabian+Spot+Check+Content%5C%22%7D%5B%24__range%5D%22%2C%22range%22%3Afalse%2C%22datasource%22%3A%7B%22type%22%3A%22prometheus%22%2C%22uid%22%3A%22grafanacloud-prom%22%7D%2C%22editorMode%22%3A%22code%22%2C%22instant%22%3Atrue%2C%22exemplar%22%3Afalse%2C%22format%22%3A%22heatmap%22%7D%5D%2C%22range%22%3A%7B%22from%22%3A%221741856700000%22%2C%22to%22%3A%221741857986000%22%7D%2C%22panelsState%22%3A%7B%22logs%22%3A%7B%22visualisationType%22%3A%22logs%22%7D%7D%7D%7D&orgId=1)_

In a five-minute period I would expect to see 3 samples per probe, but the count alternates between 2 and 4. There should be a consistent number of samples per execution.

The context of this change is ensuring we can accurately portray uptime. Given that we need a republish mechanism, I am thinking about how to integrate it with PromQL and Grafana's time range querying mechanism (evaluating periods in neat blocks of time, e.g. always 09:15:00 - 09:20:00 and 09:20:00 - 09:25:00 rather than 09:16:17 - 09:21:17). I am considering this query as a version for v4:

```promql
max by () (round(avg_over_time(probe_success{job="${job}", instance="${instance}", probe=~"${probe}"}[${interval}])))
```
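To make the "neat blocks" idea concrete, here is a minimal sketch of the arithmetic, assuming windows are aligned by flooring a timestamp to a multiple of the interval. The function names are hypothetical and timestamps are expressed as seconds since midnight for readability. This only illustrates the alignment, not how Grafana implements it.

```python
def aligned_window(ts_seconds: int, interval_seconds: int) -> tuple[int, int]:
    """Return the (start, end) of the interval-aligned block containing ts_seconds."""
    # Floor the timestamp to the nearest lower multiple of the interval.
    start = ts_seconds - (ts_seconds % interval_seconds)
    return start, start + interval_seconds


def hhmmss(seconds: int) -> str:
    """Format seconds since midnight as HH:MM:SS."""
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


# 09:16:17 expressed as seconds since midnight, with a five-minute interval.
start, end = aligned_window(9 * 3600 + 16 * 60 + 17, 5 * 60)
print(hhmmss(start), "-", hhmmss(end))  # 09:15:00 - 09:20:00
```

With aligned blocks, which samples land in a given block depends on when they were published relative to the block boundaries, which is why the per-block sample count matters for the query above.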

The second issue I face with the republish mechanism is that I need an **odd** (uneven) number of samples per probe execution for the above query to work accurately. In the spreadsheet above I've made a [second tab](https://docs.google.com/spreadsheets/d/1WG57INHlQ4quxK0DCefH26PqkgZsFnO5Pwetcs9HShA/edit?gid=798522131#gid=798522131) showing two hypothetical scenarios for a check with a four-minute interval.

If we hard-code the two-minute interval, failures that occur alongside successes will never be reported.
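As an illustration of why the sample count per window matters, here is a rough sketch of how `round(avg_over_time(...))` treats a window of `probe_success` samples. It assumes PromQL's documented `round()` behaviour (ties are resolved by rounding up) and ignores the outer `max by ()` across probes. The helper names and sample lists are hypothetical, not data from the spreadsheet.

```python
import math


def prom_round(value: float) -> int:
    """Mimic PromQL round(): nearest integer, with ties rounded up.

    Python's built-in round() uses banker's rounding, so it is not used here.
    """
    return math.floor(value + 0.5)


def window_status(samples: list[int]) -> int:
    """Approximate round(avg_over_time(probe_success[window])) for one probe."""
    return prom_round(sum(samples) / len(samples))


# Even sample counts can tie at 0.5, and the tie resolves to success.
print(window_status([1, 0]))        # 1: the failure is hidden
print(window_status([1, 1, 0, 0]))  # 1: two failures are hidden

# Odd sample counts cannot tie, so the majority always wins.
print(window_status([1, 0, 0]))     # 0
print(window_status([1, 1, 0]))     # 1
```

Under these assumptions, a window holding exactly two samples only reports a failure when both samples are failures, which is consistent with the concern above.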
