Remove queue from the Prometheus remote write #2951

rakyll · 2021-04-16T18:40:29Z

Prometheus remote write (PWR) exporter is supporting queued retry
out of the box, but due to the strict requirements of PWR, the queue
causes "out of order sample" errors. The queue won't be suitable
for Prometheus unless it has a way to shard the data by timeseries.

This is not going to break the existing users because if they have
this feature enabled, they already can't use the PRW exporter correctly.

Fixes #2949.

Prometheus remote write (PWR) exporter is supporting queued retry out of the box, but due to the strict requirements of PWR, the queue causes "out of order sample" errors. The queue won't be suitable for Prometheus unless it has a way to shard the data by timeseries. Fixes #2949.

codecov · 2021-04-16T18:46:53Z

Codecov Report

Merging #2951 (9d12db2) into main (e72553a) will decrease coverage by 0.00%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main    #2951      +/-   ##
==========================================
- Coverage   91.68%   91.68%   -0.01%     
==========================================
  Files         312      312              
  Lines       15339    15337       -2     
==========================================
- Hits        14063    14061       -2     
  Misses        870      870              
  Partials      406      406

Impacted Files	Coverage Δ
exporter/prometheusremotewriteexporter/factory.go	`100.00% <ø> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e72553a...9d12db2. Read the comment docs.

rakyll · 2021-04-16T18:57:07Z

cc @odeke-em

odeke-em

LGTM, thank you @rakyll! I added a suggestion but also would be nice to add the issue you filed, as a reference to the code comments. Thank you.

exporter/prometheusremotewriteexporter/README.md

odeke-em · 2021-04-16T20:40:27Z

Kindly cc-ing @bogdandrutu!

rakyll · 2021-04-16T23:43:12Z

Added a link to the issue, thanks for the suggestion @odeke-em.

odeke-em · 2021-04-16T23:44:23Z

Nice, thank you @rakyll! LGTM.

tigrannajaryan · 2021-04-19T14:35:46Z

Can we instead force queued_retry to have a single consumer for this exporter? That should fix the problem while keeping the retrying and queuing capabilities, right? I am not sure how important is that for this exporter though.

rakyll · 2021-04-19T16:24:22Z

@tigrannajaryan Retrying is critical in the context of remote write, it's not easy to have a highly available Prometheus, Cortex or Thanos. We want to reimplement the retrying options Prometheus server implements natively (https://prometheus.io/docs/practices/remote_write/) to provide a drop-in replacement and help users to reuse their fine tuned configuration.

tigrannajaryan · 2021-04-19T16:38:45Z

@tigrannajaryan Retrying is critical in the context of remote write, it's not easy to have a highly available Prometheus, Cortex or Thanos. We want to reimplement the retrying options Prometheus server implements natively (https://prometheus.io/docs/practices/remote_write/) to provide a drop-in replacement and help users to reuse their fine tuned configuration.

Sounds good. Do you also want to add a TODO or an issue (if it does not already exist) describing what you plan to do?

rakyll · 2021-04-19T18:53:54Z

I updated https://github.com/open-telemetry/opentelemetry-collector/issues/2259 to make sure we're following up with the necessary changes. Thanks much.

…2974) * Enable queue for the Prometheus Remote Write Exporter internally This is a follow up to #2951. When we disable the queue completely, it causes the export to happen not in a consumer goroutine. Enable the queue internally that export gets its own goroutine not to block the entire collector pipeline. * Set a default queue size

* Remove sending_queue from AWS Prometheus Remote Write Exporter This is a follow up of open-telemetry/opentelemetry-collector#2951. Fixes #3163. * Fix the test

…telemetry#3186) * Remove sending_queue from AWS Prometheus Remote Write Exporter This is a follow up of open-telemetry/opentelemetry-collector#2951. Fixes open-telemetry#3163. * Fix the test

rakyll requested a review from a team April 16, 2021 18:40

Fix docs

10ae713

odeke-em approved these changes Apr 16, 2021

View reviewed changes

exporter/prometheusremotewriteexporter/README.md Outdated Show resolved Hide resolved

Fix typo

01a1a28

Add issue link

9d12db2

This was referenced Apr 17, 2021

Improvements to the awsprometheusremotewriteexporter open-telemetry/opentelemetry-collector-contrib#3158

Merged

Reflect queue removal changes from PRW exporter to AWS PRW exporter open-telemetry/opentelemetry-collector-contrib#3163

Closed

tigrannajaryan approved these changes Apr 19, 2021

View reviewed changes

tigrannajaryan merged commit 51412b3 into open-telemetry:main Apr 19, 2021

rakyll deleted the prom-debug branch April 19, 2021 18:53

This was referenced Apr 20, 2021

Remove sending_queue from AWS Prometheus Remote Write Exporter open-telemetry/opentelemetry-collector-contrib#3186

Merged

Enable the queue for the Prometheus Remote Write Exporter internally #2974

Merged

hughesjj pushed a commit to hughesjj/opentelemetry-collector that referenced this pull request Apr 27, 2023

Fix debian upgrade command in linux-installer.md (open-telemetry#2951)

5bd5042

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove queue from the Prometheus remote write #2951

Remove queue from the Prometheus remote write #2951

rakyll commented Apr 16, 2021 •

edited

Loading

codecov bot commented Apr 16, 2021 •

edited

Loading

rakyll commented Apr 16, 2021

odeke-em left a comment

odeke-em commented Apr 16, 2021

rakyll commented Apr 16, 2021

odeke-em commented Apr 16, 2021

tigrannajaryan commented Apr 19, 2021

rakyll commented Apr 19, 2021 •

edited

Loading

tigrannajaryan commented Apr 19, 2021

rakyll commented Apr 19, 2021

Remove queue from the Prometheus remote write #2951

Remove queue from the Prometheus remote write #2951

Conversation

rakyll commented Apr 16, 2021 • edited Loading

codecov bot commented Apr 16, 2021 • edited Loading

Codecov Report

rakyll commented Apr 16, 2021

odeke-em left a comment

Choose a reason for hiding this comment

odeke-em commented Apr 16, 2021

rakyll commented Apr 16, 2021

odeke-em commented Apr 16, 2021

tigrannajaryan commented Apr 19, 2021

rakyll commented Apr 19, 2021 • edited Loading

tigrannajaryan commented Apr 19, 2021

rakyll commented Apr 19, 2021

rakyll commented Apr 16, 2021 •

edited

Loading

codecov bot commented Apr 16, 2021 •

edited

Loading

rakyll commented Apr 19, 2021 •

edited

Loading