-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanos Store Bucket Operation Latency since v0.33.0 #7072
Comments
We went from bitnami chart 12.13.0 (thanos 0.32.1) to chart 12.20.1 (thanos 0.33) and see exactly the same behavior and we also use S3 objstore as backend: version:
args:
*** 2nd jan is the date of version change, so it really looks like it is something in this version |
We also observed this exact issue when we upgraded from 0.32.5 to 0.33.0, and rolled back. I had a look at the changes between versions that could have caused this, and I'm now suspecting this might not be an issue per-se, but actually something that wasn't being correctly measured: thanos-io/objstore#79 instrumented the Maybe it's the 2s threshold of the |
We haven't looked enough in it yet, though also questioned ourself if simply the reporting is changed. This feeling is fuelled by the facts that in all the other metrics we have, we dont see any behavioural change in S3. |
Thanos, Prometheus and Golang version used:
Object Storage Provider:
Cloudscale.ch
What happened:
Since upgrading thanos to v0.33.0 from v0.32.5, we are running into the
ThanosStoreObjstoreOperationLatencyHigh
alert and the corresponding metric shows an increase of the 99th percentile by 3-6 times.What you expected to happen:
No significant increase in latency.
How to reproduce it (as minimally and precisely as possible):
Screenshot of our production environment (the increase corresponds exactly to the upgrade time):
Screenshot of our testing environment (same setup, less load) after we downgraded back to 0.32.5 (the decrease corresponds exactly to the downgrade time):
Full logs to relevant components:
Some logs AFTER upgrade
Some logs BEFORE upgrade
Anything else we need to know:
args
caching-bucket.config-file:
The text was updated successfully, but these errors were encountered: