Design doc: No init startup for store gateway. #1813

bwplotka · 2019-11-28T17:22:39Z

Hi 👋

Sharing the design doc with some initial discussion around removing or limiting startup time for store gateway to be used potentially by Cortex:

No startup metadata for blocks synchronization.
Loading blocks on-demand on query time.

While this has many benefits it is a tradeoff e.g in query latency for "cold blocks": https://docs.google.com/document/d/1En0Hr1OqZLlsF-_JtpYSWEu2mBXyVYx7BvXoivW0n3U/edit#

For Thanos, I am personally super interested in the latter step: loading blocks on-demand on query time. For block meta files synchronization we can go quite far with compaction and current iterating over the bucket. Those two can be tackled separately as well.

Feedback is welcome!

daixiang0 · 2020-01-06T02:40:03Z

better to add label like design or plan or something...

stale · 2020-02-05T09:19:48Z

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

pracucci · 2020-02-05T09:41:08Z

There's still some interest from my side on this improvement.

bwplotka · 2020-02-05T09:58:01Z

Agree - we should explore that further. We have stale bot exactly for that reason (: To revisit the issue if we are still interested at least once a month.

stale · 2020-03-06T09:58:27Z

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

pracucci · 2020-03-06T10:19:22Z

Let's keep it alive for a bit more.

Food for thought: the current lack of store gateway HA with sharding (if 1 gateway goes down, all queries fail) may actually be solved with a lazy storage which would allow to a fast re-sharding across gateways without downtime (if we completely remove the initial sync delay).

stale · 2020-04-05T11:12:19Z

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

bwplotka · 2020-04-05T12:22:30Z

unstale (:

…

On Sun, 5 Apr 2020 at 12:12, stale[bot] ***@***.***> wrote: This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#1813 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABVA3O43ULO5W6L2YSEJJE3RLBRSBANCNFSM4JSXGSWA> .

stale · 2020-05-05T14:43:11Z

Hello 👋 Looks like there was no activity on this issue for last 30 days.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity for next week, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale · 2020-06-04T17:56:10Z

Hello 👋 Looks like there was no activity on this issue for last 30 days.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity for next week, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

pracucci · 2020-06-05T07:06:12Z

I initially wrote that design doc but, from my side, the interest decreased since then. The reason is that loading cold blocks on-demand would have a significant performance impact on queries hitting blocks not-yet-loaded and the great work done by @bwplotka to reduce the in-memory index header size as well as the blocks sharding introducing in the Cortex store-gateway relaxed this need.

I still believe we need a faster way to "scan the bucket" to discover new/deleted blocks which doesn't involve having every single query/gateway instance running a periodic full bucket scan, but it's a different topic. Also, the recent support for metadata caching on memcached (including bucket List operation) relaxed this need too.

stale · 2020-07-05T07:41:36Z

Hello 👋 Looks like there was no activity on this issue for last 30 days.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity for next week, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

stale · 2020-07-12T08:31:39Z

Closing for now as promised, let us know if you need this to be reopened! 🤗

This was referenced Nov 28, 2019

querier: Limit LabelNames; LabelValues to certain time period. #1811

Closed

Long Term Storage Improvements [Tracking Issue] #1705

Closed

bwplotka added feature request/improvement proposal labels Jan 6, 2020

stale bot added the stale label Feb 5, 2020

stale bot removed the stale label Feb 5, 2020

stale bot added the stale label Mar 6, 2020

stale bot removed the stale label Mar 6, 2020

stale bot added the stale label Apr 5, 2020

stale bot removed the stale label Apr 5, 2020

stale bot added the stale label May 5, 2020

bwplotka removed the stale label May 5, 2020

stale bot added the stale label Jun 4, 2020

stale bot removed the stale label Jun 5, 2020

stale bot added the stale label Jul 5, 2020

stale bot closed this as completed Jul 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Design doc: No init startup for store gateway. #1813

Design doc: No init startup for store gateway. #1813

bwplotka commented Nov 28, 2019

daixiang0 commented Jan 6, 2020

stale bot commented Feb 5, 2020

pracucci commented Feb 5, 2020

bwplotka commented Feb 5, 2020

stale bot commented Mar 6, 2020

pracucci commented Mar 6, 2020

stale bot commented Apr 5, 2020

bwplotka commented Apr 5, 2020 via email

stale bot commented May 5, 2020

stale bot commented Jun 4, 2020

pracucci commented Jun 5, 2020

stale bot commented Jul 5, 2020

stale bot commented Jul 12, 2020

Design doc: No init startup for store gateway. #1813

Design doc: No init startup for store gateway. #1813

Comments

bwplotka commented Nov 28, 2019

daixiang0 commented Jan 6, 2020

stale bot commented Feb 5, 2020

pracucci commented Feb 5, 2020

bwplotka commented Feb 5, 2020

stale bot commented Mar 6, 2020

pracucci commented Mar 6, 2020

stale bot commented Apr 5, 2020

bwplotka commented Apr 5, 2020 via email

stale bot commented May 5, 2020

stale bot commented Jun 4, 2020

pracucci commented Jun 5, 2020

stale bot commented Jul 5, 2020

stale bot commented Jul 12, 2020