TSDB Index reuses slices, adds pools #5630

owen-d · 2022-03-15T20:51:47Z

This PR changes the index interface to accept slices. This allows us to reuse memory more easily. To make this easier, I've introduced a number of releveant pools for us to use.

It also adds a configurable rounds parameter (default 8) for the benchmarking script.

ref #5428

benchmarks:

name                                         old time/op    new time/op    delta
Query_PostingsForMatchers/match_ns-4           4.03µs ± 1%    4.02µs ± 0%     ~     (p=0.159 n=7+8)
Query_PostingsForMatchers/match_ns_regexp-4    4.03µs ± 0%    4.02µs ± 0%   -0.28%  (p=0.031 n=8+8)
Query_GetChunkRefs/match_ns-4                  27.4ms ± 0%    14.2ms ± 0%  -48.36%  (p=0.001 n=6+8)
Query_GetChunkRefs/match_ns_regexp-4           27.5ms ± 1%    14.3ms ± 1%  -47.97%  (p=0.000 n=8+7)
Query_GetChunkRefsSharded/match_ns-4            120ms ± 1%     102ms ± 1%  -15.29%  (p=0.000 n=8+8)
Query_GetChunkRefsSharded/match_ns_regexp-4     119ms ± 1%     102ms ± 1%  -14.64%  (p=0.000 n=7+8)

name                                         old alloc/op   new alloc/op   delta
Query_PostingsForMatchers/match_ns-4            80.0B ± 0%     80.0B ± 0%     ~     (all equal)
Query_PostingsForMatchers/match_ns_regexp-4     80.0B ± 0%     80.0B ± 0%     ~     (all equal)
Query_GetChunkRefs/match_ns-4                  40.4MB ± 0%     7.4MB ± 0%  -81.71%  (p=0.000 n=8+8)
Query_GetChunkRefs/match_ns_regexp-4           40.4MB ± 0%     7.4MB ± 0%  -81.70%  (p=0.000 n=8+7)
Query_GetChunkRefsSharded/match_ns-4           48.6MB ± 0%    15.0MB ± 0%  -69.20%  (p=0.001 n=7+7)
Query_GetChunkRefsSharded/match_ns_regexp-4    48.6MB ± 0%    15.0MB ± 0%  -69.20%  (p=0.001 n=8+6)

name                                         old allocs/op  new allocs/op  delta
Query_PostingsForMatchers/match_ns-4             4.00 ± 0%      4.00 ± 0%     ~     (all equal)
Query_PostingsForMatchers/match_ns_regexp-4      4.00 ± 0%      4.00 ± 0%     ~     (all equal)
Query_GetChunkRefs/match_ns-4                    278k ± 0%      278k ± 0%   -0.01%  (p=0.000 n=8+8)
Query_GetChunkRefs/match_ns_regexp-4             278k ± 0%      278k ± 0%   -0.01%  (p=0.000 n=8+8)
Query_GetChunkRefsSharded/match_ns-4             394k ± 0%      394k ± 0%   -0.12%  (p=0.000 n=8+8)
Query_GetChunkRefsSharded/match_ns_regexp-4      394k ± 0%      394k ± 0%   -0.12%  (p=0.000 n=8+8)

…all index types

pkg/storage/tsdb/index.go

cyriltovena · 2022-03-17T07:58:48Z

pkg/storage/tsdb/multi_file_index.go

 	groups, err := i.forIndices(ctx, from, through, func(ctx context.Context, idx Index) (interface{}, error) {
-		return idx.GetChunkRefs(ctx, userID, from, through, shard, matchers...)
+		refs := ChunkRefsPool.Get()
+		err := idx.GetChunkRefs(ctx, userID, from, through, &refs, shard, matchers...)


if the final append was inside here you wouldn't need to Get from the pool multiple times, just reset the slice for each indice.

Same for the Multi Series.

I used independent slices because i.forIndices runs all these calls in parallel.

cyriltovena

LGTM

I think we should keep the interface to return results, feels more natural to me.

chaudum · 2022-03-17T09:33:40Z

pkg/storage/tsdb/multi_file_index.go

+func (i *MultiIndex) GetChunkRefs(ctx context.Context, userID string, from, through model.Time, res *[]ChunkRef, shard *index.ShardAnnotation, matchers ...*labels.Matcher) error {
+	*res = (*res)[:0]


Where is (*MultiIndex).GetChunkRefs() called from? If a newly retrieved *[]ChunkRef is passed to the function, why do we need to reset the slice?

This isn't actually called from anywhere yet -- we're just building the prototype for how we will interact with the tsdb-based index. Regarding *[]ChunkRef, it was to allow the caller to supply their own slice to avoid allocations. It's encouraged that the slice could be reused across multiple calls to avoid allocations, so we reset it.

adds a pool for ChunkMetas

74183a8

owen-d requested a review from a team as a code owner March 15, 2022 20:51

owen-d added 2 commits March 15, 2022 17:32

index takes slice pointers for allocation reasons and adds pools for …

16d7848

…all index types

reuse slices in tsdb benchmarks

523a2a8

owen-d changed the title ~~adds a pool for ChunkMetas~~ TSDB Index reuses slices, adds pools Mar 15, 2022

pull-request-size bot added size/M size/L and removed size/M labels Mar 15, 2022

owen-d added 2 commits March 15, 2022 17:40

adds count to bench script

1af35d7

properly returns series to pool

fc12a92

cyriltovena reviewed Mar 17, 2022

View reviewed changes

pkg/storage/tsdb/index.go Outdated Show resolved Hide resolved

cyriltovena reviewed Mar 17, 2022

View reviewed changes

cyriltovena approved these changes Mar 17, 2022

View reviewed changes

chaudum approved these changes Mar 17, 2022

View reviewed changes

owen-d added 4 commits March 17, 2022 13:07

Merge remote-tracking branch 'upstream/main' into tsdb/chunkmeta-pool

0ab0bdd

more ergonomic index signatures while still supporting slice reuse

a1e24c4

tsdb index documentation

2230a95

aligns tsdb-map tooling with new index signatures

9d1146d

owen-d merged commit 3f28a33 into grafana:main Mar 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TSDB Index reuses slices, adds pools #5630

TSDB Index reuses slices, adds pools #5630

owen-d commented Mar 15, 2022 •

edited

Loading

cyriltovena Mar 17, 2022

owen-d Mar 17, 2022

cyriltovena left a comment

chaudum Mar 17, 2022

owen-d Mar 17, 2022

		func (i MultiIndex) GetChunkRefs(ctx context.Context, userID string, from, through model.Time, res []ChunkRef, shard index.ShardAnnotation, matchers ...labels.Matcher) error {
		res = (res)[:0]

TSDB Index reuses slices, adds pools #5630

TSDB Index reuses slices, adds pools #5630

Conversation

owen-d commented Mar 15, 2022 • edited Loading

cyriltovena Mar 17, 2022

Choose a reason for hiding this comment

owen-d Mar 17, 2022

Choose a reason for hiding this comment

cyriltovena left a comment

Choose a reason for hiding this comment

chaudum Mar 17, 2022

Choose a reason for hiding this comment

owen-d Mar 17, 2022

Choose a reason for hiding this comment

owen-d commented Mar 15, 2022 •

edited

Loading