store: use loser trees #7304

GiedriusS · 2024-04-25T13:45:24Z

Remove a long-standing TODO item in the code: let's use the great loser tree implementation by Bryan. It is faster than the heap because fewer comparisons are needed. This should be a nice improvement given that the merging is used in a lot of hot paths.

Since Prometheus also uses this library, it's tricky to import the "any" version. I tried bboreham/go-loser#3, but it's still impossible to do that. Let's just copy/paste the code; it's not a lot.
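To illustrate the comparison-count argument, here is a minimal loser tree sketch for merging sorted `int` slices. It is not the code added in `pkg/losertree`; the names `loserTree`, `newLoserTree`, `next` and `mergeSorted` are made up for this example. After each pop, only the matches on the path from the winner's leaf to the root are replayed, which costs one comparison per tree level, whereas a binary heap's sift-down can need up to two per level.

```go
package main

import "fmt"

// loserTree merges k ascending int slices.
// Hypothetical illustration; not the code in pkg/losertree.
type loserTree struct {
	lists  [][]int // input sequences, each sorted ascending
	pos    []int   // read cursor into each sequence
	loser  []int   // loser[n]: sequence that lost the match at internal node n
	winner int     // sequence holding the smallest unread value, -1 if k == 0
}

func newLoserTree(lists [][]int) *loserTree {
	k := len(lists)
	t := &loserTree{lists: lists, pos: make([]int, k), loser: make([]int, k), winner: -1}
	if k > 0 {
		t.winner = t.build(1)
	}
	return t
}

// less reports whether the head of sequence a sorts before the head of b.
// An exhausted sequence never wins a match.
func (t *loserTree) less(a, b int) bool {
	if t.pos[a] >= len(t.lists[a]) {
		return false
	}
	if t.pos[b] >= len(t.lists[b]) {
		return true
	}
	return t.lists[a][t.pos[a]] < t.lists[b][t.pos[b]]
}

// build plays the initial tournament below node n and returns its winner.
// Nodes 1..k-1 are internal; node k+i is the leaf for sequence i.
func (t *loserTree) build(n int) int {
	k := len(t.lists)
	if n >= k {
		return n - k
	}
	l, r := t.build(2*n), t.build(2*n+1)
	if t.less(r, l) {
		l, r = r, l
	}
	t.loser[n] = r
	return l
}

// next pops the smallest value, then replays only the matches on the path
// from the winner's leaf to the root: one comparison per tree level.
func (t *loserTree) next() (int, bool) {
	w := t.winner
	if w < 0 || t.pos[w] >= len(t.lists[w]) {
		return 0, false
	}
	v := t.lists[w][t.pos[w]]
	t.pos[w]++
	for n := (w + len(t.lists)) / 2; n >= 1; n /= 2 {
		if t.less(t.loser[n], w) {
			t.loser[n], w = w, t.loser[n]
		}
	}
	t.winner = w
	return v, true
}

// mergeSorted drains the tree into a single ascending slice.
func mergeSorted(lists [][]int) []int {
	var out []int
	t := newLoserTree(lists)
	for v, ok := t.next(); ok; v, ok = t.next() {
		out = append(out, v)
	}
	return out
}

func main() {
	fmt.Println(mergeSorted([][]int{{1, 4, 7}, {2, 5, 8}, {3, 6, 9}}))
}
```

The sketch hard-codes `int` and `<`; the version in this PR has to work with response sets instead, which is why the "any" variant of the library is the relevant one.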

Bench:

```
goos: linux
goarch: amd64
pkg: github.com/thanos-io/thanos/pkg/store
cpu: Intel(R) Core(TM) i9-10885H CPU @ 2.40GHz
             │   oldkway   │               newkway               │
             │   sec/op    │    sec/op     vs base               │
KWayMerge-16   2.292m ± 3%   2.075m ± 15%  -9.47% (p=0.023 n=10)

             │   oldkway    │               newkway               │
             │     B/op     │     B/op      vs base               │
KWayMerge-16   1.553Mi ± 0%   1.585Mi ± 0%  +2.04% (p=0.000 n=10)

             │   oldkway   │              newkway               │
             │  allocs/op  │  allocs/op   vs base               │
KWayMerge-16   27.26k ± 0%   26.27k ± 0%  -3.66% (p=0.000 n=10)
```

pkg/losertree/tree.go

yeya24

I know it is a dumb question. May I know why we have to use the any version, not the main one?

GiedriusS · 2024-04-26T07:18:38Z

> I know it is a dumb question. May I know why we have to use the any version, not the main one?

Because the main version works on:

type Value constraints.Ordered

In other words, it only accepts types that support < and the other ordering operators (https://pkg.go.dev/golang.org/x/exp/constraints#Ordered). Custom types like storepb.SeriesResponse cannot satisfy that constraint.
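A small sketch of the difference, under some assumptions: it uses the standard library's `cmp.Ordered` (Go 1.21+) as a stand-in for `constraints.Ordered`, and `series`, `minOrdered` and `minFunc` are hypothetical names, not part of go-loser or Thanos. An ordered-constrained generic rejects struct types at compile time, while an `any`-constrained one works once the caller supplies the ordering.

```go
package main

import (
	"cmp"
	"fmt"
)

// minOrdered only accepts types with a built-in < operator
// (integers, floats, strings).
func minOrdered[E cmp.Ordered](a, b E) E {
	if b < a {
		return b
	}
	return a
}

// minFunc accepts any type because the caller supplies the ordering.
func minFunc[E any](a, b E, less func(x, y E) bool) E {
	if less(b, a) {
		return b
	}
	return a
}

// series is a hypothetical stand-in for a struct type such as
// storepb.SeriesResponse; it has no < operator.
type series struct {
	labels string
}

func main() {
	fmt.Println(minOrdered(3, 1))

	// minOrdered(series{labels: "a"}, series{labels: "b"})
	// does not compile: series does not satisfy cmp.Ordered.

	s := minFunc(series{labels: "b"}, series{labels: "a"},
		func(x, y series) bool { return x.labels < y.labels })
	fmt.Println(s.labels)
}
```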

yeya24 · 2024-04-26T16:27:34Z

pkg/store/proxy_merge.go

@@ -814,15 +754,15 @@ func (l *eagerRespSet) At() *storepb.SeriesResponse {
 		return nil
 	}

-	return l.bufferedResponses[l.i]
+	return l.bufferedResponses[l.i-1]


Is it because we call Next before At so we need to change to i-1?

Yeah, our dedup heap tree was a bit buggy: we were calling At() before Next() was ever called. The loser tree properly calls Next() first, hence this change.
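The `i-1` indexing follows from that calling order. Below is a minimal sketch of a buffered set that follows the Next-before-At protocol; `bufferedSet` and `drain` are hypothetical names, and this is not the actual `eagerRespSet` code (which, among other things, returns nil from At() in a guard visible in the hunk above).

```go
package main

import "fmt"

// bufferedSet is a hypothetical stand-in for a response set whose
// items are fully buffered before iteration starts.
type bufferedSet struct {
	items []string
	i     int // number of successful Next calls so far
}

// Next advances to the following item and reports whether one exists.
func (s *bufferedSet) Next() bool {
	if s.i >= len(s.items) {
		return false
	}
	s.i++
	return true
}

// At returns the current item. Because Next has already incremented i,
// the current item lives at index i-1. Calling At before the first
// successful Next would index out of range in this sketch.
func (s *bufferedSet) At() string {
	return s.items[s.i-1]
}

// drain consumes the set the way the merge does: Next first, then At.
func drain(s *bufferedSet) []string {
	var out []string
	for s.Next() {
		out = append(out, s.At())
	}
	return out
}

func main() {
	fmt.Println(drain(&bufferedSet{items: []string{"a", "b", "c"}}))
}
```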

pull-request-size bot added the size/XL label Apr 25, 2024

yeya24 reviewed Apr 26, 2024

yeya24 reviewed Apr 26, 2024

GiedriusS force-pushed the switch_to_losertree branch 4 times, most recently from bd65217 to 62faefb Compare April 26, 2024 13:42

GiedriusS force-pushed the switch_to_losertree branch from 62faefb to 9790323 Compare April 26, 2024 14:53

GiedriusS requested a review from yeya24 April 26, 2024 15:44

yeya24 reviewed Apr 26, 2024

yeya24 approved these changes Apr 26, 2024

yeya24 merged commit 6bf98f9 into main Apr 26, 2024
20 checks passed

GiedriusS deleted the switch_to_losertree branch April 26, 2024 19:48

GiedriusS mentioned this pull request May 31, 2024

*: pull pr #7304 vinted/thanos#108

Merged

GiedriusS added a commit to vinted/thanos that referenced this pull request May 31, 2024

Merge pull request #108 from vinted/pull_pr_7304

13610e4

*: pull pr thanos-io#7304
