
Conversation

@max-hoffman (Contributor) commented Apr 11, 2024

Add read planning for journal getMany so that we do sequential IO instead of random IO. Sequential IO has a large effect on disk-backed storage systems. Also parallelize getMany calls. The biggest change here is the semantics around how reads lock the journal file: reads now hold the journal lock for longer, which could lower write throughput in some cases. If we see evidence of this, we can do more work to limit how long batch reads hold the journal lock without interruption.
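
As a rough, standalone illustration of the read-planning idea (the `journalRead` type and `planReads` helper below are hypothetical names, not the nbs implementation): sort the requested records by their offset in the journal file and coalesce adjacent ranges, so the batch is served by a few large sequential reads instead of many small random ones. Each planned range can then be handed to a worker, which is where parallelizing getMany comes in.

```go
package main

import (
	"fmt"
	"sort"
)

// journalRead is a hypothetical (offset, length) range in the journal file.
type journalRead struct {
	offset uint64
	length uint64
}

// planReads sorts requests by file offset and merges ranges that touch or
// overlap, turning a random-access batch into a sequential read plan.
func planReads(reqs []journalRead) []journalRead {
	if len(reqs) == 0 {
		return nil
	}
	sort.Slice(reqs, func(i, j int) bool { return reqs[i].offset < reqs[j].offset })

	plan := []journalRead{reqs[0]}
	for _, r := range reqs[1:] {
		last := &plan[len(plan)-1]
		if r.offset <= last.offset+last.length {
			// Extend the previous range rather than issuing a new read.
			if end := r.offset + r.length; end > last.offset+last.length {
				last.length = end - last.offset
			}
		} else {
			plan = append(plan, r)
		}
	}
	return plan
}

func main() {
	reqs := []journalRead{{900, 50}, {0, 100}, {100, 200}, {950, 25}}
	fmt.Println(planReads(reqs)) // [{0 300} {900 75}]
}
```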

@max-hoffman (Contributor, Author)

#benchmark


@coffeegoddd (Contributor)

@max-hoffman DOLT

| test_name | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| tpcc-scale-factor-1 | 223.34 | 130.13 | 1 |

| test_name | server_name | server_version | tps | test_name | server_name | server_version | tps | is_faster |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| tpcc-scale-factor-1 | dolt | a6ce239 | 14.19 | tpcc-scale-factor-1 | dolt | c3af984 | 16.71 | 0 |

@coffeegoddd (Contributor)

@max-hoffman DOLT

comparing_percentages: 100.000000 to 100.000000

| version | result | total |
| --- | --- | --- |
| c3af984 | ok | 5937457 |

| version | total_tests |
| --- | --- |
| c3af984 | 5937457 |

correctness_percentage: 100.0

@coffeegoddd (Contributor)

@max-hoffman DOLT

| read_tests | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| covering_index_scan | 3.02 | 2.97 | 0 |
| groupby_scan | 17.63 | 17.63 | 0 |
| index_join | 5.28 | 5.18 | 0 |
| index_join_scan | 2.3 | 2.3 | 0 |
| index_scan | 55.82 | 55.82 | 0 |
| oltp_point_select | 0.54 | 0.54 | 0 |
| oltp_read_only | 8.58 | 8.58 | 0 |
| select_random_points | 0.84 | 0.84 | 0 |
| select_random_ranges | 0.99 | 0.99 | 0 |
| table_scan | 55.82 | 55.82 | 0 |
| types_table_scan | 164.45 | 161.51 | 0 |

| write_tests | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| oltp_delete_insert | 6.79 | 6.79 | 0 |
| oltp_insert | 3.36 | 3.43 | 0 |
| oltp_read_write | 16.41 | 16.41 | 0 |
| oltp_update_index | 3.49 | 3.49 | 0 |
| oltp_update_non_index | 3.43 | 3.43 | 0 |
| oltp_write_only | 7.84 | 7.98 | 0 |
| types_delete_insert | 7.56 | 7.56 | 0 |

max-hoffman changed the title from "[nbs] parallelize journal getMany" to "[nbs] getMany read planning and parallelize" on Apr 11, 2024
@max-hoffman (Contributor, Author)

#benchmark

max-hoffman requested a review from reltuk on April 11, 2024 at 21:50

@coffeegoddd (Contributor)

@max-hoffman DOLT

comparing_percentages: 100.000000 to 100.000000

| version | result | total |
| --- | --- | --- |
| f47c827 | ok | 5937457 |

| version | total_tests |
| --- | --- |
| f47c827 | 5937457 |

correctness_percentage: 100.0

@coffeegoddd (Contributor)

@max-hoffman DOLT

| test_name | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| tpcc-scale-factor-1 | 84.47 | 90.78 | 0 |

| test_name | server_name | server_version | tps | test_name | server_name | server_version | tps | is_faster |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| tpcc-scale-factor-1 | dolt | 5d5847f | 0.47 | tpcc-scale-factor-1 | dolt | f47c827 | 17.03 | -1 |

@coffeegoddd (Contributor)

@max-hoffman DOLT

| read_tests | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| covering_index_scan | 3.02 | 3.02 | 0 |
| groupby_scan | 17.63 | 17.95 | 0 |
| index_join | 5.28 | 5.28 | 0 |
| index_join_scan | 2.26 | 2.3 | 0 |
| index_scan | 54.83 | 54.83 | 0 |
| oltp_point_select | 0.53 | 0.53 | 0 |
| oltp_read_only | 8.58 | 8.58 | 0 |
| select_random_points | 0.84 | 0.84 | 0 |
| select_random_ranges | 0.99 | 0.99 | 0 |
| table_scan | 55.82 | 55.82 | 0 |
| types_table_scan | 161.51 | 158.63 | 0 |

| write_tests | from_latency_median | to_latency_median | is_faster |
| --- | --- | --- | --- |
| oltp_delete_insert | 6.79 | 6.79 | 0 |
| oltp_insert | 3.36 | 3.43 | 0 |
| oltp_read_write | 16.41 | 16.41 | 0 |
| oltp_update_index | 3.49 | 3.49 | 0 |
| oltp_update_non_index | 3.43 | 3.43 | 0 |
| oltp_write_only | 7.84 | 7.84 | 0 |
| types_delete_insert | 7.56 | 7.56 | 0 |

@coffeegoddd (Contributor)

@max-hoffman DOLT

comparing_percentages: 100.000000 to 100.000000

| version | result | total |
| --- | --- | --- |
| ba06fd4 | ok | 5937457 |

| version | total_tests |
| --- | --- |
| ba06fd4 | 5937457 |

correctness_percentage: 100.0

@coffeegoddd (Contributor)

@max-hoffman DOLT

comparing_percentages: 100.000000 to 100.000000

| version | result | total |
| --- | --- | --- |
| 8d0834c | ok | 5937457 |

| version | total_tests |
| --- | --- |
| 8d0834c | 5937457 |

correctness_percentage: 100.0

@coffeegoddd (Contributor)

@max-hoffman DOLT

comparing_percentages: 100.000000 to 100.000000

| version | result | total |
| --- | --- | --- |
| 6699fca | ok | 5937457 |

| version | total_tests |
| --- | --- |
| 6699fca | 5937457 |

correctness_percentage: 100.0

@reltuk (Contributor) left a comment

This looks good to me, other than the race in getMany. I don't know how to measure the impact on the lock duration here vs. what we used to have before...it's something we can always come back and fix though. I think this is good to go if we fix the race.

The simplest way to fix the race might be to move the getManyCompressed impl into a helper function with a signature something like:

```go
func (s journalChunkSource) getManyCompressed_ext(ctx context.Context, eg *errgroup.Group, reqs []getRecord, found func(context.Context, CompressedChunk) error, stats *Stats) (bool, error) {
```

and then to wrap it in both getMany and getManyCompressed...
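
For concreteness, here is a rough sketch of that wrapping under the signature above; the receiver, parameter types, and callback shapes are assumptions for illustration, not the exact Dolt interfaces:

```go
// Sketch only: journalChunkSource, getRecord, CompressedChunk, Stats, and the
// found-callback shapes are taken loosely from the signature above, not from
// the real code.
func (s journalChunkSource) getManyCompressed(ctx context.Context, eg *errgroup.Group, reqs []getRecord, found func(context.Context, CompressedChunk), stats *Stats) (bool, error) {
	// Compressed path: forward each chunk unchanged.
	return s.getManyCompressed_ext(ctx, eg, reqs, func(ctx context.Context, cc CompressedChunk) error {
		found(ctx, cc)
		return nil
	}, stats)
}

func (s journalChunkSource) getMany(ctx context.Context, eg *errgroup.Group, reqs []getRecord, found func(context.Context, *chunks.Chunk), stats *Stats) (bool, error) {
	// Uncompressed path: decompress inside the callback; a ToChunk error is
	// returned through the helper instead of via an extra eg.Go.
	return s.getManyCompressed_ext(ctx, eg, reqs, func(ctx context.Context, cc CompressedChunk) error {
		ch, err := cc.ToChunk()
		if err != nil {
			return err
		}
		found(ctx, &ch)
		return nil
	}, stats)
}
```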

```go
return s.getManyCompressed(ctx, eg, reqs, func(ctx context.Context, cc CompressedChunk) {
	ch, err := cc.ToChunk()
	if err != nil {
		eg.Go(func() error {
			// ...
```
Contributor

Unfortunately, I don't think this is race-free. In particular, I don't think it's safe to eg.Go() on an errgroup where eg.Wait() may have already been called.

Contributor

Actually, I think I might be wrong here...in particular, errgroup is implemented in terms of a sync.WaitGroup, and I thought all positive delta wg.Add() calls were dangerous after a wg.Wait(), but that's not true...

```go
// Note that calls with a positive delta that occur when the counter is zero
// must happen before a Wait. Calls with a negative delta, or calls with a
// positive delta that start when the counter is greater than zero, may happen
// at any time.
```
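
A minimal, standalone sketch of why that usage is in spec (illustrative code, not the Dolt implementation): the nested eg.Go calls only happen inside a goroutine that the same errgroup is already tracking, so the underlying WaitGroup counter is greater than zero when the nested Add starts, which is exactly the case the doc comment above allows.

```go
package main

import (
	"errors"
	"fmt"

	"golang.org/x/sync/errgroup"
)

func main() {
	var eg errgroup.Group

	eg.Go(func() error {
		// While this goroutine runs, the errgroup's WaitGroup counter is >= 1,
		// so the nested eg.Go calls below start with a nonzero counter and
		// cannot race with the eg.Wait in main.
		for i := 0; i < 3; i++ {
			i := i
			eg.Go(func() error {
				if i == 2 {
					return errors.New("simulated ToChunk failure")
				}
				return nil
			})
		}
		return nil
	})

	// Wait blocks until the outer goroutine and everything it spawned finish.
	fmt.Println(eg.Wait()) // simulated ToChunk failure
}
```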

Contributor

And by, "I think I might be wrong here"...I'm quite sure I'm wrong and your usage is safe / to spec. Sorry for the noise.

@max-hoffman (Contributor, Author)

We've talked about how this may negatively impact mixed workloads, which will now block on reads for longer. The tradeoff is that batch reads complete faster but are less interruptible, because a read holds the journal lock for longer. The win for table scans is pretty obvious; if we see issues with write throughput after adding this, we can follow up with changes that make batch reads hold the journal lock only for short periods of time.
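
If that follow-up ever becomes necessary, one possible shape for it (purely a hypothetical sketch; readRange, readPlanInterruptibly, and the RWMutex stand in for the real journal lock and read plan) is to execute the plan one bounded segment at a time and drop the lock between segments so a waiting writer can get in:

```go
package main

import (
	"fmt"
	"sync"
)

// readRange is a hypothetical planned (offset, length) segment of the journal.
type readRange struct {
	offset, length uint64
}

// readPlanInterruptibly executes a read plan one segment at a time, releasing
// the journal lock between segments so a waiting writer can interleave.
func readPlanInterruptibly(journalLock *sync.RWMutex, plan []readRange, readSegment func(readRange) error) error {
	for _, seg := range plan {
		journalLock.RLock()
		err := readSegment(seg)
		journalLock.RUnlock() // drop the lock so writers aren't blocked for the whole batch
		if err != nil {
			return err
		}
	}
	return nil
}

func main() {
	var lock sync.RWMutex
	plan := []readRange{{0, 4096}, {4096, 4096}}
	err := readPlanInterruptibly(&lock, plan, func(r readRange) error {
		fmt.Printf("read %d bytes at offset %d\n", r.length, r.offset)
		return nil
	})
	fmt.Println("err:", err)
}
```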

max-hoffman merged commit 463434e into main on Apr 24, 2024
max-hoffman deleted the max/parallelize-get-many branch on April 24, 2024 at 17:21