support sub compaction to speed up large compaction #70

Merged
6 commits merged into pingcap:master on Dec 20, 2018

Conversation

bobotu commented Dec 17, 2018

Use sub compaction to avoid write stalls caused by a large L1 -> L2 compaction blocking the L0 compaction, and to speed up the L0 -> L1 compaction.
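For context, a minimal sketch of what "sub compaction" means here: one large compaction is split into disjoint key ranges that are compacted concurrently, then the outputs are collected. The `keyRange`, `sstable`, and `runSubCompaction` names below are hypothetical illustrations, not the actual identifiers in levels.go.

```go
package compaction

import "sync"

// Hypothetical stand-ins for the real types in levels.go.
type keyRange struct{ start, end []byte }
type sstable struct{}

// runSubCompaction compacts only the keys in [r.start, r.end) and
// returns the tables it produced (placeholder body for the sketch).
func runSubCompaction(r keyRange) ([]*sstable, error) {
	return nil, nil
}

// runCompaction splits one large compaction into parallel sub compactions,
// one goroutine per key range, and gathers the resulting tables. Because
// the ranges are disjoint, the sub compactions never produce overlapping keys.
func runCompaction(ranges []keyRange) ([]*sstable, error) {
	var (
		wg       sync.WaitGroup
		mu       sync.Mutex
		newTbls  []*sstable
		firstErr error
	)
	for _, r := range ranges {
		wg.Add(1)
		go func(r keyRange) {
			defer wg.Done()
			tbls, err := runSubCompaction(r)
			mu.Lock()
			defer mu.Unlock()
			if err != nil && firstErr == nil {
				firstErr = err
			}
			newTbls = append(newTbls, tbls...)
		}(r)
	}
	wg.Wait()
	return newTbls, firstErr
}
```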

coocood (Member) commented Dec 17, 2018

Can we use only the bottom level bounds to split the compaction?
And since the bottom level is usually 10x the size of the top level, we can ignore the data size in the top level; then we don't need to implement approximate size in range.

bobotu (Author) commented Dec 18, 2018

@coocood This heuristic algorithm is adopted from RocksDB.

  1. Select boundaries based on the natural boundaries of the input levels/files:
    • the first and last key of each L0 file
    • the first and last key of each non-L0, non-last input level
    • the first key of each SST file in the last level
  2. Sort the boundaries and remove duplicates.
  3. Use ApproximateSize to estimate the data size between each pair of adjacent boundaries.
  4. Merge boundaries to eliminate empty and smaller-than-average ranges:
    • compute the average size across all ranges
    • starting from the beginning, greedily merge adjacent ranges until their total size exceeds the average

We should take the size of every input file into account, so we cannot ignore the bounds of L0 files. Because each SST file has nearly the same size, the boundaries added in step 1 roughly follow the data distribution of the whole input.

Suppose the L0 inputs are [a, d] [c, d] [e, j] and the L1 inputs are [a, f] [f, i] [i, k]. The result of step 2 is [a, c] [c, d] [d, e] [e, f] [f, i] [i, j] [j, k]. As you can see, the large range [a, f] is split into many smaller ranges.

Then we estimate the size of each small range, compute the target size of each sub compaction, and merge the small ranges into larger ones.

This algorithm is not exact, because we cannot split the whole input evenly without iterating over it, so we use some heuristic rules here, and they have worked out well so far (see the sketch below).
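To make steps 1–4 concrete, here is a minimal Go sketch of the boundary heuristic, assuming hypothetical `table`, `smallest`/`biggest`, and `approximateSizeInRange` helpers (the real identifiers in levels.go differ):

```go
package compaction

import (
	"bytes"
	"sort"
)

// table is a hypothetical SST descriptor; approximateSizeInRange stands in
// for an index-based size estimate over [start, end).
type table struct{ smallest, biggest []byte }

func approximateSizeInRange(start, end []byte) int64 { return 0 /* estimate from index */ }

// buildSubCompactionRanges assumes both input slices are non-empty and
// sorted; it returns the key ranges to hand to each sub compaction.
func buildSubCompactionRanges(topTables, botTables []*table) [][2][]byte {
	// Step 1: collect natural boundaries from the input files.
	var bounds [][]byte
	for _, t := range topTables { // e.g. L0: first and last key of each file
		bounds = append(bounds, t.smallest, t.biggest)
	}
	for _, t := range botTables { // last level: first key of each file
		bounds = append(bounds, t.smallest)
	}
	bounds = append(bounds, botTables[len(botTables)-1].biggest)

	// Step 2: sort and deduplicate.
	sort.Slice(bounds, func(i, j int) bool { return bytes.Compare(bounds[i], bounds[j]) < 0 })
	uniq := bounds[:1]
	for _, b := range bounds[1:] {
		if !bytes.Equal(b, uniq[len(uniq)-1]) {
			uniq = append(uniq, b)
		}
	}

	// Step 3: estimate the data size between each pair of adjacent boundaries.
	sizes := make([]int64, len(uniq)-1)
	var total int64
	for i := range sizes {
		sizes[i] = approximateSizeInRange(uniq[i], uniq[i+1])
		total += sizes[i]
	}
	if len(sizes) == 0 {
		return nil
	}

	// Step 4: greedily merge adjacent ranges (this also swallows empty
	// ranges) until each merged range exceeds the average size.
	avg := total / int64(len(sizes))
	var ranges [][2][]byte
	start, acc := uniq[0], int64(0)
	for i, sz := range sizes {
		acc += sz
		if acc > avg || i == len(sizes)-1 {
			ranges = append(ranges, [2][]byte{start, uniq[i+1]})
			start, acc = uniq[i+1], 0
		}
	}
	return ranges
}
```

On the example above, steps 1–2 produce the sorted, deduplicated boundaries a, c, d, e, f, i, j, k, i.e. exactly the seven small ranges shown, which step 4 then coalesces.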

BTW, RocksDB disables sub compaction for non-L0 levels, but I enabled it for L1 when the compaction touches more than 10 SSTs; otherwise a large L1 -> L2 compaction may block the L0 -> L1 compaction.
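A hedged sketch of that policy, continuing the hypothetical package above (illustrative names, not the actual levels.go code):

```go
// useSubCompaction reports whether a compaction out of thisLevel should be
// split into sub compactions: always for L0 -> L1, and for L1 -> L2 only
// when the compaction touches more than 10 SSTs.
func useSubCompaction(thisLevel, numInputTables int) bool {
	if thisLevel == 0 {
		return true
	}
	return thisLevel == 1 && numInputTables > 10
}
```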

ngaut (Member) commented Dec 19, 2018

Ping @coocood

coocood (Member) commented Dec 19, 2018

@bobotu
We can keep the size estimation and remove the L0 bounds.

coocood (Member) commented Dec 20, 2018

LGTM

coocood merged commit 48654df into pingcap:master on Dec 20, 2018
bobotu deleted the compaction branch on August 10, 2020