Optimize performance for `UnionScanExec` and `MemBuffer` #43249

lcwangchao · 2023-04-20T09:32:55Z

Enhancement

The UnionScan is not very efficient, for example:

UnionScanExec will filter all rows/indexes in Open instead of Next. It will take more time when the SQL has a LIMIT because Open will not considerate it.
Maybe we can implementMemBuffer more efficient.

The union scan's performance will affect the performance of temporary table, cached table and queries in a txn with a lot of uncommitted rows.

The text was updated successfully, but these errors were encountered:

tiancaiamao · 2023-06-21T02:37:07Z

cd executor;
 go test -run XXX -bench BenchmarkUnionScan -cpuprofile cpu.out -benchtime 45s

The inefficency comes from too parts:

Open() always drain all the data even it might be useless in the Limit scenario. It's better to make it a streaming API
The decode / encode and row format translation cost. When loading, we decode kv -> row in []Datum representation. and when merging, []Datum -> []Datum, and finally output is translated from row representation to chunk, i.e. []Datum -> chunk

…44874) ref #43249

#45051) ref #43249

ref #43249

ekexium · 2024-05-14T08:47:04Z

Hey @lcwangchao do you have more elaboration in "Maybe we can implementMemBuffer more efficient."? The flamegraph shows that time spent in memdb is quite little. Did you mean codec-related work?

tiancaiamao · 2024-05-16T01:18:20Z

Hey @lcwangchao do you have more elaboration in "Maybe we can implementMemBuffer more efficient."? The flamegraph shows that time spent in memdb is quite little. Did you mean codec-related work?

After my previous optimization, much of that had been improved.

#55987) ref #43249

ref #43249

…exReader (#56006) ref #43249

lcwangchao added type/enhancement The issue or PR belongs to an enhancement. sig/execution SIG execution sig/sql-infra SIG: SQL Infra labels Apr 20, 2023

tiancaiamao self-assigned this Jun 19, 2023

tiancaiamao mentioned this issue Jun 21, 2023

executor: union scan refactor, introduce the the mem rows iterator #44874

Merged

12 tasks

ti-chi-bot bot pushed a commit that referenced this issue Jun 27, 2023

executor: union scan refactor, introduce the the mem rows iterator (#…

9d42922

…44874) ref #43249

This was referenced Jun 29, 2023

executor: union scan refactor, optimize the membuffer decoding process #45051

Merged

code improve: support ExtraPhysTblID in UnionScan #45052

Closed

ti-chi-bot bot pushed a commit that referenced this issue Jul 6, 2023

executor: union scan refactor, optimize the membuffer decoding process (

fbbd9b0

#45051) ref #43249

tiancaiamao mentioned this issue Aug 3, 2023

executor: union scan refactor for index read scenario #45786

Merged

12 tasks

ti-chi-bot bot pushed a commit that referenced this issue Nov 15, 2023

executor: union scan refactor for index read scenario (#45786)

1c8d383

ref #43249

This was referenced Sep 6, 2024

executor: support memRowsIter for memIndexLookUpReader #55922

Merged

tablecodec: optimize DecodeIndexHandle to avoid unnecessary allocation #55987

Merged

executor,tablecodec: optimize decodeIndexKeyValue function for memIndexReader #56006

Merged

ti-chi-bot bot pushed a commit that referenced this issue Sep 11, 2024

tablecodec: optimize DecodeIndexHandle to avoid unnecessary allocation (

d82f06b

#55987) ref #43249

ti-chi-bot bot pushed a commit that referenced this issue Sep 12, 2024

executor: support memRowsIter for memIndexLookUpReader (#55922)

7aa0105

ref #43249

ti-chi-bot bot pushed a commit that referenced this issue Sep 14, 2024

executor,tablecodec: optimize decodeIndexKeyValue function for memInd…

fa8fd91

…exReader (#56006) ref #43249

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize performance for `UnionScanExec` and `MemBuffer` #43249

Optimize performance for `UnionScanExec` and `MemBuffer` #43249

lcwangchao commented Apr 20, 2023

tiancaiamao commented Jun 21, 2023

ekexium commented May 14, 2024

tiancaiamao commented May 16, 2024

Optimize performance for UnionScanExec and MemBuffer #43249

Optimize performance for UnionScanExec and MemBuffer #43249

Comments

lcwangchao commented Apr 20, 2023

Enhancement

tiancaiamao commented Jun 21, 2023

ekexium commented May 14, 2024

tiancaiamao commented May 16, 2024

Optimize performance for `UnionScanExec` and `MemBuffer` #43249

Optimize performance for `UnionScanExec` and `MemBuffer` #43249