docs/design: add the table partition proposal #7969

tiancaiamao · 2018-10-19T17:38:40Z

What problem does this PR solve?

Add a table partition design proposal document #7907

@CaitinChen @shenli

This change is

CaitinChen · 2018-10-22T03:53:22Z

OK. I will review it later~

docs/design/2018-10-19-table-partition.md

tiancaiamao · 2018-11-08T11:05:39Z

Comment addressed, PTAL, thanks @CaitinChen

docs/design/2018-10-19-table-partition.md

tiancaiamao · 2018-11-12T05:47:27Z

PTAL @CaitinChen

CaitinChen

LGTM

tiancaiamao · 2018-11-12T12:20:02Z

/run-all-tests
PTAL @shenli

tiancaiamao · 2018-11-13T05:17:20Z

CI fail, blocked by #8287

tiancaiamao · 2018-11-14T10:55:50Z

/run-all-tests

tiancaiamao · 2018-11-15T09:16:45Z

/run-all-tests

tiancaiamao · 2018-11-15T09:56:26Z

PTAL @shenli

tiancaiamao · 2018-11-16T03:06:17Z

PTAL @shenli

tiancaiamao · 2018-11-19T02:21:18Z

PTAL @shenli

zz-jason · 2018-11-19T03:05:51Z

/run-all-tests

tiancaiamao · 2018-11-20T02:32:47Z

PTAL @shenli

shenli · 2018-11-20T03:05:40Z

docs/design/2018-10-19-table-partition.md

+
+## Background
+
+MySQL has the [table partition](https://dev.mysql.com/doc/refman/8.0/en/partitioning.html) feature. If this feature is supported in TiDB, many issues could be addressed. For example, drop ranged partitions could be used to remove old data; partition by hash could address the hot data issue and thus the write performance is improved; query on partitioned tables could be faster than on manual sharding tables because of the partition pruning.


query on partitioned tables could be faster than unpartitioned tables because of the partition pruning.

shenli · 2018-11-20T03:09:42Z

docs/design/2018-10-19-table-partition.md

+There are two levels of mapping in TiDB: SQL data to a logical key range, logical key range to the physical storage.
+When TiDB maps table data to key-value storage, it first encodes `table id + row id` as key, row data as value. Then the logical key range is split into regions and stored into TiKV.
+
+Table partition works on the first level of mapping, partition ID will be made equivalent of the table ID. A partitioned table row uses `partition id + row id` as encoded key.


Please mention that PartitionID is unique in cluster scope.

shenli · 2018-11-20T03:11:27Z

docs/design/2018-10-19-table-partition.md

+select * from p3 where id < 30)
+```
+
+During the logical optimization phase, the `DataSource` plan is translated into `UnionAll`, and then each partition generates its own `TableReader` in the physical optimization phase.


DataSource plan -> DataSource operator
UnionAll -> UnionAll operator

shenli · 2018-11-20T03:27:18Z

docs/design/2018-10-19-table-partition.md

+The drawbacks of this implementation are:
+
+* If the table has many partitions, there will be many readers, and then the `explain` result is not friendly to the user
+* The `UnionAll` executor cannot keep the results in order, so if some executor needs ordered results such as `IndexReader`, an extra `Sort` executor is needed


We are talking about plan phrase, so there is no executor.

shenli · 2018-11-20T03:29:36Z

docs/design/2018-10-19-table-partition.md

+
+### How to write to the partitioned table
+
+All the write operation calls functions like `table.AddRecord` eventually, so implementing the write operation on a partitioned table simply implements this interface method on the `PartitionedTable` struct.


singular? or plural？

All the write operation calls

tiancaiamao · 2018-11-20T04:37:04Z

Comment addressed @shenli

shenli · 2018-11-20T05:16:04Z

LGTM

This reverts commit 012cb6d.

docs/design: add the table partition proposal

131d756

tiancaiamao added the component/docs label Oct 19, 2018

CaitinChen reviewed Oct 24, 2018

View reviewed changes

address comment

581919c

CaitinChen reviewed Nov 8, 2018

View reviewed changes

CaitinChen reviewed Nov 12, 2018

View reviewed changes

tiancaiamao added the status/LGT1 Indicates that a PR has LGTM 1. label Nov 12, 2018

tiancaiamao added the status/all tests passed label Nov 15, 2018

shenli self-requested a review November 19, 2018 02:22

shenli reviewed Nov 20, 2018

View reviewed changes

tiancaiamao added 2 commits November 20, 2018 12:21

address comment

7a40b2d

Merge branch 'master' into proposal

91e26ef

tiancaiamao force-pushed the proposal branch from 89b2117 to 91e26ef Compare November 20, 2018 04:24

Merge branch 'master' into proposal

1572868

shenli approved these changes Nov 20, 2018

View reviewed changes

tiancaiamao added 2 commits November 20, 2018 15:55

Merge branch 'master' into proposal

71688c1

Merge branch 'master' into proposal

1187816

tiancaiamao merged commit 012cb6d into pingcap:master Nov 20, 2018

tiancaiamao deleted the proposal branch November 20, 2018 08:20

lysu added a commit that referenced this pull request Nov 20, 2018

Revert "docs/design: add the table partition proposal (#7969)"

71ed235

This reverts commit 012cb6d.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs/design: add the table partition proposal #7969

docs/design: add the table partition proposal #7969

tiancaiamao commented Oct 19, 2018 •

edited by c4pt0r

Loading

CaitinChen commented Oct 22, 2018

tiancaiamao commented Nov 8, 2018

tiancaiamao commented Nov 12, 2018

CaitinChen left a comment

tiancaiamao commented Nov 12, 2018

tiancaiamao commented Nov 13, 2018 •

edited

Loading

tiancaiamao commented Nov 14, 2018

tiancaiamao commented Nov 15, 2018

tiancaiamao commented Nov 15, 2018

tiancaiamao commented Nov 16, 2018

tiancaiamao commented Nov 19, 2018

zz-jason commented Nov 19, 2018

tiancaiamao commented Nov 20, 2018

shenli Nov 20, 2018

shenli Nov 20, 2018

shenli Nov 20, 2018

shenli Nov 20, 2018

shenli Nov 20, 2018

tiancaiamao commented Nov 20, 2018

shenli commented Nov 20, 2018


		## Background

		MySQL has the [table partition](https://dev.mysql.com/doc/refman/8.0/en/partitioning.html) feature. If this feature is supported in TiDB, many issues could be addressed. For example, drop ranged partitions could be used to remove old data; partition by hash could address the hot data issue and thus the write performance is improved; query on partitioned tables could be faster than on manual sharding tables because of the partition pruning.


		### How to write to the partitioned table

		All the write operation calls functions like `table.AddRecord` eventually, so implementing the write operation on a partitioned table simply implements this interface method on the `PartitionedTable` struct.

docs/design: add the table partition proposal #7969

docs/design: add the table partition proposal #7969

Conversation

tiancaiamao commented Oct 19, 2018 • edited by c4pt0r Loading

What problem does this PR solve?

CaitinChen commented Oct 22, 2018

tiancaiamao commented Nov 8, 2018

tiancaiamao commented Nov 12, 2018

CaitinChen left a comment

Choose a reason for hiding this comment

tiancaiamao commented Nov 12, 2018

tiancaiamao commented Nov 13, 2018 • edited Loading

tiancaiamao commented Nov 14, 2018

tiancaiamao commented Nov 15, 2018

tiancaiamao commented Nov 15, 2018

tiancaiamao commented Nov 16, 2018

tiancaiamao commented Nov 19, 2018

zz-jason commented Nov 19, 2018

tiancaiamao commented Nov 20, 2018

shenli Nov 20, 2018

Choose a reason for hiding this comment

shenli Nov 20, 2018

Choose a reason for hiding this comment

shenli Nov 20, 2018

Choose a reason for hiding this comment

shenli Nov 20, 2018

Choose a reason for hiding this comment

shenli Nov 20, 2018

Choose a reason for hiding this comment

tiancaiamao commented Nov 20, 2018

shenli commented Nov 20, 2018

tiancaiamao commented Oct 19, 2018 •

edited by c4pt0r

Loading

tiancaiamao commented Nov 13, 2018 •

edited

Loading