tools: update descriptions of handling sharding ddl locks manually #876

yikeke · 2019-01-28T06:53:45Z

Via: pingcap/tidb-tools#161
Comment addressed. PTAL. @lilin90 @csuzhangxc @GregoryIan

Please help fix the broken link in this file, thanks. @lilin90

Via: pingcap/tidb-tools#161

IANTHEREAL · 2019-01-30T03:08:31Z

we had add a dm directory #873, can you put this file in it @yikeke

tools/manually-handle-sharding-ddl-locks.md

yikeke · 2019-01-30T05:08:28Z

we had add a dm directory #873, can you put this file in it @yikeke

Sure.

tools/manually-handling-sharding-ddl-locks.md

csuzhangxc · 2019-01-30T08:44:21Z

tools/manually-handling-sharding-ddl-locks.md

+- `remove-id`: flag; string; `--remove-id`; optional; if being specified, it should be the ID of some DDL lock; if not being specified, remove the corresponding DDL lock information only when the breaking operation succeeds; if being specified, compulsorily remove the DDL lock information 
+- `exec`: flag; boolean; `--exec`; optional; cannot be specified simultaneously with the `--skip` parameter; if being specified, ask the DM-worker to execute the corresponding DDL of the lock 
+- `skip`: flag; boolean; `--skip`; optional; cannot be specified simultaneously with the `--exec` parameter; if being specified, ask the DM-worker to skip the corresponding DDL of the lock 
+- `task-name`: non-flag; string; not optional; specify the name of the task containing the lock that is going to execute the breaking operation (you can check if a task contains the lock via [query-status](../task-handling/query-status.md))


the link seems is broken.

I'll doublecheck it later.

have you checked?

All the links should work well after shard-merge.md is merged. There are two broken links for now.

tools/manually-handling-sharding-ddl-locks.md

IANTHEREAL · 2019-01-30T11:03:51Z

tools/manually-handling-sharding-ddl-locks.md

+» show-ddl-locks test
+{
+    "result": true,                                        # show if the locking process succeeds
+    "msg": "",                                             # show the reason for the locking process failure or other descriptive information (for example, the locking task does not exist)


the reason 🤔 is additional message better?

Sounds fair, I'll change it.

IANTHEREAL · 2019-01-30T11:10:35Z

tools/manually-handling-sharding-ddl-locks.md

+    "locks": [                                             # the lock information list on the DM-master
+        {
+            "ID": "test-`shard_db`.`shard_table`",         # the ID of the lock, which is composed of the current task name and the corresponding schema/table information of the DDL
+            "task": "test",                                # the task name of the lock


lock belongs to task

Thanks, then it should be:

the lock ID, which is made up of the current task name and the schema/table information corresponding to the DDL

the name of the task to which the lock belongs

Sounds right to you?

IANTHEREAL · 2019-01-30T11:12:24Z

tools/manually-handling-sharding-ddl-locks.md

+        {
+            "ID": "test-`shard_db`.`shard_table`",         # the ID of the lock, which is composed of the current task name and the corresponding schema/table information of the DDL
+            "task": "test",                                # the task name of the lock
+            "owner": "127.0.0.1:8262",                     # the owner of the lock (the first DM-worker which receives the DDL event)


remove (the first DM-worker which receives the DDL event), it is not completely accurate

IANTHEREAL · 2019-01-30T11:45:32Z

tools/manually-handling-sharding-ddl-locks.md

+
+##### Variables description
+
+- `worker`: flag; string; `--worker`; optional; can be specified multiple times; if not being specified, send requests for all DM-workers that are waiting for the lock to skip the DDL; if being specified, send requests for the specified DM-worker to skip the DDL


let all dm-worker to skip the ddl， except for owner?

IANTHEREAL · 2019-01-30T11:52:43Z

tools/manually-handling-sharding-ddl-locks.md

+```bash
+» unlock-ddl-lock test-`shard_db`.`shard_table`
+{
+    "result": true,                                        # show if the unlocking process succeeds


is it right? @csuzhangxc

inaccuracy. true means successful, but false can also be successful. some descriptions provided in the Some DM-workers go offline scenario.

So can I just say "the result of the unlocking process"? Should we specify what "true" and "false" means here? @GregoryIan @csuzhangxc

I think it's unnecessary. @yikeke

IANTHEREAL · 2019-01-30T12:01:06Z

tools/manually-handling-sharding-ddl-locks.md

+
+Before `DM-master` tries to automatically unlock the sharding DDL lock, all the DM-workers need to receive the sharding DDL event (for details, see [shard merge principles](./shard-merge.md#Principles)). If the sharding DDL event is already in the synchronization process, and some DM-workers have gone offline and are not to be restarted (these DM-workers have been removed according to the application demand), then the sharding DDL lock cannot be automatically synchronized and unlocked because not all the DM-workers can receive the DDL event.
+
+> If you do not need to make some DM-workers offline in the process of synchronizing sharding DDL events, a better solution is to use `stop-task` to stop the running tasks first, make the DM-workers go offline, remove the corresponding configuration information from the task configuration file, and finally use `start-task` and the new task configuration to restart the synchronization task. 


do not need to => need to?

Yes, it should be: "If you need to make some DM-workers offline when not in the process of synchronizing sharding DDL events".

…ally-handling-sharding-ddl-locks.md

lilin90 · 2019-02-03T09:56:11Z

Since the file in this PR has links that point to the file added in #887, this PR can only be merged after #887 is merged. @GregoryIan @csuzhangxc

IANTHEREAL · 2019-02-11T07:31:48Z

tools/dm/manually-handling-sharding-ddl-locks.md

+    "workers": [                                           # the result list of the DDL execution/skipping operation of each DM-worker
+        {
+            "result": true,                                # the result of the DDL execution/skipping operation
+            "worker": "127.0.0.1:8262",                    # the address of the DM-worker (the DM-worker ID)


I think # the DM-worker ID is enough, below is same

IANTHEREAL · 2019-02-11T07:43:26Z

tools/dm/manually-handling-sharding-ddl-locks.md

+#### Variables description
+
+- `worker`: flag; string; `--worker`; required; specify the DM-worker which needs to execute the breaking operation
+- `remove-id`: flag; string; `--remove-id`; optional; if being specified, it should be the ID of some DDL lock; if not being specified, remove the corresponding DDL lock information only when the breaking operation succeeds; if being specified, compulsorily remove the DDL lock information 


remove-id is deprecated, we should indicate is deprecated

IANTHEREAL · 2019-02-11T08:05:47Z

tools/dm/manually-handling-sharding-ddl-locks.md

+6. Use `unlock-dll-lock` to ask `DM-master` to actively unlock the DDL lock.
+    - If the owner of the DDL lock has gone offline, you can use the parameter `--owner` to specify another DM-worker as the new owner to execute the DDL.
+    - If any DM-worker reports an error, `result` will be set to `false`, and at this point you should check carefully if the errors of each DM-worker is acceptable and within expectations.
+    - DM-workers that have gone offline will return the error `rpc error: code = Unavailable`, which is within expectations and can be neglected; but if other online DM-workers return errors, then you should deal with them based on the scenario.


I think it’s example of If any DM-worker reports an error, `result` will be set to `false`, and at this point you should check carefully if the errors of each DM-worker is acceptable and within expectations. should we put it under the statement at L208?

@GregoryIan Yes, updated.

IANTHEREAL · 2019-02-11T08:33:53Z

tools/dm/manually-handling-sharding-ddl-locks.md

+
+#### The reason for the abnormal lock
+
+It has the similar reason for the abnormal lock in [Some DM-workers restart during the DDL unlocking process](#scenario-2-some-dm-workers-restart-during-the-ddl-unlocking-process). If the DM-worker is temporarily unreachable when you ask the DM-worker to skip the DDL, this DM-worker might fail to skip the DDL.


same with Scenario 3, we can also use solution of Scenario 3 to solve problem of Scenario 2

correct it, the difference is dm-master doesn't have a lock in Scenario 3, but dm-master has a new lock lock in Scenario 2

IANTHEREAL

LGTM

csuzhangxc · 2019-02-11T10:22:04Z

LGTM

tools: update descriptions of handling sharding ddl locks manually

2f8211c

Via: pingcap/tidb-tools#161

csuzhangxc reviewed Jan 30, 2019

View reviewed changes

tools/manually-handle-sharding-ddl-locks.md Outdated Show resolved Hide resolved

tools/manually-handle-sharding-ddl-locks.md Outdated Show resolved Hide resolved

yikeke added 4 commits January 30, 2019 13:23

tools: add the comment

62a0e81

tools: remove the heading "feature"

9fa6167

tools: remove some definite articles

68ed700

tools: change the file name

44a53d7

csuzhangxc reviewed Jan 30, 2019

View reviewed changes

tools/manually-handling-sharding-ddl-locks.md Outdated Show resolved Hide resolved

tools/manually-handling-sharding-ddl-locks.md Outdated Show resolved Hide resolved

IANTHEREAL reviewed Jan 30, 2019

View reviewed changes

csuzhangxc mentioned this pull request Jan 30, 2019

README, dm: update the files that refer to "manually-handling-sharding-ddl-locks.md" #881

Closed

yikeke and others added 5 commits February 1, 2019 15:18

tools: address the comment

ff6acc1

tools: fix the broken links

fe82a5d

tools: update wording and fix format

b92162f

Rename tools/manually-handling-sharding-ddl-locks.md to tools/dm/manu…

b5de44c

…ally-handling-sharding-ddl-locks.md

tool/dm: fix several links

2f97342

lilin90 mentioned this pull request Feb 3, 2019

tools, media, readme: add DM shard merge #887

Merged

IANTHEREAL mentioned this pull request Feb 3, 2019

update DM document #874

Closed

8 tasks

IANTHEREAL reviewed Feb 11, 2019

View reviewed changes

lilin90 added 2 commits February 11, 2019 16:24

tools/dm: update wording, format, address comments

8cca829

tools/dm: update wording, format, address comments

d0e602f

tools/dm: add a blank line

75b204a

IANTHEREAL reviewed Feb 11, 2019

View reviewed changes

tools/dm: fix typo and add scenario difference

b52bb8a

IANTHEREAL approved these changes Feb 11, 2019

View reviewed changes

lilin90 merged commit 8f1bdb5 into pingcap:master Feb 11, 2019

lilin90 mentioned this pull request Feb 12, 2019

tools/dm: update links and refine list #891

Merged

yikeke deleted the manually-handle-sharding-DDL-lock branch June 27, 2019 05:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tools: update descriptions of handling sharding ddl locks manually #876

tools: update descriptions of handling sharding ddl locks manually #876

yikeke commented Jan 28, 2019 •

edited

Loading

IANTHEREAL commented Jan 30, 2019

yikeke commented Jan 30, 2019

csuzhangxc Jan 30, 2019

yikeke Jan 31, 2019

csuzhangxc Feb 1, 2019

yikeke Feb 1, 2019

IANTHEREAL Jan 30, 2019

yikeke Jan 31, 2019

IANTHEREAL Jan 30, 2019

yikeke Jan 31, 2019 •

edited

Loading

csuzhangxc Jan 31, 2019

IANTHEREAL Jan 30, 2019 •

edited

Loading

yikeke Jan 31, 2019

IANTHEREAL Jan 30, 2019 •

edited

Loading

IANTHEREAL Jan 30, 2019

csuzhangxc Jan 30, 2019

yikeke Jan 31, 2019

lilin90 Feb 11, 2019 •

edited

Loading

IANTHEREAL Jan 30, 2019

yikeke Jan 31, 2019

lilin90 commented Feb 3, 2019

IANTHEREAL Feb 11, 2019

IANTHEREAL Feb 11, 2019

IANTHEREAL Feb 11, 2019 •

edited

Loading

lilin90 Feb 11, 2019

IANTHEREAL Feb 11, 2019

IANTHEREAL Feb 11, 2019

IANTHEREAL left a comment

csuzhangxc commented Feb 11, 2019


		##### Variables description

		- `worker`: flag; string; `--worker`; optional; can be specified multiple times; if not being specified, send requests for all DM-workers that are waiting for the lock to skip the DDL; if being specified, send requests for the specified DM-worker to skip the DDL


		Before `DM-master` tries to automatically unlock the sharding DDL lock, all the DM-workers need to receive the sharding DDL event (for details, see [shard merge principles](./shard-merge.md#Principles)). If the sharding DDL event is already in the synchronization process, and some DM-workers have gone offline and are not to be restarted (these DM-workers have been removed according to the application demand), then the sharding DDL lock cannot be automatically synchronized and unlocked because not all the DM-workers can receive the DDL event.

		> If you do not need to make some DM-workers offline in the process of synchronizing sharding DDL events, a better solution is to use `stop-task` to stop the running tasks first, make the DM-workers go offline, remove the corresponding configuration information from the task configuration file, and finally use `start-task` and the new task configuration to restart the synchronization task.


		#### The reason for the abnormal lock

		It has the similar reason for the abnormal lock in [Some DM-workers restart during the DDL unlocking process](#scenario-2-some-dm-workers-restart-during-the-ddl-unlocking-process). If the DM-worker is temporarily unreachable when you ask the DM-worker to skip the DDL, this DM-worker might fail to skip the DDL.

tools: update descriptions of handling sharding ddl locks manually #876

tools: update descriptions of handling sharding ddl locks manually #876

Conversation

yikeke commented Jan 28, 2019 • edited Loading

IANTHEREAL commented Jan 30, 2019

yikeke commented Jan 30, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yikeke Jan 31, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

IANTHEREAL Jan 30, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

IANTHEREAL Jan 30, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lilin90 Feb 11, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lilin90 commented Feb 3, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

IANTHEREAL Feb 11, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

IANTHEREAL left a comment

Choose a reason for hiding this comment

csuzhangxc commented Feb 11, 2019

yikeke commented Jan 28, 2019 •

edited

Loading

yikeke Jan 31, 2019 •

edited

Loading

IANTHEREAL Jan 30, 2019 •

edited

Loading

IANTHEREAL Jan 30, 2019 •

edited

Loading

lilin90 Feb 11, 2019 •

edited

Loading

IANTHEREAL Feb 11, 2019 •

edited

Loading