
tidb_batch_insert should not hit executor OOM quota #20597

Closed · opened Oct 23, 2020 by ghost · 5 comments · Fixed by #20866
Labels: component/executor · severity/major · type/bug (the issue is confirmed as a bug)

ghost commented Oct 23, 2020

Bug Report

Please answer these questions before submitting your issue. Thanks!

1. Minimal reproduce step (Required)

My understanding is that tidb_batch_insert is intentionally undocumented. However, I think there is a valid use case for incrementally committing data. See: #18038

This bug report is about tidb_batch_insert. Consider the following sample data:

DROP TABLE IF EXISTS t1;
CREATE TABLE t1 (
 pk VARBINARY(36) NOT NULL PRIMARY KEY,
 b BIGINT NOT NULL,
 c BIGINT NOT NULL,
 pad VARBINARY(2048),
 INDEX (b),
 INDEX (c)
);
INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM dual;
INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;
INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;
INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;

If I use the MySQL command line client, I can select 1 billion rows with:

$ mysql test --quick -e "SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;" > /dev/null

(The --quick option is required to prevent a client OOM. It retrieves one row at a time from the server.)
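
To illustrate the same point from a driver, here is a minimal Go sketch that streams the result set one row at a time, which is what --quick does. The go-sql-driver/mysql import and the DSN are assumptions for a local TiDB listening on port 4000:

package main

import (
	"database/sql"
	"fmt"

	_ "github.com/go-sql-driver/mysql" // assumed driver
)

func main() {
	// Assumed DSN for a local TiDB instance.
	db, err := sql.Open("mysql", "root@tcp(127.0.0.1:4000)/test")
	if err != nil {
		panic(err)
	}
	defer db.Close()

	// database/sql streams the result set: each Next() reads from the wire,
	// so the client never buffers all rows (the --quick behavior).
	rows, err := db.Query("SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c")
	if err != nil {
		panic(err)
	}
	defer rows.Close()

	var n int64
	for rows.Next() {
		var pk, b, c, pad string
		if err := rows.Scan(&pk, &b, &c, &pad); err != nil {
			panic(err)
		}
		n++
	}
	fmt.Println("rows read:", n, "err:", rows.Err())
}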

The memory usage for the above query remains stable, as observed both in top and in select * from information_schema.processlist:

mysql> select *, format_bytes(mem) from information_schema.processlist\G
*************************** 1. row ***************************
               ID: 7
             USER: root
             HOST: 127.0.0.1
               DB: NULL
          COMMAND: Query
             TIME: 0
            STATE: autocommit
             INFO: select *, format_bytes(mem) from information_schema.processlist
           DIGEST: 1ca560039f00cd76cc5aa081d57a76777af29383eae786850ca4a920e360773b
              MEM: 64540
         TxnStart: 
format_bytes(mem): 63.03 KiB
*************************** 2. row ***************************
               ID: 6
             USER: root
             HOST: 127.0.0.1
               DB: test
          COMMAND: Query
             TIME: 745
            STATE: autocommit
             INFO: SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c
           DIGEST: 93c940d9486707218fc41341aa7c108788312daebcac7af39a40b7efa8aeb2ca
              MEM: 12350624
         TxnStart: 10-22 19:56:07.101(420326451996524548)
format_bytes(mem): 11.78 MiB
2 rows in set (0.00 sec)

However, if I set enable-batch-dml = true in my tidb.toml file and execute the same query as an INSERT ... SELECT, it hits the query OOM quota, even with tidb_dml_batch_size=20000 and tidb_batch_insert=1:

SET tidb_dml_batch_size=20000;
SET tidb_batch_insert=1;
INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;

Output:

mysql> INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c;
ERROR 1105 (HY000): Out Of Memory Quota![conn_id=9]

During this time, I can see the memory growing at a constant rate:

mysql> select *, format_bytes(mem) from information_schema.processlist where id = 9\G
*************************** 1. row ***************************
               ID: 9
             USER: root
             HOST: 127.0.0.1
               DB: test
          COMMAND: Query
             TIME: 40
            STATE: autocommit
             INFO: INSERT INTO t1 SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c
           DIGEST: 2279da97a82e38809b19698286f9991190b943460983a7cf4212132d319fa7d1
              MEM: 905574977
         TxnStart: 10-22 20:10:46.751(420326682591494147)
format_bytes(mem): 863.62 MiB
1 row in set (0.00 sec)

2. What did you expect to see? (Required)

Because tidb_dml_batch_size is 20K, I expect memory usage to remain stable as TiDB incrementally commits each batch.

The expected result is that this query should be able to run for however long it takes to insert 1B rows.
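
For comparison, the incremental-commit behavior I expect can be emulated client-side by streaming the SELECT and committing the INSERT in chunks, so no single transaction grows without bound. A rough Go sketch, assuming the same go-sql-driver/mysql setup as above; the chunk size is illustrative and kept below the 65535-placeholder limit of prepared statements:

package main

import (
	"database/sql"
	"strings"

	_ "github.com/go-sql-driver/mysql" // assumed driver
)

// Illustrative chunk size; kept small because a prepared statement is
// limited to 65535 placeholders (4 per row here).
const chunkRows = 1000

// flush writes one chunk as a single multi-row INSERT under autocommit,
// so the server can release the transaction's memory at every commit.
func flush(db *sql.DB, args []interface{}) error {
	if len(args) == 0 {
		return nil
	}
	n := len(args) / 4
	query := "INSERT INTO t1 (pk, b, c, pad) VALUES " +
		strings.TrimSuffix(strings.Repeat("(?,?,?,?),", n), ",")
	_, err := db.Exec(query, args...)
	return err
}

func main() {
	db, err := sql.Open("mysql", "root@tcp(127.0.0.1:4000)/test") // assumed DSN
	if err != nil {
		panic(err)
	}
	defer db.Close()

	// Stream the source rows one at a time, as in the --quick example.
	rows, err := db.Query("SELECT uuid(), FLOOR(RAND()*5), FLOOR(RAND()*1000000), HEX(RANDOM_BYTES(1000)) FROM t1 a JOIN t1 b JOIN t1 c")
	if err != nil {
		panic(err)
	}
	defer rows.Close()

	args := make([]interface{}, 0, chunkRows*4)
	for rows.Next() {
		var pk, b, c, pad string
		if err := rows.Scan(&pk, &b, &c, &pad); err != nil {
			panic(err)
		}
		args = append(args, pk, b, c, pad)
		if len(args) == chunkRows*4 {
			if err := flush(db, args); err != nil {
				panic(err)
			}
			args = args[:0]
		}
	}
	if err := rows.Err(); err != nil {
		panic(err)
	}
	if err := flush(db, args); err != nil { // write the final partial chunk
		panic(err)
	}
}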

3. What did you see instead (Required)

ERROR 1105 (HY000): Out Of Memory Quota![conn_id=9]

4. What is your TiDB version? (Required)

mysql> SELECT tidb_version()\G
*************************** 1. row ***************************
tidb_version(): Release Version: v4.0.0-beta.2-1407-ge30220099
Edition: Community
Git Commit Hash: e302200999045d81b19d08d5ec240d3438674abb
Git Branch: master
UTC Build Time: 2020-10-22 03:05:47
GoVersion: go1.13
Race Enabled: false
TiKV Min Version: v3.0.0-60965b006877ca7234adaced7890d7b029ed1306
Check Table Before Drop: false
1 row in set (0.00 sec)
ghost added the type/bug and component/executor labels on Oct 23, 2020
ghost (Author) commented Oct 23, 2020

@jackysp PTAL, thanks!

SunRunAway (Contributor) commented

@cfzjywxk PTAL

jackysp (Member) commented Oct 27, 2020

Sorry for the late reply. I think we forgot to Consume the transaction size when finishing one batch.
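
If that diagnosis is right, the fix is to give the bytes of each committed batch back to the tracker, typically by calling Consume with a negative delta. A minimal sketch of the pattern, using a simplified stand-in type rather than TiDB's actual memory-tracker API:

package main

import "fmt"

// tracker is a simplified stand-in for TiDB's memory tracker: Consume adds
// to the tracked total and trips past the quota, and a negative delta
// releases memory, mirroring how the real tracker is used.
type tracker struct {
	consumed int64
	quota    int64
}

func (t *tracker) Consume(delta int64) {
	t.consumed += delta
	if t.consumed > t.quota {
		panic("Out Of Memory Quota!")
	}
}

func main() {
	t := &tracker{quota: 1 << 30} // 1 GiB, like mem-quota-query

	const batches = 10000
	const batchBytes = 20000 * 1100 // ~20k rows of ~1.1 KB each (illustrative)

	for i := 0; i < batches; i++ {
		t.Consume(batchBytes) // the membuf grows while the batch is built

		// ... commit the batch here ...

		// The fix: release what the committed batch held. Without this
		// line the tracked total only grows, and a long INSERT ... SELECT
		// with tidb_batch_insert=1 eventually trips the quota.
		t.Consume(-batchBytes)
	}
	fmt.Println("final tracked bytes:", t.consumed)
}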

ti-srebot (Contributor) commented Nov 11, 2020

Please edit this comment to complete the following information

Bug

Note: Make sure that the 'component' and 'severity' labels are added
Example of how to fill out the template: #20100

1. Root Cause Analysis (RCA) (optional)

We forgot to Consume the transaction size (release it from the memory tracker) when finishing one batch.

2. Symptom (optional)

tidb_batch_insert can still exceed the memory quota even when the size of each batch is lower than the quota.

3. All Trigger Conditions (optional)

Use tidb_batch_insert to insert a large amount of data.

4. Workaround (optional)

5. Affected versions

6. Fixed versions

master

ti-srebot (Contributor) commented

( AffectedVersions ) fields are empty.
