Issue: Long SQL Causing TiKV gRPC to Hang #55845

WalterWj · 2024-09-04T08:35:08Z

Bug Report

Please answer these questions before submitting your issue. Thanks!

1. Minimal reproduce step (Required)

Background:

The SQL query is select * from t where id in (?), with many values in the IN clause, such as 10,000 entries. The SQL length is several MBs. During execution, TiKV experiences leader drop incidents, and monitoring shows server report failures. There is a noticeable leader drop.

After optimizing the SQL by reducing the contents of the IN clause, the issue no longer occurs. This effectively reduces the request size between TiDB and TiKV.

Problem Description:

This is a known issue. During interactions between TiDB and TiKV, if the request is too large, TiKV's gRPC threads become busy with frequent deserialization of large messages, causing the threads to hang. This affects the sending of other messages, such as heartbeat packets between the leader and followers of a Region.

This issue is caused by business reasons and can be reproduced in older versions, indicating that it exists across versions. The upgrade triggered this phenomenon. Many factors, including data and leader distribution, can affect the requests sent to TiKV, making it difficult to pinpoint specific changes. Restarting might also be a factor. However, the root cause is clear: this is an edge case. Future considerations may include enhancements to address it.

2. What did you expect to see? (Required)

3. What did you see instead (Required)

4. What is your TiDB version? (Required)

seiya-annie · 2024-09-06T04:10:39Z

/report customer

WalterWj added the type/bug The issue is confirmed as a bug. label Sep 4, 2024

ti-chi-bot bot added the report/customer Customers have encountered this bug. label Sep 6, 2024

jebter added component/tikv severity/major labels Sep 6, 2024

ti-chi-bot bot added may-affects-5.4 This bug maybe affects 5.4.x versions. may-affects-6.1 may-affects-6.5 may-affects-7.1 may-affects-7.5 may-affects-8.1 labels Sep 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue: Long SQL Causing TiKV gRPC to Hang #55845

Issue: Long SQL Causing TiKV gRPC to Hang #55845

WalterWj commented Sep 4, 2024

seiya-annie commented Sep 6, 2024

Issue: Long SQL Causing TiKV gRPC to Hang #55845

Issue: Long SQL Causing TiKV gRPC to Hang #55845

Comments

WalterWj commented Sep 4, 2024

Bug Report

1. Minimal reproduce step (Required)

2. What did you expect to see? (Required)

3. What did you see instead (Required)

4. What is your TiDB version? (Required)

seiya-annie commented Sep 6, 2024