Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TiCDC ResolvedTs stuck for half an hour #10847

Open
hongyunyan opened this issue Mar 26, 2024 · 3 comments
Open

TiCDC ResolvedTs stuck for half an hour #10847

hongyunyan opened this issue Mar 26, 2024 · 3 comments
Assignees
Labels
affects-7.5 affects-8.1 area/ticdc Issues or PRs related to TiCDC. severity/major type/bug The issue is confirmed as a bug.

Comments

@hongyunyan
Copy link
Collaborator

hongyunyan commented Mar 26, 2024

What did you do?

Add sysbench workload in a cluster with lots of jxx tables, which leading to some create table ddls.

What did you expect to see?

ticdc works smoothly, with reasonable resolve ts and checkpoint ts

What did you see instead?

resolve ts and checkpoint ts stuck for half an hour.

Based on the log and metrics, we find it's due to the incremental scan cost too many times. And for the incremental scanning, the cpu is reach 100%.

[2024/03/25 07:50:18.403 +00:00] [INFO] [initializer.rs:545] ["cdc incremental scan uses ts filter: true"] [tables=13] [filtered_versions=41367935] [mvcc_versions=41367935] [hint_min_ts=448620924264776010] [region_id=63851] [thread_id=215]

img_v3_029b_093d19b7-48c6-4594-bb7b-14c5e800740g

Versions of the cluster

Upstream TiDB cluster version (execute SELECT tidb_version(); in a MySQL client):

(paste TiDB cluster version here)

Upstream TiKV version (execute tikv-server --version):

(paste TiKV version here)

TiCDC version (execute cdc version):

(paste TiCDC version here)
@hongyunyan hongyunyan added type/bug The issue is confirmed as a bug. area/ticdc Issues or PRs related to TiCDC. labels Mar 26, 2024
@hicqu
Copy link
Contributor

hicqu commented Mar 26, 2024

Seems duplicate with tikv/tikv#16601.

@hicqu
Copy link
Contributor

hicqu commented Mar 26, 2024

Seems the slow scan region is the last one in whole key space:

[2024/03/25 09:59:58.475 +00:00] [INFO] [region.go:740] ["region Version changed"] [region-id=63851] [detail="StartKey Changed:{7480000000000033FF6E5F72B1E04F919AFFD0F6830000000000FA} -> {7480000000000033FF725F698000000000FF00000103C1179678FFE93A796903C11796FF78E93A7969000000FC}, EndKey:{748000FFFFFFFFFFFFF900000000000000F8}"] [old-version=4570] [new-version=4572]

@fubinzh
Copy link

fubinzh commented Mar 27, 2024

/severity major

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
affects-7.5 affects-8.1 area/ticdc Issues or PRs related to TiCDC. severity/major type/bug The issue is confirmed as a bug.
Projects
Status: Need Triage
Development

No branches or pull requests

5 participants