PD can't redistribute the hot write regions among TiFlash nodes #3261
Closed
Description
Bug Report
What did you do?
I deployed a cluster with 1 TiDB + 1 PD + 1 TiKV + 2 TiFlash, based on version v4.0.8.
TiFlash and its proxy are both built from their "store_stats_4.0" branches. These two branches fix the problem that the written bytes and written keys at the store level are not reported to PD.
What did you expect to see?
If the write pressure between TiFlash nodes is not balanced, PD generates some move-hot-write-region operators to balance it.
What did you see instead?
After running a sysbench workload on this cluster, I found that:
- move-hot-write-region operators between TiFlash stores are rarely generated.
- While running the "oltp_update_index" workload, the write pressure between the two TiFlash nodes is imbalanced: one node takes about 10 times the writes of the other. But PD still did not generate move-hot-write-region operators between the TiFlash stores.
Another problem, which may or may not be related:
I use the PD API /pd/api/v1/hotspot/regions/write to check the stats of hot write regions. For a TiFlash node, the flow bytes obtained by summing over all of its regions are about 4 times what TiFlash itself reports.
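For anyone who wants to reproduce the comparison, here is a minimal sketch of how the per-region flow bytes can be summed per store. It assumes the response of /pd/api/v1/hotspot/regions/write has top-level "as_peer" / "as_leader" maps keyed by store ID, each holding a "statistics" list whose entries carry a "flow_bytes" field. That layout is my reading of the v4.0 output and is not guaranteed for other versions. The helper names `sum_flow_bytes_by_store` and `fetch_hot_write_regions` are made up for this example.

```python
import json
from urllib.request import urlopen


def sum_flow_bytes_by_store(hotspot, role="as_peer"):
    """Sum the per-region flow_bytes for each store in a hot-write response.

    `hotspot` is the parsed JSON body; the field names used here
    ("as_peer", "statistics", "flow_bytes") are assumptions about the
    response layout, not a documented contract.
    """
    totals = {}
    for store_id, store_stat in (hotspot.get(role) or {}).items():
        peers = store_stat.get("statistics") or []
        totals[str(store_id)] = sum(p.get("flow_bytes", 0.0) for p in peers)
    return totals


def fetch_hot_write_regions(pd_addr):
    """Fetch hot write region stats from PD, e.g. pd_addr='http://127.0.0.1:2379'."""
    with urlopen(pd_addr + "/pd/api/v1/hotspot/regions/write") as resp:
        return json.load(resp)
```

Calling `sum_flow_bytes_by_store(fetch_hot_write_regions("http://127.0.0.1:2379"))` gives one total per store ID, which can then be compared by hand with the written bytes that the TiFlash store reports to PD.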
What version of PD are you using (pd-server -V)?
> ./pd-server -V
Release Version: v4.0.8
Edition: Community
Git Commit Hash: 775b6a5ef517f8ab2f43fef6418bbfc7d6c9c9dc
Git Branch: heads/refs/tags/v4.0.8
UTC Build Time: 2020-10-30 08:15:09