Skip to content

PD can't redistribute the hot write regions among TiFlash nodes #3261

Closed
@JaySon-Huang

Description

Bug Report

What did you do?

I deployed a cluster with 1 TiDB + 1 PD + 1 TiKV + 2 TiFlash base on version v4.0.8.

The TiFlash and its proxy branch are "store_stats_4.0". These two branches fix the problem that the written bytes and written keys at the store level are not reported to PD.

What did you expect to see?

If the write-pressure between TiFlash nodes is not balanced, PD can generate some move-hot-write-region operators and make it balanced.

What did you see instead?

By adding a sysbench workload on this cluster, I found that:

  • move-hot-write-region between TiFlash store rarely happen.
  • In the phase of running the "oltp_update_index" workload, the writes-pressure between two TiFlash nodes is imbalanced, one is about 10 times to another node. But PD still did not generate move-hot-write-region between TiFlash stores.

Another problem, maybe related or not:
I use the PD API: /pd/api/v1/hotspot/regions/write to check the stats of hot write regions. In the TiFlash node, the flow bytes by summing all regions is about 4 times to the TiFlash reported.

What version of PD are you using (pd-server -V)?

>  ./pd-server -V
Release Version: v4.0.8
Edition: Community
Git Commit Hash: 775b6a5ef517f8ab2f43fef6418bbfc7d6c9c9dc
Git Branch: heads/refs/tags/v4.0.8
UTC Build Time:  2020-10-30 08:15:09

Metadata

Assignees

No one assigned

    Labels

    type/enhancementThe issue or PR belongs to an enhancement.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions