Large Query Influence the Resource Group Stability #50831

@nolouch

Background

The users are running a stable workload and the RU limit works fine. But once a query's plan changes, or a new query arrives that consumes a large number of RUs, the limit can become unstable:

The query runs on tidb-0 and consumes a lot of RUs:

[four images in the original issue]

As the picture shows, after the problem query finishes, normal queries can only recover at the refill rate, because the debt has to be paid back first. This increases latency for a long time.
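The effect described above can be illustrated with a toy token bucket whose balance is allowed to go negative. This is only a sketch under stated assumptions, not TiDB's actual resource control code: the class `DebtBucket`, its fields, and all numbers are hypothetical, and it assumes a query is charged after it has run.

```python
from dataclasses import dataclass


@dataclass
class DebtBucket:
    """Toy token bucket whose balance may go negative (debt). Hypothetical model."""
    refill_rate: float   # RUs added per second (assumed value, not a real setting)
    capacity: float      # maximum number of RUs the bucket can hold
    tokens: float = 0.0  # current balance; negative means debt

    def consume(self, ru: float) -> None:
        # Assumption: cost is charged after execution, so a large query
        # can push the balance far below zero.
        self.tokens -= ru

    def refill(self, seconds: float) -> None:
        # Add tokens at the refill rate, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + self.refill_rate * seconds)

    def seconds_until_available(self, ru: float) -> float:
        # Time a request of `ru` must wait until the balance covers it,
        # assuming nothing else consumes tokens in the meantime.
        deficit = ru - self.tokens
        return max(0.0, deficit / self.refill_rate)


bucket = DebtBucket(refill_rate=1000.0, capacity=1000.0, tokens=1000.0)
bucket.consume(61_000.0)                     # one large query
wait = bucket.seconds_until_available(10.0)  # a normal 10-RU query
print(f"balance={bucket.tokens:.0f} RU, wait={wait:.2f}s")
```

In this model a single 61,000-RU query leaves a 60,000-RU debt, so a small query has to wait about a minute at 1,000 RU/s. If the normal workload keeps consuming tokens during that time, recovery would take even longer.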

Issues

  • This behavior is actually by design, but it is hard to explain to users, who may expect normal queries to recover quickly.
  • Different TiDB instances are affected to varying degrees. If the debt could be shared, the overall situation might be better.
  • Some queries also report "Exceed the limit" after the large query has executed.
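To make the debt-sharing idea concrete, here is a rough back-of-the-envelope sketch. It assumes, purely for illustration, that the group's fill rate is split evenly across three TiDB instances and that each instance pays back its own debt at its own share of the rate. The function `recovery_seconds` and all numbers are hypothetical.

```python
def recovery_seconds(debts, fill_rates):
    """Seconds each instance needs to pay back its debt at its own fill rate."""
    return [debt / rate for debt, rate in zip(debts, fill_rates)]


GROUP_FILL_RATE = 900.0  # hypothetical RU/s for the whole resource group
INSTANCES = 3            # hypothetical number of TiDB instances
# Assumption: the group fill rate is divided evenly between instances.
per_instance_rate = [GROUP_FILL_RATE / INSTANCES] * INSTANCES

large_query_debt = 18_000.0  # hypothetical debt left by the large query

# Case 1: the debt stays on the instance that ran the large query (tidb-0).
local = recovery_seconds([large_query_debt, 0.0, 0.0], per_instance_rate)

# Case 2: the debt is spread evenly across all instances.
shared = recovery_seconds(
    [large_query_debt / INSTANCES] * INSTANCES, per_instance_rate
)

print("local debt :", local)
print("shared debt:", shared)
```

Under these assumptions, keeping the debt local stalls tidb-0 for 60 seconds while the other instances are unaffected. Sharing it shortens the worst case to 20 seconds, at the cost of every instance being slowed down.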

Metadata

    Labels

    type/enhancement: The issue or PR belongs to an enhancement.
