Speed up health check #216

djshow832 · 2023-02-10T06:12:10Z

Background

Currently, the health check is serial.

For a cluster with N TiDB instances, the maximum overall interval is 3s + 5s + (3 * 2 * 2s + 2 * 2 * 1s) * N = 8s + 16N s

If the graceful-wait-before-shutdown of TiDB is set to this duration, then it's too slow for scale-in or upgrading.

One possible way is to add a goroutine pool to do the health check.

djshow832 self-assigned this Feb 10, 2023

djshow832 removed their assignment Sep 7, 2023

djshow832 added the enhancement label Jan 7, 2024

This was referenced Apr 7, 2024

Multi-Factor-Based Balance #465

Closed

router: speed up health check by checking in parallel #498

Merged

ti-chi-bot bot closed this as completed in #498 Apr 8, 2024