Description
When a data node finishes recovering a shard it notifies the master to move it to state STARTED
. Today we repeat this request every time we receive a cluster state that hasn't updated the shard state yet:
This behaviour means if the master is busy processing (potentially thousands) of other URGENT
tasks then we'll submit the same task repeatedly (potentially thousands of times). It dates back a long time but is no longer necessary: we can trust that the master will process our original request first (or we get notified that it failed). We should stop sending these unnecessary retries.
Relates #77466