Closed
Description
We observed some cases (#50241 for example) where a data node responding slowly can cause accumulation of ResponseContexts for indices:monitor/recovery[n], indices:monitor/stats[n], cluster:monitor/stats[n] and cluster:monitor/xpack/ml/job/stats/get[n], which correspond to _xpack/usage and _nodes/stats calls.
We would like to improve the robustness of the stats and usage calls when data nodes respond slowly by
- introducing a timeout on the stats and usage APIs and/or
- making the stats and usage API tasks cancellable and cancelling them if the REST client disconnects
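
To illustrate the two options, here is a minimal, language-neutral sketch in Python's asyncio. It is not Elasticsearch code: `fetch_node_stats`, `collect_stats`, and `demo_cancel_on_disconnect` are hypothetical names, and the per-node request is simulated with a sleep. It only shows the intended behaviour: a per-node timeout turns a slow node into a reported failure instead of a pending response context, and cancelling the coordinating task (as would happen when the REST client disconnects) also cancels the outstanding per-node requests.

```python
import asyncio


async def fetch_node_stats(node, delay):
    # Hypothetical stand-in for a per-node stats request; `delay` simulates
    # how long the data node takes to respond.
    await asyncio.sleep(delay)
    return {"node": node, "ok": True}


async def collect_stats(node_delays, timeout):
    # Option 1: fan out to every node, but bound each request by `timeout`
    # so a slow node is reported as a failure instead of blocking the call.
    async def one(node, delay):
        try:
            return await asyncio.wait_for(fetch_node_stats(node, delay), timeout)
        except asyncio.TimeoutError:
            return {"node": node, "ok": False, "reason": "timed out"}

    return await asyncio.gather(*(one(n, d) for n, d in node_delays.items()))


async def demo_cancel_on_disconnect():
    # Option 2: treat the whole stats call as a cancellable task. Cancelling
    # it (simulating a client disconnect) propagates to the per-node requests.
    task = asyncio.create_task(collect_stats({"slow": 10.0}, timeout=30.0))
    await asyncio.sleep(0.01)  # let the fan-out start
    task.cancel()              # the client went away
    try:
        await task
    except asyncio.CancelledError:
        return True            # nothing is left waiting on the slow node
    return False
```

Running `asyncio.run(collect_stats({"fast": 0.0, "slow": 5.0}, timeout=0.05))` returns one successful entry and one timed-out entry rather than waiting for the slow node. The two options are complementary: the timeout bounds how long any single node can hold resources, while cancellation releases them as soon as nobody is waiting for the result.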