-
Notifications
You must be signed in to change notification settings - Fork 25.6k
Closed
Labels
:Data Management/MonitoringTeam:Data ManagementMeta label for data/management teamMeta label for data/management teamresiliency
Description
We observed some cases (#50241 for example) where a data node responding slowly can cause accumulation of ResponseContexts for indices:monitor/recovery[n], indices:monitor/stats[n], cluster:monitor/stats[n] and cluster:monitor/xpack/ml/job/stats/get[n] which correspond to _xpack/usage and _nodes/stats calls.
We would like to improve robustness of stats and usage call in case of a slowly responding data nodes by
- introducing timeout on stats and usage APIs and/or
- making stats and usage APIs tasks cancellable and cancel them if the REST client disconnects
Metadata
Metadata
Assignees
Labels
:Data Management/MonitoringTeam:Data ManagementMeta label for data/management teamMeta label for data/management teamresiliency