You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(alerts): KubeNodePressure and KubeNodeEviction (#1014)
* feat: create alert "KubeletEvictingPods"
Signed-off-by: TheRealNoob <mike1118@live.com>
* fix syntax
* move to resources.libsonnet
Signed-off-by: TheRealNoob <mike1118@live.com>
* add selector filter
Signed-off-by: TheRealNoob <mike1118@live.com>
* move {{cluster}} injection
Co-authored-by: Stephen Lang <skl@users.noreply.github.com>
* redo alerts
Signed-off-by: TheRealNoob <mike1118@live.com>
* update runbook
Signed-off-by: TheRealNoob <mike1118@live.com>
* add tests
Signed-off-by: TheRealNoob <mike1118@live.com>
* fix tests
Signed-off-by: TheRealNoob <mike1118@live.com>
* fix "smelly selector" syntax preference
this turned out to be a good chance because it made me realize there was an additional label value here that wasn't be handled.
Signed-off-by: TheRealNoob <mike1118@live.com>
* Update alerts/kubelet.libsonnet
Co-authored-by: Stephen Lang <skl@users.noreply.github.com>
* Update alerts/kubelet.libsonnet
Co-authored-by: Stephen Lang <skl@users.noreply.github.com>
* add test KubeNodePressure
Signed-off-by: TheRealNoob <mike1118@live.com>
* chore: make --always-make markdownfmt
* rename KubeNodeEviction, fix test case
Signed-off-by: TheRealNoob <mike1118@live.com>
* remove in-progress change
Signed-off-by: TheRealNoob <mike1118@live.com>
* Update kubelet.libsonnet
update KubeNodeEviction query
* Update tests.yaml
update KubeNodeEviction test case
* update test KubeNodeEviction
Co-authored-by: Stephen Lang <skl@users.noreply.github.com>
---------
Signed-off-by: TheRealNoob <mike1118@live.com>
Co-authored-by: Stephen Lang <skl@users.noreply.github.com>
Co-authored-by: Stephen Lang <stephen.lang@grafana.com>
* on (%(clusterLabel)s, instance) group_left(node)
132
+
max by (%(clusterLabel)s, instance, node) (
133
+
kubelet_node_name{%(kubeletSelector)s}
134
+
)
135
+
> %(KubeNodeEvictionRateThreshold)s
136
+
||| % $._config,
137
+
labels: {
138
+
severity:'info',
139
+
},
140
+
'for': '0s',
141
+
annotations: {
142
+
description:'Node {{ $labels.node }}%s is evicting Pods due to {{ $labels.eviction_signal }}. Eviction occurs when eviction thresholds are crossed, typically caused by Pods exceeding RAM/ephemeral-storage limits.' % [
143
+
utils.ifShowMultiCluster($._config, ' on {{ $labels.%(clusterLabel)s }}' % $._config),
+*Message*: `Node {{ $labels.node }} is evicting Pods due to {{ $labels.eviction_signal }}. Eviction occurs when eviction thresholds are crossed, typically caused by Pods exceeding RAM/ephemeral-storage limits.`
description: 'Node minikube is evicting Pods due to memory.available. Eviction occurs when eviction thresholds are crossed, typically caused by Pods exceeding RAM/ephemeral-storage limits.'
0 commit comments