[Draft] K8s otel overview dashboard #10910

tetianakravchenko · 2024-08-28T08:01:23Z

Proposed commit message

Checklist

I have reviewed tips for building integrations and this pull request is aligned with them.
I have verified that all data streams collect metrics or logs.
I have added an entry to my package's changelog.yml file.
I have verified that Kibana version constraints are current according to guidelines.

Author's Checklist

[ ]

How to test this PR locally

Related issues

Screenshots

Current state:

TODO:

decisions:

not change manifests: pods allocatable and node conditions
cronjob/jobs are not included in the workload (for now)

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

elasticmachine · 2024-09-04T09:57:48Z

💔 Build Failed

Buildkite Build
Commit: bc96b68

Failed CI Steps

:pipeline:⬆️ Upload Pipeline: .buildkite/pipeline.yml

History

💔 Build #15201 failed e50ef24
💔 Build #15140 failed 6ab7e04
💔 Build #15133 failed 8f7dd00
💔 Build #15104 failed 6eba660

ChrsMark

A couple of comments for alignment with the spec.

Could we try to be consistent with the terminology, i.e. usage vs utilization vs pct? Our panel titles etc should be aligned with the semantic convention definitions. Hence we should only use usage or utilization where it applies (no need for pct, utilization is a ratio anyways).
Could we consider leveraging the limits' utilization metrics as well? There was strong push-back by the community when we introduced the *.node.utilization metrics so relying only these might be risky in case the community decides to deprecate them. From my perspective these are nice to haves but we can rely to *.cpu.usage ones and the limit based utilizations.
Also in the panels it would be nice if we explicitly mention what utilization we display (ratio against the node's capacity VS ratio against the limits).

ChrsMark · 2024-09-12T07:05:50Z

packages/kubernetes/kibana/dashboard/kubernetes-d7bf6834-3f14-45c5-a17c-638f13e793ed.json

+                                                        "query": "\"metrics.k8s.node.cpu.usage\": *"
+                                                    },
+                                                    "isBucketed": false,
+                                                    "label": "CPU usage Pct",


k8s.node.cpu.usage metric represents the number of cores used in a time window, hence the term Pct is not accurate here. Or there is a specific reason for mentioning this?

ChrsMark · 2024-09-12T07:07:52Z

packages/kubernetes/kibana/dashboard/kubernetes-d7bf6834-3f14-45c5-a17c-638f13e793ed.json

+                                                        "query": "\"metrics.k8s.pod.cpu.node.utilization\": *"
+                                                    },
+                                                    "isBucketed": false,
+                                                    "label": "Pod CPU Usage ",


Isn't this CPU utilization instead of usage? Also maybe you can consider explicitly mentioning that this utilization is against the Node's capacity and maybe add another graph for the limit based utilizations?

tetianakravchenko · 2024-10-07T13:39:56Z

All comments were addressed in #11310, closing this PR

tetianakravchenko added 2 commits August 21, 2024 14:55

init commit

ca1ef4f

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

update dashboard

6eba660

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

andrewkroh added the Integration:kubernetes Kubernetes label Aug 28, 2024

tetianakravchenko added 2 commits August 28, 2024 16:29

add status viz

8f7dd00

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

add number of nodes and ns; group information

6ab7e04

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

tetianakravchenko requested a review from mlunadia August 29, 2024 09:03

add latest changes

e50ef24

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

andrewkroh added the dashboard Relates to a Kibana dashboard bug, enhancement, or modification. label Aug 30, 2024

tetianakravchenko added 2 commits September 4, 2024 11:54

push latest changes

93f4e74

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

add otel overview dashboard

bc96b68

Signed-off-by: Tetiana Kravchenko <tetiana.kravchenko@elastic.co>

ChrsMark reviewed Sep 12, 2024

View reviewed changes

ChrsMark mentioned this pull request Sep 23, 2024

[kubernetes OTEL] Add kubernetes OTEL package #11137

Merged

4 tasks

tetianakravchenko mentioned this pull request Oct 2, 2024

[Kubernetest OTEL] Follow up enhancements #11310

Merged

4 tasks

tetianakravchenko closed this Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft] K8s otel overview dashboard #10910

[Draft] K8s otel overview dashboard #10910

tetianakravchenko commented Aug 28, 2024 •

edited

Loading

elasticmachine commented Sep 4, 2024

ChrsMark left a comment •

edited

Loading

ChrsMark Sep 12, 2024

ChrsMark Sep 12, 2024

tetianakravchenko commented Oct 7, 2024

[Draft] K8s otel overview dashboard #10910

[Draft] K8s otel overview dashboard #10910

Conversation

tetianakravchenko commented Aug 28, 2024 • edited Loading

Proposed commit message

Checklist

Author's Checklist

How to test this PR locally

Related issues

Screenshots

elasticmachine commented Sep 4, 2024

💔 Build Failed

Failed CI Steps

History

ChrsMark left a comment • edited Loading

Choose a reason for hiding this comment

ChrsMark Sep 12, 2024

Choose a reason for hiding this comment

ChrsMark Sep 12, 2024

Choose a reason for hiding this comment

tetianakravchenko commented Oct 7, 2024

tetianakravchenko commented Aug 28, 2024 •

edited

Loading

ChrsMark left a comment •

edited

Loading