[receiver/vcenter] Add vCenter Host metrics (dropped packet rate + capacity) #33646

BominRahmani · 2024-06-19T03:37:37Z

Description:

This PR adds the following metrics:

vcenter.host.network.packet.drop.rate
vcenter.host.cpu.capacity
vcenter.host.cpu.reserve.capacity

These metrics can be found in the following links respectively:
errorTx and errorRx
reservedCapacity and totalCapacity

Link to tracking Issue: #33607

Testing:
Tested against a live environment to scrape added metrics, and updated golden test files.

Documentation:
Updated documentation through mdatagen.

StefanKurek

Just a few small things. Also needs the scraper test updated with new results.

receiver/vcenterreceiver/metadata.yaml

StefanKurek · 2024-06-25T12:11:26Z

receiver/vcenterreceiver/metrics.go

 	// disk metrics
 	"disk.totalReadLatency.average",
 	"disk.totalWriteLatency.average",
 	"disk.maxTotalLatency.latest",
 	"disk.read.average",
 	"disk.write.average",
+	// cpu metrics
+	"cpu.reservedCapacity.average",
+	"cpu.totalCapacity.average",


Do you know how different this number ends up (if at all) from numCpuCores * cpuMhz?

Yeah, I've been trying to figure this out. From what I understand, totalCapacity would be the more accurate metric to measure here. In most cases the two numbers should be very close (totalCapacity usually a bit lower), because numCpuCores might be artificially inflated by logical cores created by hyper-threading. Beyond that, the numbers should be very similar, if not the same.

After talking with Stefan a bit more on this topic, we found that totalCapacity as reported by these performance metrics is a bit different from what someone would normally expect (numCpuCores * cpuMhz, as shown in the vSphere client).
The difference is that the total capacity calculated from numCpuCores and cpuMhz describes how much CPU capacity the host itself has.

The performance metric totalCapacity refers to the total reservation capacity,

and the performance metric reservedCapacity refers to the "used reservation" portion of that total reservation capacity.

In the end, we think the performance metric and the quickstat metric are both very useful, depending on the user's use case and setup, so we will include both.
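
To make the comparison discussed above concrete, here is a minimal Go sketch. It is not code from this PR: the function names and the sample numbers are hypothetical, and it assumes all values are expressed in MHz. It computes the nominal capacity (numCpuCores * cpuMhz) and, separately, the share of the total reservation capacity that is currently reserved.

```go
package main

import "fmt"

// nominalCPUCapacityMHz returns the capacity a user would calculate by hand:
// number of CPU cores multiplied by the per-core clock speed in MHz.
// Hypothetical helper, not part of the receiver.
func nominalCPUCapacityMHz(numCPUCores, cpuMHz int64) int64 {
	return numCPUCores * cpuMHz
}

// reservationUtilization returns the fraction of the total reservation
// capacity that is already reserved. The boolean is false when the total
// is not positive, so callers never divide by zero.
func reservationUtilization(reservedMHz, totalMHz int64) (float64, bool) {
	if totalMHz <= 0 {
		return 0, false
	}
	return float64(reservedMHz) / float64(totalMHz), true
}

func main() {
	// Made-up sample values for illustration only.
	nominal := nominalCPUCapacityMHz(16, 2400)
	fmt.Printf("nominal capacity: %d MHz\n", nominal)

	if ratio, ok := reservationUtilization(8000, 32000); ok {
		fmt.Printf("reservation used: %.1f%%\n", ratio*100)
	}
}
```

The two results answer different questions, which is why the thread concludes that both values are worth reporting.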

receiver/vcenterreceiver/scraper_test.go

receiver/vcenterreceiver/metadata.yaml

StefanKurek

Should be good to move from DRAFT. I would still like to know the difference between the total CPU performance metric and the one you could calculate, though.

StefanKurek · 2024-06-27T13:28:26Z

receiver/vcenterreceiver/client.go

-	if err == nil {
-		return vApps, nil
+// ResourcePoolInventoryListObjects returns the ResourcePools (with populated InventoryLists) of the vSphere SDK
+func (vc *vcenterClient) ResourcePoolInventoryListObjects(


Actually, I'd say this PR is blocked until that other PR is merged in and you can rebase (this function, for example, isn't quite right at the moment).

StefanKurek · 2024-06-27T17:13:23Z

receiver/vcenterreceiver/metadata.yaml

+      if_enabled_not_set: "this metric will be enabled by default starting in release v0.105.0"
+  vcenter.host.cpu.reserve.capacity:
+    enabled: false
+    description: Total CPU capacity that is available for reserve or reserved by virtual machines.


I think the description and name of this one has some room for improvement (still sounds a bit confusing).

What do you think about something like vcenter.host.cpu.reserved for the name? I know the performance metric names have "capacity" in them, but the UI equivalents do not. "Capacity" seems to only make sense for "total" and not "used" to me.

Then the description could be something like "The CPU of the host reserved for use by virtual machines."

The attribute could be changed to cpu_reservation_type with values of total and used. Its description could be "The type of CPU reservation for the host."

These are all just suggestions, but what do you think @BominRahmani ?

I was actually going back and forth between vcenter.host.cpu.reserved and vcenter.host.cpu.reserve.capacity. I only chose the latter because it felt a bit more verbose. I was also originally thinking about switching the cpu_reservation_type to total and used to match the UI a bit more, so I am totally OK with these changes.
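
As a rough illustration of the naming agreed on in this thread, the sketch below models one metric, vcenter.host.cpu.reserved, carrying a cpu_reservation_type attribute with the values total and used. The types and the reservedDataPoints helper are hypothetical stand-ins written for this example; they are not the code that mdatagen generates, and the MHz unit is an assumption.

```go
package main

import "fmt"

// cpuReservationType is a hypothetical stand-in for the
// cpu_reservation_type attribute discussed in the review.
type cpuReservationType string

const (
	reservationTotal cpuReservationType = "total"
	reservationUsed  cpuReservationType = "used"
)

// dataPoint pairs a metric name and attribute value with a reading in MHz.
type dataPoint struct {
	Metric          string
	ReservationType cpuReservationType
	ValueMHz        int64
}

// reservedDataPoints builds one data point per attribute value, so a single
// metric name carries both the total and the used reservation.
func reservedDataPoints(totalMHz, usedMHz int64) []dataPoint {
	const metric = "vcenter.host.cpu.reserved"
	return []dataPoint{
		{Metric: metric, ReservationType: reservationTotal, ValueMHz: totalMHz},
		{Metric: metric, ReservationType: reservationUsed, ValueMHz: usedMHz},
	}
}

func main() {
	// Made-up sample values for illustration only.
	for _, dp := range reservedDataPoints(32000, 8000) {
		fmt.Printf("%s{cpu_reservation_type=%q} %d\n", dp.Metric, dp.ReservationType, dp.ValueMHz)
	}
}
```

Keeping total and used under one metric name, distinguished only by the attribute, mirrors how the two values sit side by side in the UI.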

djaglowski

LGTM, just need checks fixed

github-actions bot added the receiver/vcenter label Jun 19, 2024

github-actions bot requested review from djaglowski, schmikei and StefanKurek June 19, 2024 03:37

BominRahmani force-pushed the feat/add-vcenter-host-metrics branch from f137150 to 4b6d86a Compare June 19, 2024 21:05

BominRahmani mentioned this pull request Jun 25, 2024

[receiver/vcenter] Additional metrics for vCenter receiver #33607

Closed

4 tasks

StefanKurek suggested changes Jun 25, 2024

github-actions bot added the cmd/otelcontribcol otelcontribcol command label Jun 25, 2024

StefanKurek reviewed Jun 25, 2024

StefanKurek approved these changes Jun 26, 2024

BominRahmani marked this pull request as ready for review June 26, 2024 18:17

BominRahmani requested a review from a team June 26, 2024 18:17

github-actions bot assigned codeboten Jun 26, 2024

StefanKurek suggested changes Jun 27, 2024

StefanKurek reviewed Jun 27, 2024

BominRahmani force-pushed the feat/add-vcenter-host-metrics branch 3 times, most recently from a06edcb to fdd24e7 Compare June 30, 2024 23:11

djaglowski approved these changes Jul 1, 2024

BominRahmani force-pushed the feat/add-vcenter-host-metrics branch from fdd24e7 to e44f552 Compare July 1, 2024 14:52

StefanKurek approved these changes Jul 1, 2024

BominRahmani added 9 commits July 1, 2024 11:46

Added Host metrics

e00bafb

Fixed lint

6581676

added temp fix for datacenter, updated govmomi, updated mock tests

9e21861

Added correct metrics to host-performance-counters, updated attribute…

627283e

…, updated docs

indent issue

9956e39

fixed perf manager xml, re-generated golden test files

d0c449a

added changelog + documentation grammer fix

cfec17d

Add new capacity metric, clarify old capacity metric

ef75e5b

updated description

ba5f4e1

BominRahmani added 3 commits July 1, 2024 11:52

make crosslink ran to fix failing CLI

eb6cfc4

make gotidy

e38dc7a

rebased

b51853d

BominRahmani force-pushed the feat/add-vcenter-host-metrics branch from 800febe to b51853d Compare July 1, 2024 15:56

djaglowski merged commit 2604193 into open-telemetry:main Jul 1, 2024
154 checks passed

github-actions bot added this to the next release milestone Jul 1, 2024
