feat(instrumentation-runtime-node)!: add prom-client-metrics #2136

pikalovArtemN · 2024-04-22T14:29:13Z

Which problem is this PR solving?

Make a part of request in Runtime instrumentation #1106

Short description of the changes

Add implementation for collecting event loop lag, garbage collector, heap size and heap space

…arbage collector, heap size and heap space

linux-foundation-easycla · 2024-04-22T14:29:17Z

The committers listed above are authorized under a signed CLA.

✅ login: pikalovArtemN / name: LowArt (fd8d803, 3cba596, dbb34ff, 926f052, 54858de, 046072e, 7418c78, 2cd9093, 5121a0b, 460003f, d99c458, fdfea51, 6506489, a69fc5c, 247403e, 922d677, ade0ab9, 012f370, d7c5108, ed6596f, 5efdd9d, 8bf14b5, 184be5e, 184b212, 43ba51c, 7a98bfd, 6a84254, 2d31e98, 97d8d92, 6b5fc8b, ad88f98, 310a2b2, b663012, cab7e6c)
✅ login: david-luna / name: David Luna (2189d51, feb3f36, 4941407, a6a9e81)

maryliag · 2024-04-24T15:55:43Z

plugins/node/instrumentation-runtime-node/src/types.ts

-   * @default 5000
-   */
-  eventLoopUtilizationMeasurementInterval?: number;
+  monitoringPrecision?: number;


curious about this change, can you give more context why the rename?

Because right now i use it for every metrics, not only for eventLoopUtilization. You think I needed to add parameters for every new metrics group ?

JCMais · 2024-05-01T14:42:47Z

I am super excited for this one, I think it was the main thing missing from OpenTelemetry's Node.js instrumentation! Thanks for opening this PR.

dyladan

Can you please take a look at open-telemetry/semantic-conventions#991 and give your input there? It looks like you made some different decisions than what was made there and I want to make sure we're all on the same page before merging something.

pikalovArtemN · 2024-05-02T06:10:39Z

Can you please take a look at open-telemetry/semantic-conventions#991 and give your input there? It looks like you made some different decisions than what was made there and I want to make sure we're all on the same page before merging something.

Thanks. Yes, i have some mismatches.

maryliag · 2024-05-02T12:20:41Z

@pikalovArtemN I would love to hear your opinion on the mismatches. Let me know what you think makes more sense 😄

pikalovArtemN · 2024-05-03T06:19:34Z

@pikalovArtemN I would love to hear your opinion on the mismatches. Let me know what you think makes more sense 😄

I have mismatches in names like eventloop => event_loop, and i need to fix units, i totaly forgot to set it properly XD. And fix attributes.

I decided to devide the metrics not by label, but names like it was in the prom-client library, but i think you do a better solution in the convension.

I think we need to colobarate, not all of metrics and attributes that i added exists in convention such as:

attribute nodejs.eventloop.lag.type -> stddev and mean not exists
nodejs.eventloop.utilization metric does't exist in specification
nodejs.active_handles.count -> does't exist in runtime-node, do i need to add it ?
nodejs.memory.size -> i have 2 splited metrics heap size and heap space, i think we need to split it by memory, heap size, and heap space in different metrics with attributes nodejs.memory.state: (total and used), nodejs.heapsize.state (total and used), nodejs.heapspace.state( total, used, available) and i add nodejs.memory.size in instumentation-runtime-node
nodejs.active_libuv_requests.count -> does't exist in runtime-node, do i need to add it ?

maryliag · 2024-05-03T13:05:03Z

attribute nodejs.eventloop.lag.type -> stddev and mean not exists

I can add that

nodejs.eventloop.utilization metric does't exist in specification

I can add that

nodejs.active_handles.count -> does't exist in runtime-node, do i need to add it ?

I think it would be good, I got that from the prom-client metrics list, so should be possible

nodejs.memory.size -> i have 2 splited metrics heap size and heap space, i think we need to split it by memory, heap size, and heap space in different metrics with attributes nodejs.memory.state: (total and used), nodejs.heapsize.state (total and used), nodejs.heapspace.state( total, used, available) and i add nodejs.memory.size in instumentation-runtime-node

I can make those changes on the convention

nodejs.active_libuv_requests.count -> -> does't exist in runtime-node, do i need to add it ?

I think it would be good, I also got that from prom-client, but it doesn't have libuv in the name, I added on the convention because of a feedback to make it more clear what it is

maryliag · 2024-05-03T13:44:51Z

heads up: because the semantic convention could be for all JS runtime (not just nodejs, but also others such as denojs or bunjs), it was suggested to not use nodejs on the metric name, so I replaced with jsruntime. I'm still waiting on more feedback if people think that is a good name, but that means you would also probably need to make the same update here

pikalovArtemN · 2024-05-03T13:45:57Z

heads up: because the semantic convention could be for all JS runtime (not just nodejs, but also others such as denojs or bunjs), it was suggested to not use nodejs on the metric name, so I replaced with jsruntime. I'm still waiting on more feedback if people think that is a good name, but that means you would also probably need to make the same update here

Ok

Flarna · 2024-05-03T14:00:37Z

If it is about any js runtime the node.js specific parts should be removed/renamed/...

anything coming from v8 is not automatically applicable to js engine in firefox or safari
browsers do not use libuv as far as I know (not sure about bun either)

maryliag · 2024-05-03T14:30:14Z

anything coming from v8

I don't understand what you mean here.

For things that are node specific, we can still create the metrics for it, such as the active libuv requests, I just need remove that from the semantic convention, but this PR can still keep it because can be useful.
This way we can move comments about the semantic convention to that PR, to keep things in one place

Flarna · 2024-05-03T14:37:29Z

There are JS engines (e.g. quickjs) which use a different heap structure/gc then google v8 (the JS engine used by node.js or deno).
The proposed gc/heap metrics match fine to node.js because it uses v8 but they might not fit well to other JS engines.

As a result I'm not sure if GC metrics should have v8 in the name to allow to distinguish them from other engines.
Alternative would be to create universal gc metrics for all types of runtimes using a gc. But I doubt that will be easy.

pikalovArtemN · 2024-05-03T17:35:36Z

I think right now we need to concentrate on nodejs realization only, because Node is most popular and in the next iteration adopt instrumentation to work with different engines. Right now it's so much work to do

david-luna · 2024-10-10T09:07:46Z

@pikalovArtemN

now I've noticed the package-lock.json file seems to be out of sync with too many changes although you made a small one in you package.json. That explains the installation issues.

Please get the latest version of package-lock.json file from main branch and run npm i from the root folder to add your small update. Push the changes and I'll run the CI again :)

…tion' into feat/node/prom-client-implementation

codecov · 2024-10-17T14:42:43Z

Codecov Report

Attention: Patch coverage is 87.70950% with 22 lines in your changes missing coverage. Please review.

Project coverage is 90.79%. Comparing base (6234918) to head (4941407).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...umentation-runtime-node/src/metrics/gcCollector.ts	72.00%	7 Missing ⚠️
...entation-runtime-node/src/metrics/baseCollector.ts	66.66%	4 Missing ⚠️
...untime-node/src/metrics/eventLoopDelayCollector.ts	93.47%	3 Missing ⚠️
...nstrumentation-runtime-node/src/instrumentation.ts	90.00%	2 Missing ⚠️
...runtime-node/src/metrics/eventLoopTimeCollector.ts	90.47%	2 Missing ⚠️
...-node/src/metrics/eventLoopUtilizationCollector.ts	88.23%	2 Missing ⚠️
...node/src/metrics/heapSpacesSizeAndUsedCollector.ts	94.11%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2136      +/-   ##
==========================================
- Coverage   90.86%   90.79%   -0.07%     
==========================================
  Files         161      169       +8     
  Lines        7858     8009     +151     
  Branches     1612     1632      +20     
==========================================
+ Hits         7140     7272     +132     
- Misses        718      737      +19

Files with missing lines	Coverage Δ
...trumentation-runtime-node/src/consts/attributes.ts	`100.00% <100.00%> (ø)`
...n-runtime-node/src/types/ConventionalNamePrefix.ts	`100.00% <100.00%> (ø)`
...nstrumentation-runtime-node/src/instrumentation.ts	`89.65% <90.00%> (+0.46%)`	⬆️
...runtime-node/src/metrics/eventLoopTimeCollector.ts	`90.47% <90.47%> (ø)`
...-node/src/metrics/eventLoopUtilizationCollector.ts	`88.23% <88.23%> (ø)`
...node/src/metrics/heapSpacesSizeAndUsedCollector.ts	`94.11% <94.11%> (ø)`
...untime-node/src/metrics/eventLoopDelayCollector.ts	`93.47% <93.47%> (ø)`
...entation-runtime-node/src/metrics/baseCollector.ts	`66.66% <66.66%> (ø)`
...umentation-runtime-node/src/metrics/gcCollector.ts	`72.00% <72.00%> (ø)`

maryliag · 2024-10-17T14:54:10Z

@dyladan you still have a request here, which was already addressed, can you take a look again?

david-luna · 2024-10-17T14:55:37Z

@dyladan is seems you need to approve since previously requested changes in this PR.

dyladan · 2024-10-21T13:01:43Z

plugins/node/instrumentation-runtime-node/src/consts/attributes.ts

+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+export const V8_HEAP_SIZE_NAME_ATTRIBUTE = 'heap.space.name';


I assume these are just temporary until the next semconv release?

actually, they already exist on the current release (they didn't when the PR was created), so @pikalovArtemN you can replace for the final values. For example, this one is ATTR_V8JS_HEAP_SPACE_NAME

david-luna · 2024-10-28T17:20:16Z

@pikalovArtemN this is almost done :)

could you fix the conflicts and address the last comment from @maryliag? Thank you

kristian240 · 2024-11-01T09:23:49Z

plugins/node/instrumentation-runtime-node/README.md

@@ -62,7 +62,7 @@ nodejs_performance_event_loop_utilization 0.010140079547955264

 | name | type | unit | default | description |
 |---|---|---|---|---|
-| [`eventLoopUtilizationMeasurementInterval`](./src/types.ts#L25) | `int` | millisecond | `5000` | The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. |
+| [`monitoringPrecision`](./src/types.ts#L25) | `int` | millisecond | `5000` | The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. |


suggestion(non-blocking): update the default

You changed the default to 10 so we should update the README too

Suggested change

| [`monitoringPrecision`](./src/types.ts#L25) | `int` | millisecond | `5000` | The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. |

| [`monitoringPrecision`](./src/types.ts#L25) | `int` | millisecond | `10` | The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. |

Fixed in 6b5fc8b

kristian240 · 2024-11-01T09:39:48Z

@david-luna any idea how fast this can be released after it is merged?

…tion' into feat/node/prom-client-implementation

maryliag · 2024-11-03T20:08:50Z

@kristian240 after this gets merged, we can aim for a new relase in a couple of days 😄

plugins/node/instrumentation-runtime-node/src/instrumentation.ts

trentm · 2024-11-07T00:56:50Z

plugins/node/instrumentation-runtime-node/README.md

@@ -32,7 +32,7 @@ const prometheusExporter = new PrometheusExporter({
 const sdk = new NodeSDK({
  metricReader: prometheusExporter,
  instrumentations: [new RuntimeNodeInstrumentation({
-    eventLoopUtilizationMeasurementInterval: 5000,
+    monitoringPrecision: 5000,


minor Q: It looks like the default is 10 ms (from lower down in this README). Should the example here show 10 rather than 5000?

pikalovArtemN added 3 commits April 22, 2024 17:16

feat(prom-client) add implementation for collecting event loop lag, g…

ed6596f

…arbage collector, heap size and heap space

test(prom-client) add tests for check implementation

046072e

chore(prom-client) change version and fix README.md

d7c5108

pikalovArtemN requested a review from a team April 22, 2024 14:29

Merge branch 'main' into feat/node/prom-client-implementation

ad88f98

github-actions bot added the pkg:instrumentation-runtime-node label Apr 22, 2024

pikalovArtemN mentioned this pull request Apr 22, 2024

Runtime instrumentation #1106

Open

pikalovArtemN changed the title ~~Feat/node/prom client implementation~~ feat(instrumentation-runtime-node): add prom-client-metrics Apr 22, 2024

maryliag reviewed Apr 24, 2024

View reviewed changes

pikalovArtemN requested a review from maryliag May 1, 2024 07:23

dyladan requested changes May 1, 2024

View reviewed changes

chore(instrumentation-runtime-node): fetch loop lag format in convention

2cd9093

pikalovArtemN added 6 commits May 3, 2024 22:22

chore(instrumentation-runtime-node): fetch other metrics to convention

184b212

test(instrumentation-runtime-node):fix some tests

922d677

chore(instrumentation-runtime-node): sync with conventions

3cba596

test(instrumentation-runtime-node): fix tests

54858de

test(instrumentation-runtime-node): fix tests

926f052

lint(instrumentation-runtime-node): lint

dbb34ff

Merge branch 'main' into feat/node/prom-client-implementation

2189d51

pikalovArtemN added 2 commits October 10, 2024 19:43

chore(instrumentation-runtime-node): fix package-lock.json

460003f

Merge remote-tracking branch 'origin/feat/node/prom-client-implementa…

310a2b2

…tion' into feat/node/prom-client-implementation

david-luna approved these changes Oct 17, 2024

View reviewed changes

Merge branch 'main' into feat/node/prom-client-implementation

feb3f36

dyladan approved these changes Oct 21, 2024

View reviewed changes

dyladan reviewed Oct 21, 2024

View reviewed changes

JessicaJHee mentioned this pull request Oct 22, 2024

chore: move from prom-client to OpenTelemetry metrics janus-idp/backstage-showcase#1811

Open

5 tasks

kristian240 reviewed Nov 1, 2024

View reviewed changes

pikalovArtemN added 3 commits November 3, 2024 10:01

chore(instrumentation-runtime-node): fix attributes names

6b5fc8b

Merge remote-tracking branch 'origin/feat/node/prom-client-implementa…

97d8d92

…tion' into feat/node/prom-client-implementation

chore(instrumentation-runtime-node): merge main

cab7e6c

kristian240 approved these changes Nov 3, 2024

View reviewed changes

chore(instrumentation-runtime-node): lint

b663012

david-luna reviewed Nov 4, 2024

View reviewed changes

plugins/node/instrumentation-runtime-node/src/instrumentation.ts Show resolved Hide resolved

david-luna added 2 commits November 5, 2024 20:06

chore: add skip lint directive

a6a9e81

Merge branch 'main' into feat/node/prom-client-implementation

4941407

david-luna merged commit 80d0c74 into open-telemetry:main Nov 5, 2024
25 checks passed

dyladan mentioned this pull request Nov 5, 2024

chore: release main #2507

Merged

pikalovArtemN deleted the feat/node/prom-client-implementation branch November 6, 2024 18:27

trentm reviewed Nov 7, 2024

View reviewed changes

This was referenced Nov 11, 2024

add instrumentation-runtime-node to auto-instrumentations-node #2523

Closed

feat(auto-instrumentations-node): enable runtime-node #2524

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(instrumentation-runtime-node)!: add prom-client-metrics #2136

feat(instrumentation-runtime-node)!: add prom-client-metrics #2136

pikalovArtemN commented Apr 22, 2024 •

edited

Loading

linux-foundation-easycla bot commented Apr 22, 2024 •

edited

Loading

maryliag Apr 24, 2024

pikalovArtemN Apr 25, 2024 •

edited

Loading

maryliag Jul 16, 2024

JCMais commented May 1, 2024

dyladan left a comment

pikalovArtemN commented May 2, 2024 •

edited

Loading

maryliag commented May 2, 2024

pikalovArtemN commented May 3, 2024 •

edited

Loading

maryliag commented May 3, 2024

maryliag commented May 3, 2024

pikalovArtemN commented May 3, 2024

Flarna commented May 3, 2024

maryliag commented May 3, 2024

Flarna commented May 3, 2024

pikalovArtemN commented May 3, 2024

david-luna commented Oct 10, 2024

codecov bot commented Oct 17, 2024 •

edited

Loading

maryliag commented Oct 17, 2024

david-luna commented Oct 17, 2024

dyladan Oct 21, 2024

maryliag Oct 21, 2024

pikalovArtemN Nov 3, 2024

david-luna commented Oct 28, 2024

kristian240 Nov 1, 2024

kristian240 Nov 3, 2024

kristian240 commented Nov 1, 2024

maryliag commented Nov 3, 2024

trentm Nov 7, 2024

	\| [`monitoringPrecision`](./src/types.ts#L25) \| `int` \| millisecond \| `5000` \| The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. \|
	\| [`monitoringPrecision`](./src/types.ts#L25) \| `int` \| millisecond \| `10` \| The approximate number of milliseconds for which to calculate event loop utilization averages. A larger value will result in more accurate averages at the expense of less granular data. Should be set to below the scrape interval of your metrics collector to avoid duplicated data points. \|

feat(instrumentation-runtime-node)!: add prom-client-metrics #2136

feat(instrumentation-runtime-node)!: add prom-client-metrics #2136

Conversation

pikalovArtemN commented Apr 22, 2024 • edited Loading

Which problem is this PR solving?

Short description of the changes

linux-foundation-easycla bot commented Apr 22, 2024 • edited Loading

Choose a reason for hiding this comment

pikalovArtemN Apr 25, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JCMais commented May 1, 2024

dyladan left a comment

Choose a reason for hiding this comment

pikalovArtemN commented May 2, 2024 • edited Loading

maryliag commented May 2, 2024

pikalovArtemN commented May 3, 2024 • edited Loading

maryliag commented May 3, 2024

maryliag commented May 3, 2024

pikalovArtemN commented May 3, 2024

Flarna commented May 3, 2024

maryliag commented May 3, 2024

Flarna commented May 3, 2024

pikalovArtemN commented May 3, 2024

david-luna commented Oct 10, 2024

codecov bot commented Oct 17, 2024 • edited Loading

Codecov Report

maryliag commented Oct 17, 2024

david-luna commented Oct 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

david-luna commented Oct 28, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kristian240 commented Nov 1, 2024

maryliag commented Nov 3, 2024

Choose a reason for hiding this comment

pikalovArtemN commented Apr 22, 2024 •

edited

Loading

linux-foundation-easycla bot commented Apr 22, 2024 •

edited

Loading

pikalovArtemN Apr 25, 2024 •

edited

Loading

pikalovArtemN commented May 2, 2024 •

edited

Loading

pikalovArtemN commented May 3, 2024 •

edited

Loading

codecov bot commented Oct 17, 2024 •

edited

Loading