[Merged by Bors] - Configurable monitoring endpoint frequency #3530

michaelsproul · 2022-08-31T07:00:23Z

Issue Addressed

Closes #3514

Proposed Changes

- Change default monitoring endpoint frequency to 120 seconds to fit with 30k requests/month limit.
- Allow configuration of the monitoring endpoint frequency using `--monitoring-endpoint-frequency N` where `N` is a value in seconds.
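
To make the 30k requests/month reasoning concrete, here is a minimal sketch of the arithmetic. It assumes a 30-day month and one request per period from a single process; `SECONDS_PER_MONTH` and `requests_per_month` are illustrative names, not part of Lighthouse, and the PR does not say how the service counts requests.

```rust
/// Seconds in a 30-day month. This is an assumption: the PR does not
/// state the accounting window used for the request limit.
const SECONDS_PER_MONTH: u64 = 30 * 24 * 60 * 60;

/// Number of requests a single process sends in a month when it
/// sends one request every `period_secs` seconds.
fn requests_per_month(period_secs: u64) -> u64 {
    SECONDS_PER_MONTH / period_secs
}

fn main() {
    for period in [60, 120] {
        println!(
            "{period}s -> {} requests/month per process",
            requests_per_month(period)
        );
    }
}
```

Under these assumptions a 60-second period gives 43,200 requests per month from one process, which exceeds 30,000, while a 120-second period gives 21,600.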

winksaville · 2022-08-31T15:28:40Z

Please add some documentation. I searched the "Lighthouse Book" and all I could find was this, and that gives no details about which metric information is sent or how often.

When I say "what metric information", the link would be really helpful in that documentation. But some additional verbiage on exactly which subset Lighthouse provides, and when, is needed. Also, it would be nice to clarify when to use `--validator-monitor-auto` and `--monitoring-endpoint`.

For example, when using `--monitoring-endpoint` on the vn it sends its metrics and "system" metrics. And on the bn it sends its metrics and "system" metrics. Thus the monitoring endpoint has redundant "system" metrics and twice as many requests, that seems wasteful. I wonder if adding `--validator-monitor-auto` would reduce the requests and eliminate the redundant "system" metrics?

Hopefully, this makes sense :)

pawanjay176

LGTM!

beacon_node/src/cli.rs

pawanjay176 · 2022-08-31T20:55:40Z

@winksaville The monitoring-endpoint is an externally provided service. Lighthouse just conforms to the specification of the data format described in the link you mentioned. When adding this feature, the assumption was that the monitoring service (e.g. beaconcha.in) would provide the documentation on how to run this service with the different client implementations: https://kb.beaconcha.in/beaconcha.in-explorer/mobile-app-less-than-greater-than-beacon-node

The validator monitor is a Lighthouse specific feature that provides info about specified validators. The documentation we have here https://lighthouse-book.sigmaprime.io/validator-monitoring.html is for that feature.

Happy to add some docs linking to the beaconcha.in service and clarifying that it is different from the lighthouse validator monitor if that would reduce confusion :)

> Thus the monitoring endpoint has redundant "system" metrics and twice as many requests, that seems wasteful.

This was done intentionally as we also have setups where the bn and vc are on separate machines. We could add a flag to the vc to prevent redundant sending of system metrics.
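
As a rough sketch of what such a flag could control (purely hypothetical: `MetricsKind`, `kinds_to_send`, and the `send_system` switch are invented for illustration and are not Lighthouse's actual types or options), each process would always report its own metrics and only optionally add the system metrics:

```rust
/// Hypothetical stand-in for the kinds of metrics a process can report.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum MetricsKind {
    BeaconNode,
    Validator,
    System,
}

/// Returns the metric kinds a process would send on each update.
/// `send_system` models the hypothetical flag discussed above.
fn kinds_to_send(own: MetricsKind, send_system: bool) -> Vec<MetricsKind> {
    let mut kinds = vec![own];
    if send_system {
        kinds.push(MetricsKind::System);
    }
    kinds
}

fn main() {
    // bn and vc on the same machine: only the bn reports system metrics.
    let bn = kinds_to_send(MetricsKind::BeaconNode, true);
    let vc = kinds_to_send(MetricsKind::Validator, false);
    println!("bn sends {bn:?}, vc sends {vc:?}");
}
```

In this sketch the vc still sends its own update, so the duplicate system metrics go away but the number of requests does not change.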

beacon_node/src/cli.rs

michaelsproul · 2022-09-01T00:36:35Z

Added some docs that cover all the things I think you wanted @winksaville 🙏 https://github.com/sigp/lighthouse/blob/d3788e2937da0173039c0c065ea46101be62adb2/book/src/advanced_metrics.md#remote-monitoring

winksaville · 2022-09-01T00:53:06Z

validator_client/src/cli.rs

+                .long("monitoring-endpoint-period")
+                .value_name("SECONDS")
+                .help("Defines how many seconds to wait between each message sent to \
+                       the monitoring-endpoint. Default: 60s")


How about something like:

.help(format!("Defines how many seconds to wait between each message sent to the monitoring-endpoint. Default: {}", monitoring_api::DEFAULT_UPDATE_DURATION))

We can't easily put a dynamic value here. The problem is that clap requires the arguments to live as long as the entire App struct, and a format!() string only lives as long as the scope in which it's defined. We could plumb the dynamic values around (unwieldy) or do what I was doing before, where we leak the string. Leaking the string is quite a nasty hack with a small runtime downside, so I'd prefer to just hardcode this and risk it going out of date. Sometimes worse is better.

I'm quite disappointed in clap for this and other reasons, so I'd support a PR in future to move us away from it completely. We're likely to overhaul all our CLI parsing when adding config file support (#3079), and can decide whether to keep clap and leak strings, or migrate to something better.
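
For readers unfamiliar with the "leak the string" approach, the following is a minimal standalone sketch using only the standard library. It does not call clap; `leaked_help` is an illustrative name, and the constant is redeclared here with the 60-second value from this PR's diff.

```rust
/// Assumed default, mirroring the 60-second value shown in this PR's diff.
const DEFAULT_UPDATE_DURATION: u64 = 60;

/// Builds the help text at runtime and leaks it to obtain a `&'static str`.
/// The allocation is never freed for the rest of the program's lifetime.
fn leaked_help() -> &'static str {
    let help = format!(
        "Defines how many seconds to wait between each message sent to \
         the monitoring-endpoint. Default: {}s",
        DEFAULT_UPDATE_DURATION
    );
    Box::leak(help.into_boxed_str())
}

fn main() {
    // A `&'static str` can be handed to any API that requires the text
    // to outlive the current scope.
    let help: &'static str = leaked_help();
    println!("{help}");
}
```

`Box::leak` trades a small, permanent allocation for a `'static` lifetime, which is presumably part of why it is described above as a hack.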

Add a comment to update this string, maybe something like:

the monitoring-endpoint. Default: 60s") // Update when DEFAULT_UPDATE_DURATION changes

book/src/advanced_metrics.md

winksaville

LGTM, one minor tweak if you like, and thanks for adding the note to validator-monitoring.md! And a big thanks for doing this PR.

winksaville · 2022-09-01T01:47:08Z

common/monitoring_api/src/lib.rs

@@ -16,7 +16,7 @@ use types::*;
 pub use types::ProcessType;

 /// Duration after which we collect and send metrics to remote endpoint.
-pub const UPDATE_DURATION: u64 = 60;
+pub const DEFAULT_UPDATE_DURATION: u64 = 60;


Add a comment, something like:

pub const DEFAULT_UPDATE_DURATION: u64 = 60; // Search for this and update hard-coded strings.
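
An alternative to a reminder comment would be a check that fails when the hardcoded help text and the constant drift apart. This is only a sketch of that idea, not something proposed in the PR: `PERIOD_HELP` and `help_matches_default` are hypothetical names, and the constant is redeclared locally so the example stands alone.

```rust
/// Assumed default, mirroring the value in the diff above.
const DEFAULT_UPDATE_DURATION: u64 = 60;

/// Hypothetical constant holding the hardcoded CLI help text.
const PERIOD_HELP: &str = "Defines how many seconds to wait between each message sent to \
                           the monitoring-endpoint. Default: 60s";

/// Returns true if the help text advertises the given default in seconds.
fn help_matches_default(help: &str, default_secs: u64) -> bool {
    help.ends_with(&format!("Default: {}s", default_secs))
}

fn main() {
    // In a real crate this check would more naturally live in a unit test.
    assert!(help_matches_default(PERIOD_HELP, DEFAULT_UPDATE_DURATION));
    println!("help text and default agree");
}
```

If the default were later changed without touching the help string, the assertion would fail instead of relying on someone noticing the comment.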

winksaville · 2022-09-01T01:47:55Z

validator_client/src/cli.rs

+                .long("monitoring-endpoint-period")
+                .value_name("SECONDS")
+                .help("Defines how many seconds to wait between each message sent to \
+                       the monitoring-endpoint. Default: 60s")


Add a comment to update this string, maybe something like:

the monitoring-endpoint. Default: 60s") // Update when DEFAULT_UPDATE_DURATION changes

michaelsproul · 2022-09-05T08:13:37Z

bors r+

## Issue Addressed

Closes #3514

## Proposed Changes

- Change default monitoring endpoint frequency to 120 seconds to fit with 30k requests/month limit.
- Allow configuration of the monitoring endpoint frequency using `--monitoring-endpoint-frequency N` where `N` is a value in seconds.

bors · 2022-09-05T11:28:45Z

Pull request successfully merged into unstable.

Build succeeded:

Configurable monitoring-endpoint-frequency

9070660

michaelsproul added the ready-for-review, low-hanging-fruit, and v3.1.2 labels Aug 31, 2022

michaelsproul requested a review from pawanjay176 August 31, 2022 07:00

michaelsproul mentioned this pull request Aug 31, 2022

Allow DURATION for monitoring_endpoint to be configurable #3514

Closed

pawanjay176 approved these changes Aug 31, 2022

Set default back to 60s, fix CLI

a24fc56

winksaville reviewed Sep 1, 2022

Change frequency -> period, add docs

d3788e2

winksaville reviewed Sep 1, 2022

Validator monitor vs monitoring endpoint

0f50052

winksaville approved these changes Sep 1, 2022

michaelsproul added the ready-for-merge label and removed the ready-for-review label Sep 5, 2022

bors bot changed the title ~~Configurable monitoring endpoint frequency~~ [Merged by Bors] - Configurable monitoring endpoint frequency Sep 5, 2022

bors bot closed this Sep 5, 2022
