
Telemetry: introduce backlog_* metrics, plus minor fixes #2451

Closed
@adizere

Description

This is a follow-up to the work for #2408 and PR #2409.

Suggestions

We discussed this with operators yesterday. Here are some takeaways, which also draw on my own observations so far:

  1. Even after a channel is cleared, the oldest_* metrics can stay at their last value (i.e., they are not reset to 0).

  2. I think we should clarify what oldest_timestamp is. It seems to be a timestamp local to the Hermes process, not an on-chain packet timestamp (i.e., when the packet was created). This is not clear from the telemetry help message, specifically:

    # HELP oldest_timestamp The timestamp of the oldest sequence number in seconds
    # TYPE oldest_timestamp gauge
    oldest_timestamp{chain="ibc-0",channel="channel-3",counterparty="ibc-1",port="transfer"} 0

  3. Let's rename oldest_* metrics to backlog_* and additionally:

    • make it clear these metrics are per-channel
    • add a backlog_size metric to capture the number of pending packets in a channel.
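
To make suggestions 1 and 3 concrete, here is a minimal Python sketch of the intended per-channel semantics. It is not Hermes code: the class `BacklogMetrics`, its methods, and the metric names `backlog_oldest_sequence` and `backlog_oldest_timestamp` are assumptions made for illustration (only `backlog_size` and the `backlog_*` prefix come from the suggestions above). The timestamp is deliberately the local time at which the process first observed the packet, and all three values drop back to 0 once the channel has no pending packets.

```python
import time
from typing import Callable, Dict, Tuple

# Hypothetical label set, mirroring the labels in the sample output above:
# (chain, channel, counterparty, port)
ChannelKey = Tuple[str, str, str, str]


class BacklogMetrics:
    """Illustrative per-channel backlog tracker (not the Hermes implementation)."""

    def __init__(self, clock: Callable[[], int] = lambda: int(time.time())):
        self._clock = clock
        # channel -> {sequence: local Unix time (seconds) when first observed}
        self._pending: Dict[ChannelKey, Dict[int, int]] = {}

    def packet_observed(self, key: ChannelKey, sequence: int) -> None:
        pending = self._pending.setdefault(key, {})
        if sequence not in pending:
            # Local process time, NOT the on-chain creation time of the packet.
            pending[sequence] = self._clock()

    def packet_cleared(self, key: ChannelKey, sequence: int) -> None:
        self._pending.get(key, {}).pop(sequence, None)

    def snapshot(self, key: ChannelKey) -> Dict[str, int]:
        pending = self._pending.get(key, {})
        if not pending:
            # Empty backlog: every gauge resets to 0 instead of keeping a stale value.
            return {
                "backlog_size": 0,
                "backlog_oldest_sequence": 0,
                "backlog_oldest_timestamp": 0,
            }
        oldest = min(pending)
        return {
            "backlog_size": len(pending),
            "backlog_oldest_sequence": oldest,
            "backlog_oldest_timestamp": pending[oldest],
        }

    def render(self, key: ChannelKey) -> str:
        """Format the gauges as Prometheus-style sample lines for one channel."""
        chain, channel, counterparty, port = key
        labels = (
            f'chain="{chain}",channel="{channel}",'
            f'counterparty="{counterparty}",port="{port}"'
        )
        return "\n".join(
            f"{name}{{{labels}}} {value}"
            for name, value in self.snapshot(key).items()
        )
```

Keeping the pending set per channel means `backlog_size` and the oldest-packet gauges are derived from the same state, so clearing the last packet resets all of them together.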
