Skip to content

Admission Control: Essential Metrics Classification & Cloud Integration Updates #162508

@kevin-v-ngo

Description

@kevin-v-ngo

Context

This issue tracks updates to Admission Control (AC) metrics based on review of which metrics meet the bar for Essential status:

  • Clear monitoring / troubleshooting valuex
  • Stability
  • Production guarantees

Playbook to add metrics.

Promote to Essential + Add to Cloud Integrations for Advanced cluster

Core resource indicators

  • sys.cpu.combined.percent-normalized (already Essential, need to add to Cloud integration)
  • sys.runnable.goroutines.per.cpu (already Essential, need to add to Cloud integration)

Admission Control

  • admission.wait_durations.sql-kv-response-p99

  • admission.wait_durations.sql-sql-response-p99

  • admission.wait_durations.elastic-stores-p99

  • admission.wait_durations.elastic-cpu-p99

  • admission.granter.slots_exhausted_duration.kv

  • admission.granter.io_tokens_exhausted_duration.kv

  • admission.granter.elastic_io_tokens_exhausted_duration.kv

  • admission.elastic_cpu.nanos_exhausted_duration

  • kvflowcontrol.eval_wait.regular.duration-p99

  • kvflowcontrol.eval_wait.elastic.duration-p99

  • kvflowcontrol.send_queue.bytes

Epic: CRDB-35697

Jira issue: CRDB-59506

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions