feat: add observoor CPU utilization model #218

samcm · 2026-02-12T04:25:01Z

Summary

Adds external model observoor_cpu_utilization to ingest eBPF CPU utilization data from the observoor database
Adds transformation model fct_node_cpu_utilization with node_class enrichment for EIP-7870 filtering
Migrations 070 (source table) and 071 (transformation output) with standard _local + distributed pattern

Depends on observoor data being available on the xatu ClickHouse cluster. Frontend counterpart: ethpandaops/lab#417

Add external model for observoor.cpu_utilization and transformation model fct_node_cpu_utilization that enriches with node_class for EIP-7870 reference node filtering. Includes migration 070 and auto-generated proto/Go bindings. Also fixes proto comments for int_engine_new_payload_fastest_execution_by_node_class that referenced the wrong table name.

Fix fct_node_cpu_utilization dependency format from "observoor.cpu_utilization" to "{{external}}.observoor_cpu_utilization" to match the expected dependency pattern. Update FROM clause to use the correct dep helper syntax. Add pectra and fusaka transformation tests with assertions covering data integrity, CPU percentage bounds, and node_class enrichment logic.

The CI test runner needs this table schema to exist so it can clone it and load parquet test data for fct_node_cpu_utilization.

The test runner's CloneExternalDatabase always looked in the `default` database for external table schemas. External models with a `database` field in their frontmatter (e.g., observoor_cpu_utilization → observoor.cpu_utilization) need to be cloned from the correct source database. Changes: - Add Database field to Frontmatter, SourceDB/SourceTable to ModelMetadata - Add ExternalTableRef type to carry cross-database source info through the pipeline - Update CloneExternalDatabase to accept ExternalTableRef list with per-table source DB - Update cloneTableWithUniqueReplicaPath and modifyCreateTableForClone to handle table rename when source table name differs from model name - Remove migration 071 (incorrectly placed external table on CBT cluster)

The frontmatter `table` field for cross-database external models contains the source table name (e.g., "cpu_utilization"), not the model identifier (e.g., "observoor_cpu_utilization"). The test runner was caching the model under the wrong key, causing lookups to fail and falling back to default database when cloning tables.

The global word-boundary regex was causing double-prefixing of table names in cross-database clones (observoor_observoor_cpu_utilization_local). Replace with targeted string replacements that only modify the Distributed engine's local table reference.

Two fixes for fct_node_cpu_utilization producing 0 rows in CI: 1. Fix transformation dep key: CBT resolves external dependency entries using eConfig.Table (from frontmatter), not the model name. For cross-database models where table != model name, the dep key must match the frontmatter table name ("cpu_utilization"), not the model name ("observoor_cpu_utilization"). 2. Fix parquet data loading: Cross-database external models have their bounds scan and dependency helpers resolve to the source database (e.g., observoor.cpu_utilization). The test runner must load parquet data into the source database, not the per-test ext_XXX database, so the CBT engine finds data during bounds scanning.

…l models The {{external}} placeholder substitutes the default external database, which doesn't match cross-database models that register under their own database (e.g., observoor.cpu_utilization). Use the literal database.table format so CBT's DAG lookup resolves correctly. Also adds resolveExternalDependency to the test runner so "observoor.cpu_utilization" maps back to the canonical model name "observoor_cpu_utilization".

Better aligns with naming conventions since the data is per-process (keyed by pid + client_type) within each node.

samcm requested a review from Savid as a code owner February 12, 2026 04:25

samcm mentioned this pull request Feb 12, 2026

feat: add Node Resources tab to slot detail page ethpandaops/lab#417

Open

samcm added 9 commits February 12, 2026 16:02

Add missing external model migration for observoor_cpu_utilization

322ddb4

The CI test runner needs this table schema to exist so it can clone it and load parquet test data for fct_node_cpu_utilization.

rename: fct_node_cpu_utilization → fct_node_cpu_utilization_by_process

81cc5f4

Better aligns with naming conventions since the data is per-process (keyed by pid + client_type) within each node.

fix: update blob count assertion for Pectra (max 9 blobs per EIP-7691)

12334b8

samcm merged commit 12334b8 into master Feb 13, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add observoor CPU utilization model #218

feat: add observoor CPU utilization model #218

Uh oh!

samcm commented Feb 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: add observoor CPU utilization model #218

feat: add observoor CPU utilization model #218

Uh oh!

Conversation

samcm commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

samcm commented Feb 12, 2026 •

edited

Loading