Improve stability of Block Handlers #738

DZakh · 2025-09-12T15:48:55Z

Summary by CodeRabbit

New Features
- Configurable per-chain target buffer size with auto-derived defaults and new buffering metrics.
- Improved multichain support: unordered mode now detects pending items across chains.
- Exposed isFetchingAtHead on chain fetchers for clearer head-tracking.
- Added Array.at utility returning an optional element.
Breaking Changes
- Removed db_write_timestamp from generated schemas and indexes; DDL and tests updated.
- Fetch/state APIs switched to buffer-driven fields; several call-site parameters were removed or reshaped.

coderabbitai · 2025-09-12T15:49:02Z

Walkthrough

Adds a buffer-driven fetch model (targetBufferSize, bufferBlockNumber/bufferBlock helpers, latestFullyFetchedBlock/latestOnBlockBlockNumber) across FetchState, ChainFetcher, ChainManager, SourceManager and GlobalState; introduces hasUnorderedNextItem in Batch.res; removes db_write_timestamp from templates/schemas and tests; adds Array.at in Utils; updates tests for buffering and onBlock behavior.

Changes

Cohort / File(s)	Summary
FetchState & buffering propagation `codegenerator/cli/npm/envio/src/FetchState.res`, `codegenerator/cli/npm/envio/src/sources/SourceManager.res`, `codegenerator/cli/npm/envio/src/sources/SourceManager.resi`, `codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res`, `codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res`, `codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res`, `codegenerator/cli/templates/static/codegen/src/Index.res`	Introduces `targetBufferSize`, `latestFullyFetchedBlock`, `latestOnBlockBlockNumber`, `bufferBlockNumber`/`bufferBlock` helpers and comparator logic; replaces prior isFetchingAtHead paths with buffer-based semantics; updates public types and signatures (make, copy, updateInternal, getNextQuery, handleQueryResult, registerDynamicContracts), threads targetBufferSize into fetcher construction, and updates Prometheus metrics.
Unordered batch helper `codegenerator/cli/npm/envio/src/Batch.res`	Adds `hasUnorderedNextItem(fetchStates: ChainMap.t<FetchState.t>) => bool` to detect unordered next items across chains.
DB timestamp removal & templates `codegenerator/cli/npm/envio/src/db/InternalTable.res`, `codegenerator/cli/npm/envio/src/db/EntityHistory.res`, `codegenerator/cli/templates/dynamic/codegen/src/db/Entities.res.hbs`, `scenarios/test_codegen/schema.graphql`, `scenarios/test_codegen/test/SerDe_Test.res`, `scenarios/test_codegen/test/lib_tests/PgStorage_test.res`, `scenarios/test_codegen/test/entity-column-types-test.ts`	Removes `db_write_timestamp` column from raw_events and generated entity templates; EntityHistory.fromTable no longer special-cases/excludes that field; updates schema and tests/expected DDL and test assertions accordingly.
Utils addition `codegenerator/cli/npm/envio/src/Utils.res`	Adds external binding `Array.at: (array<'a>, int) => option<'a>` returning element or None.
Tests: buffering & onBlock `scenarios/test_codegen/test/ChainManager_test.res`, `scenarios/test_codegen/test/lib_tests/FetchState_test.res`, `scenarios/test_codegen/test/lib_tests/FetchState_onBlock_test.res`, `scenarios/test_codegen/test/lib_tests/SourceManager_test.res`, `scenarios/erc20_multichain_factory/test/DynamicContractRecovery_test.res`	Updates and adds tests to exercise targetBufferSize, buffer-driven queue generation for onBlock configs, removal of currentBlockHeight params, and ChainFetcher/FetchState public-field changes; adds FetchState onBlock unit tests and adjusts many test DDL expectations to drop db_write_timestamp.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  actor ChainManager
  participant ChainFetcher
  participant SourceManager
  participant FetchState
  participant Executor as executeQuery

  ChainManager->>ChainFetcher: requestNext() (uses targetBufferSize)
  ChainFetcher->>FetchState: getNextQuery(fetchState{targetBufferSize,...})
  FetchState-->>ChainFetcher: Query | None (based on bufferBlockNumber)
  alt Query available
    ChainFetcher->>SourceManager: fetchNext(~fetchState,...)
    SourceManager->>Executor: run(query)
    Executor-->>SourceManager: result(newItems, latestFetchedBlock)
    SourceManager->>FetchState: handleQueryResult(~query, ~latestFetchedBlock, ~newItems)
    FetchState-->>SourceManager: updated state (mutItems buffered/sorted)
    note right of FetchState: buffered items bounded by targetBufferSize
  else No query
    ChainFetcher-->>ChainManager: idle / await new block
  end
  note right of ChainManager: unordered readiness uses hasUnorderedNextItem(fetchStates)

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

On block interval #734 — Overlaps on Batch/queue ordering and onBlock queue handling; touches related buffering/ordering code.
Optimise Indexing Buffer #546 — Modifies FetchState internals and metrics around buffering/indexing, aligning with these changes.
Sort items received from data source #694 — Related FetchState queue handling changes (compare/sort and handleQueryResult semantics).

Suggested reviewers

JonoPrest
moose-code

Poem

Thump-thump I buffer, hop and sort,
I chase new blocks and hold my court.
No timestamp column in my trail,
I queue, I fetch — I never fail.
A rabbit coder, nimble and bright 🐇

Pre-merge checks and finishing touches

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The title "Improve stability of Block Handlers" succinctly and accurately reflects the primary intent of the changeset, which centers on buffering and reworking fetch-state/onBlock logic to make block processing more stable (e.g., targetBufferSize, bufferBlockNumber, and related fetcher changes). It is concise, focused on the main goal, and avoids noisy implementation details while remaining meaningful to reviewers scanning history. The phrasing is neither misleading nor overly generic.
Docstring Coverage	✅ Passed	No functions found in the changes. Docstring coverage check skipped.

✨ Finishing touches

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch dz/safe-on-block

Tip

👮 Agentic pre-merge checks are now available in preview!

Pro plan users can now enable pre-merge checks in their settings to enforce checklists before merging PRs.

Built-in checks – Quickly apply ready-made checks to enforce title conventions, require pull request descriptions that follow templates, validate linked issues for compliance, and more.
Custom agentic checks – Define your own rules using CodeRabbit’s advanced agentic capabilities to enforce organization-specific policies and workflows. For example, you can instruct CodeRabbit’s agent to verify that API documentation is updated whenever API schema files are modified in a PR. Note: Upto 5 custom checks are currently allowed during the preview period. Pricing for this feature will be announced in a few weeks.

Please see the documentation for more information.

Example:

reviews:
  pre_merge_checks:
    custom_checks:
      - name: "Undocumented Breaking Changes"
        mode: "warning"
        instructions: |
          Pass/fail criteria: All breaking changes to public APIs, CLI flags, environment variables, configuration keys, database schemas, or HTTP/GraphQL endpoints must be documented in the "Breaking Change" section of the PR description and in CHANGELOG.md. Exclude purely internal or private changes (e.g., code not exported from package entry points or explicitly marked as internal).

Please share your feedback with us on this Discord post.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (7)

codegenerator/cli/templates/static/codegen/src/Index.res (1)
186-192: Avoid misleading 0 for latestFetchedBlockNumber; bound by chain startBlock

If startBlock > 0 and buffer is still empty (-1), clamping to 0 under-reports. Use startBlock - 1 as the lower bound.
-      let latestFetchedBlockNumber = Pervasives.max(fetchState->FetchState.bufferBlockNumber, 0)
+      let minDisplay = Pervasives.max(cf.chainConfig.startBlock - 1, 0)
+      let latestFetchedBlockNumber =
+        Pervasives.max(fetchState->FetchState.bufferBlockNumber, minDisplay)
Please confirm TUI/console consumers expect startBlock - 1 semantics and not 0 in this field.
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res (1)

17-17: Public field isFetchingAtHead added

Looks good; consider documenting semantics (head of chain vs confirmed head) where it’s updated.
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res (1)
9-18: Consider extracting the magic number 3 as a named constant.

The targetBatchesInBuffer value of 3 appears to be a crucial tuning parameter for buffer management. Extracting it as a module-level constant would improve maintainability and make it easier to adjust if needed.
+let defaultTargetBatchesInBuffer = 3
+
 let calculateTargetBufferSize = activeChainsCount => {
-  let targetBatchesInBuffer = 3
   switch Env.targetBufferSize {
   | Some(size) => size
   | None =>
     Env.maxProcessBatchSize * (
-      activeChainsCount > targetBatchesInBuffer ? 1 : targetBatchesInBuffer
+      activeChainsCount > defaultTargetBatchesInBuffer ? 1 : defaultTargetBatchesInBuffer
     )
   }
 }
scenarios/test_codegen/test/lib_tests/FetchState_test.res (1)
2119-2119: Remove or comment out debug logging statement.

The Js.log(updatedFetchState) debug statement should be removed from production test code.
-    Js.log(updatedFetchState)
-
codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res (1)
157-163: Simplify the isFetchingAtHead logic using max function.

The conditional assignment can be simplified for better readability.
     {
       ...chainFetcher,
-      isFetchingAtHead: chainFetcher.isFetchingAtHead ||
-      chainFetcher.fetchState->FetchState.bufferBlockNumber >= currentBlockHeight,
+      isFetchingAtHead: chainFetcher.isFetchingAtHead ||
+        chainFetcher.fetchState->FetchState.bufferBlockNumber >= currentBlockHeight,
       currentBlockHeight,
     }
codegenerator/cli/npm/envio/src/FetchState.res (2)
218-218: Consider documenting the blockItemLogIndex constant value.

The value 16777216 (2^24) is used to ensure block items have higher priority than events at the same block. This magic number should be documented to explain its purpose and why this specific value was chosen.
-// Some big number which should be bigger than any log index
+// Use 2^24 as the base logIndex for block items to ensure they're processed
+// with higher priority than events at the same block (max logIndex in a block is typically much smaller)
 let blockItemLogIndex = 16777216
1287-1308: Improve hasFullBatch implementation clarity.

The comment about not subtracting 1 is correct but the implementation could be clearer with a direct comparison.
   let hasFullBatch = ({queue} as fetchState: t, ~maxBatchSize) => {
-    // Queue is ordered from latest to earliest, so the earliest eligible
-    // item for a full batch of size B is at index (length - B).
-    // Do NOT subtract an extra 1 here; when length === B we should still
-    // classify the queue as full and probe index 0.
-    let targetBlockIdx = queue->Array.length - maxBatchSize
-    if targetBlockIdx < 0 {
+    if queue->Array.length < maxBatchSize {
       false
     } else {
-      // Unsafe can fail when maxBatchSize is 0,
-      // but we ignore the case
+      // Check if we have enough eligible items for a full batch
+      // The item at index (length - maxBatchSize) must be eligible
+      let targetBlockIdx = queue->Array.length - maxBatchSize
       queue->Js.Array2.unsafe_get(targetBlockIdx)->Internal.getItemBlockNumber <=
         fetchState->bufferBlockNumber
     }
   }

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4ac325a and 2e429c2.

📒 Files selected for processing (20)

codegenerator/cli/npm/envio/src/Batch.res (1 hunks)
codegenerator/cli/npm/envio/src/FetchState.res (17 hunks)
codegenerator/cli/npm/envio/src/Utils.res (1 hunks)
codegenerator/cli/npm/envio/src/db/EntityHistory.res (0 hunks)
codegenerator/cli/npm/envio/src/db/InternalTable.res (0 hunks)
codegenerator/cli/npm/envio/src/sources/SourceManager.res (0 hunks)
codegenerator/cli/npm/envio/src/sources/SourceManager.resi (0 hunks)
codegenerator/cli/templates/dynamic/codegen/src/db/Entities.res.hbs (0 hunks)
codegenerator/cli/templates/static/codegen/src/Index.res (1 hunks)
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res (10 hunks)
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res (5 hunks)
codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res (7 hunks)
scenarios/test_codegen/schema.graphql (0 hunks)
scenarios/test_codegen/test/ChainManager_test.res (3 hunks)
scenarios/test_codegen/test/SerDe_Test.res (1 hunks)
scenarios/test_codegen/test/entity-column-types-test.ts (1 hunks)
scenarios/test_codegen/test/lib_tests/FetchState_onBlock_test.res (1 hunks)
scenarios/test_codegen/test/lib_tests/FetchState_test.res (64 hunks)
scenarios/test_codegen/test/lib_tests/PgStorage_test.res (6 hunks)
scenarios/test_codegen/test/lib_tests/SourceManager_test.res (3 hunks)

💤 Files with no reviewable changes (6)

scenarios/test_codegen/schema.graphql
codegenerator/cli/npm/envio/src/sources/SourceManager.res
codegenerator/cli/npm/envio/src/sources/SourceManager.resi
codegenerator/cli/npm/envio/src/db/EntityHistory.res
codegenerator/cli/templates/dynamic/codegen/src/db/Entities.res.hbs
codegenerator/cli/npm/envio/src/db/InternalTable.res

🧰 Additional context used

📓 Path-based instructions (4)

scenarios/test_codegen/**/*.ts

📄 CodeRabbit inference engine (scenarios/test_codegen/.cursor/rules/hyperindex.mdc)

scenarios/test_codegen/**/*.ts: After changing any TypeScript files, run: pnpm tsc --noEmit to ensure successful compilation
When updating existing entities in handlers, always use the spread operator to create updated objects before persisting
For any external call (e.g., fetch), wrap it in an Effect via experimental_createEffect and consume via context.effect
Use !context.isPreload to skip logic that should not run during preload
In TypeScript, set relationship fields using *_id properties (e.g., token_id) rather than object references
Always cast timestamps from events to BigInt (e.g., BigInt(event.block.timestamp))
When matching addresses in configuration objects within code, ensure keys are lowercase and compare using address.toLowerCase()
Use string | undefined for optional string fields instead of string | null
Always normalize token amounts to a standard decimal (e.g., 18) before addition across tokens; use helpers like normalizeAmountToUSD()

Files:

scenarios/test_codegen/test/entity-column-types-test.ts

**/*.{res,resi}

📄 CodeRabbit inference engine (.cursor/rules/rescript.mdc)

**/*.{res,resi}: Never use [| item |] to create an array. Use [ item ] instead.
Must always use = for setting value to a field. Use := only for ref values created using ref function.
ReScript has record types which require a type definition before hand. You can access record fields by dot like foo.myField.
It's also possible to define an inline object, it'll have quoted fields in this case.
Use records when working with structured data, and objects to conveniently pass payload data between functions.
Never use %raw to access object fields if you know the type.

Files:

codegenerator/cli/templates/static/codegen/src/Index.res
scenarios/test_codegen/test/SerDe_Test.res
codegenerator/cli/npm/envio/src/Batch.res
scenarios/test_codegen/test/lib_tests/FetchState_onBlock_test.res
codegenerator/cli/npm/envio/src/Utils.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res
scenarios/test_codegen/test/lib_tests/PgStorage_test.res
scenarios/test_codegen/test/ChainManager_test.res
scenarios/test_codegen/test/lib_tests/FetchState_test.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res
codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res
scenarios/test_codegen/test/lib_tests/SourceManager_test.res
codegenerator/cli/npm/envio/src/FetchState.res

**/*.res

📄 CodeRabbit inference engine (.cursor/rules/navigation.mdc)

Prefer reading ReScript .res modules directly; compiled .js artifacts can be ignored

Files:

codegenerator/cli/templates/static/codegen/src/Index.res
scenarios/test_codegen/test/SerDe_Test.res
codegenerator/cli/npm/envio/src/Batch.res
scenarios/test_codegen/test/lib_tests/FetchState_onBlock_test.res
codegenerator/cli/npm/envio/src/Utils.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res
scenarios/test_codegen/test/lib_tests/PgStorage_test.res
scenarios/test_codegen/test/ChainManager_test.res
scenarios/test_codegen/test/lib_tests/FetchState_test.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res
codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res
scenarios/test_codegen/test/lib_tests/SourceManager_test.res
codegenerator/cli/npm/envio/src/FetchState.res

codegenerator/cli/templates/{static,dynamic}/codegen/src/**/*

📄 CodeRabbit inference engine (.cursor/rules/navigation.mdc)

Edit template versions under codegenerator/cli/templates/static/codegen/src or codegenerator/cli/templates/dynamic/codegen/src instead of editing generated/src

Files:

codegenerator/cli/templates/static/codegen/src/Index.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res
codegenerator/cli/templates/static/codegen/src/globalState/GlobalState.res

🧠 Learnings (6)

📚 Learning: 2025-08-11T08:49:57.399Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: scenarios/test_codegen/.cursor/rules/hyperindex.mdc:0-0
Timestamp: 2025-08-11T08:49:57.399Z
Learning: Applies to scenarios/test_codegen/**/*.ts : When updating existing entities in handlers, always use the spread operator to create updated objects before persisting

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

📚 Learning: 2025-08-11T08:49:57.399Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: scenarios/test_codegen/.cursor/rules/hyperindex.mdc:0-0
Timestamp: 2025-08-11T08:49:57.399Z
Learning: Applies to scenarios/test_codegen/**/*.ts : In TypeScript, set relationship fields using *_id properties (e.g., token_id) rather than object references

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

📚 Learning: 2025-09-04T14:42:53.500Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: .cursor/rules/subgraph-migration.mdc:0-0
Timestamp: 2025-09-04T14:42:53.500Z
Learning: Applies to src/**/*.ts : Match entity field types exactly with schema: use bigint for BigInt fields, BigDecimal for BigDecimal fields, and correct scalar types

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

📚 Learning: 2025-08-11T08:47:04.346Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: scenarios/fuel_test/.cursor/rules/hyperindex.mdc:0-0
Timestamp: 2025-08-11T08:47:04.346Z
Learning: Applies to scenarios/fuel_test/**/*.ts : When updating existing entities in handlers, always use the spread operator to clone immutable returned objects before modifying and saving

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

📚 Learning: 2025-08-11T08:47:04.346Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: scenarios/fuel_test/.cursor/rules/hyperindex.mdc:0-0
Timestamp: 2025-08-11T08:47:04.346Z
Learning: Applies to scenarios/fuel_test/**/*.ts : In TypeScript code, set relationship fields using the `_id` form expected by generated types (e.g., `{ token_id: tokenId }`, not `{ token: tokenObject }`)

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

📚 Learning: 2025-08-11T08:49:57.399Z

Learnt from: CR
PR: enviodev/hyperindex#0
File: scenarios/test_codegen/.cursor/rules/hyperindex.mdc:0-0
Timestamp: 2025-08-11T08:49:57.399Z
Learning: Applies to scenarios/test_codegen/**/*.ts : Always cast timestamps from events to BigInt (e.g., BigInt(event.block.timestamp))

Applied to files:

scenarios/test_codegen/test/entity-column-types-test.ts

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)

GitHub Check: build_and_test
GitHub Check: aarch64-apple-darwin (macos-latest)
GitHub Check: x86_64-apple-darwin (macos-latest)
GitHub Check: aarch64-unknown-linux-gnu (ubuntu-22.04)
GitHub Check: x86_64-unknown-linux-musl (ubuntu-22.04)

🔇 Additional comments (22)

scenarios/test_codegen/test/entity-column-types-test.ts (1)

161-161: Exact column-count check aligns with removal of db_write_timestamp — LGTM

Stricter equality is correct now that no extra auto-generated column is present.

TypeScript compile verification required — run pnpm -s tsc --noEmit and paste stdout/stderr (previous run produced no output).

scenarios/test_codegen/test/SerDe_Test.res (1)

129-129: Updated CREATE TABLE expectation (no db_write_timestamp) is consistent — LGTM

Matches the generator/template removals.

scenarios/test_codegen/test/lib_tests/PgStorage_test.res (6)

68-69: A table DDL expectation updated (timestamp column removed) — LGTM

78-80: B table DDL expectation updated — LGTM

88-93: Default-values DDL expectation for A updated — LGTM

150-154: Main DDL: raw_events/A/B definitions updated to drop db_write_timestamp — LGTM

The new raw_events layout placing serial immediately after params matches insert behavior.

235-236: Minimal config DDL: raw_events change reflected — LGTM

317-319: Public schema DDL: raw_events and A table updates — LGTM
codegenerator/cli/npm/envio/src/Utils.res (1)
306-308: Ensure Array.at is supported at runtime or replace with a pure ReScript fallback

Ran verification: no engines.node found in package.json files scanned; no setup-node occurrences in .github workflows; jq failed parsing a template file — no repo evidence guarantees Array.prototype.at.

File: codegenerator/cli/npm/envio/src/Utils.res (lines 306–308)

Either declare/lock a Node engine (Node ≥ 16.6) and pin CI, or replace the external with this pure ReScript fallback (preserves negative-index semantics):
-  @send external at: (array<'a>, int) => option<'a> = "at"
+  /**
+   * Option-safe index access with negative index support.
+   * Pure ReScript version to avoid relying on Array.prototype.at at runtime.
+   */
+  let at = (arr: array<'a>, index: int): option<'a> => {
+    let len = arr->Array.length
+    let idx = if index >= 0 { index } else { len + index }
+    if idx >= 0 && idx < len {
+      Some(arr->Js.Array2.unsafe_get(idx))
+    } else {
+      None
+    }
+  }
codegenerator/cli/npm/envio/src/Batch.res (1)

61-71: LGTM: concise unordered-next indicator

Clear, short-circuiting check gated by isActivelyIndexing.

scenarios/test_codegen/test/lib_tests/SourceManager_test.res (3)

254-256: Good default coverage for targetBufferSize

Defaulting to 5000 here mirrors production and keeps tests representative.

293-301: Fields aligned with new buffering API

latestOnBlockBlockNumber and onBlockConfigs [] look correct with updateInternal.

799-800: Nice boundary test for queue saturation

Using targetBufferSize=4 exercises the “skip at max queue size” path well.

scenarios/test_codegen/test/ChainManager_test.res (3)

33-35: FetchState.make: targetBufferSize plumbed into tests

Keeps test harness consistent with runtime behavior.

115-115: isFetchingAtHead added to mock fetcher

Matches the new public field and keeps tests compiling.

271-273: Partition fields updated to buffering model

latestOnBlockBlockNumber and onBlockConfigs [] are appropriate; targetBufferSize included.

Also applies to: 282-282
codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res (3)
39-39: targetBufferSize is threaded through construction

End-to-end propagation into FetchState.make and initial state is correct.

Also applies to: 181-181, 201-201

470-475: Updated FetchState.handleQueryResult signature

Removal of currentBlockHeight arg is correctly reflected.

212-230: Factory signature updated to accept targetBufferSize

No issues spotted; callers must be updated accordingly.

Run to find any stale call sites:
#!/bin/bash
rg -nP 'ChainFetcher\.makeFromConfig\([^)]*\)' -C2
rg -nP 'ChainFetcher\.make\([^)]*\)' -C2
scenarios/test_codegen/test/lib_tests/FetchState_onBlock_test.res (1)

256-257: LGTM! Clear test structure with comprehensive onBlock functionality coverage.

The test suite effectively validates the new buffering-based block generation logic, covering various edge cases including intervals, start/end block boundaries, and multiple onBlock configs. The use of logIndex values (16777216 + index) to distinguish block items from different configs is a clever approach for deterministic testing.
codegenerator/cli/npm/envio/src/FetchState.res (2)
1168-1168: Fix inconsistent comment about rollback behavior.

The FIXME comment questions whether latestFetchedBlock should be rolled back, but the implementation doesn't rollback for wildcard partitions. This should either be fixed or the comment should be updated to explain why.

Should wildcard partitions have their latestFetchedBlock rolled back when a rollback occurs? The current implementation doesn't roll it back for partitions with dependsOnAddresses: false.

255-256: Verify supported runtimes before using Utils.Array.at(-index)

Array.prototype.at with negative indices is natively supported in Chrome 92+, Edge 92+, Firefox 90+, Safari 15.4+, Node.js 16.6.0+, Deno 1.12+. If you must support older engines, replace or polyfill. Location: codegenerator/cli/npm/envio/src/FetchState.res (lines 255-256).
-      }->Utils.Array.at(-fetchState.targetBufferSize) {
+      }->{
+        let arr = _
+        let idx = arr->Array.length - fetchState.targetBufferSize
+        if idx >= 0 {
+          arr->Array.get(idx)
+        } else {
+          None
+        }
+      } {

codegenerator/cli/npm/envio/src/FetchState.res

coderabbitai · 2025-09-12T16:01:53Z

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res

+  ~targetBufferSize,
  ~sql=Db.sql,
 ) => {


⚠️ Potential issue

Resume filter off-by-one and cleanup condition risk

restartBlockNumber is set to progressBlockNumber + 1, but the filter comment says “Start from the same block and filter out already processed.” As written, events in the next block (progressBlockNumber + 1) with logIndex ≤ progressNextBlockLogIndex would be incorrectly dropped, and cleanup uses bufferBlockNumber (likely the max), which can remove the filter while older-block items may still be queued. Suggest aligning to the last committed block and cleaning up based on the earliest queued block.

- let restartBlockNumber = - // Can be -1 when not set - resumedChainState.progressBlockNumber >= 0 - ? resumedChainState.progressBlockNumber + 1 - : resumedChainState.startBlock + // Resume from the last committed block; filter within it by logIndex. + let resumeBlockNumber = + resumedChainState.progressBlockNumber >= 0 + ? resumedChainState.progressBlockNumber + : resumedChainState.startBlock @@ - filter: qItem => { + filter: qItem => { switch qItem { | Internal.Event({blockNumber, logIndex}) | Internal.Block({blockNumber, logIndex}) => - //Only keep events greater than the last processed event - (blockNumber, logIndex) > (restartBlockNumber, progressNextBlockLogIndex) + // Only keep events strictly after the last processed (block, logIndex) + (blockNumber, logIndex) > (resumeBlockNumber, progressNextBlockLogIndex) } }, - isValid: (~fetchState) => { - //the filter can be cleaned up as soon as the fetch state block is ahead of the latestProcessedEvent blockNumber - fetchState->FetchState.bufferBlockNumber <= restartBlockNumber - }, + isValid: (~fetchState) => { + // Keep the filter until the earliest queued block advances beyond resumeBlockNumber + switch fetchState->FetchState.getEarliestEvent { + | Item({item}) => item->Internal.getItemBlockNumber <= resumeBlockNumber + | NoItem({latestFetchedBlock: {blockNumber}}) => blockNumber <= resumeBlockNumber + } + },

Please validate expected semantics with an end-to-end resume case (e.g., progressBlockNumber=100, progressNextBlockLogIndex=4) to ensure no events from block 101 are erroneously filtered and duplicates from block 100 are not reprocessed. You can grep for how progressBlockNumber/NextBlockLogIndex are persisted/consumed:

#!/bin/bash rg -n 'progressBlockNumber|progressNextBlockLogIndex' -C3 --type=res --type=sql --type=ts

Also applies to: 251-256, 262-275

🤖 Prompt for AI Agents

In codegenerator/cli/templates/static/codegen/src/eventFetching/ChainFetcher.res around lines 245-247 (also review 251-256 and 262-275), the resume logic sets restartBlockNumber to progressBlockNumber + 1 and uses bufferBlockNumber for cleaning up filters, which can incorrectly drop events in the next block and prematurely remove filters; change restartBlockNumber to start from progressBlockNumber (not +1) and make the filter comparison only drop events with logIndex ≤ progressNextBlockLogIndex when blockNumber == progressBlockNumber, allow all events from blockNumber > progressBlockNumber, and change cleanup to base removal on the earliest queued/block-minimum (not buffer/max) so older queued items aren’t lost; after changes, validate with an end-to-end resume case (e.g., progressBlockNumber=100, progressNextBlockLogIndex=4) to confirm block 101 events are kept and duplicates from block 100 are filtered.

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res (1)

105-108: Resolved: unordered next-item logic now correctly negated.

nextItemIsNone uses !hasUnorderedNextItem as expected. Thanks for addressing.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2e429c2 and d9a5fae.

📒 Files selected for processing (1)

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res (5 hunks)

🧰 Additional context used

📓 Path-based instructions (3)

**/*.{res,resi}

📄 CodeRabbit inference engine (.cursor/rules/rescript.mdc)

**/*.{res,resi}: Never use [| item |] to create an array. Use [ item ] instead.
Must always use = for setting value to a field. Use := only for ref values created using ref function.
ReScript has record types which require a type definition before hand. You can access record fields by dot like foo.myField.
It's also possible to define an inline object, it'll have quoted fields in this case.
Use records when working with structured data, and objects to conveniently pass payload data between functions.
Never use %raw to access object fields if you know the type.

Files:

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res

**/*.res

📄 CodeRabbit inference engine (.cursor/rules/navigation.mdc)

Prefer reading ReScript .res modules directly; compiled .js artifacts can be ignored

Files:

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res

codegenerator/cli/templates/{static,dynamic}/codegen/src/**/*

📄 CodeRabbit inference engine (.cursor/rules/navigation.mdc)

Edit template versions under codegenerator/cli/templates/static/codegen/src or codegenerator/cli/templates/dynamic/codegen/src instead of editing generated/src

Files:

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build_and_test

🔇 Additional comments (4)

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res (4)

21-24: Propagate targetBufferSize through makeFromConfig — LGTM.

The calculation and forwarding into ChainFetcher.makeFromConfig look correct.

48-52: Good instrumentation for visibility — LGTM.

Prometheus gauges for max batch size, target buffer size, and reorg threshold are helpful for ops.

61-66: Forwarding targetBufferSize to ChainFetcher.makeFromDbState — LGTM.

Consistent with constructor path; keeps buffering behavior uniform on resume.

213-214: Switched to cf.isFetchingAtHead property — verify remaining call sites.

rg -nP '\bChainFetcher.isFetchingAtHead\b' returned no matches; not conclusive. Run a full repo search for "isFetchingAtHead" and function-call forms (e.g. "isFetchingAtHead(" or ".isFetchingAtHead()") and update any remaining call sites. Location: codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res lines 213–214

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

scenarios/erc20_multichain_factory/test/DynamicContractRecovery_test.res (1)
164-173: Hoist test targetBufferSize; callsites already include ~targetBufferSize.

All ChainFetcher.makeFromDbState callsites include the ~targetBufferSize label; the test still hard-codes 5000 — hoist to a local constant and assert the wiring.

Apply this minimal change at the callsite:
-      ~targetBufferSize=5000,
+      ~targetBufferSize=targetBufferSize,
Add near the top of getFetchingDcAddressesFromDbState:
+    let targetBufferSize = 5_000
Optionally assert the wiring after construction:
+    Assert.equal(
+      chainFetcher.fetchState.targetBufferSize,
+      targetBufferSize,
+      ~message="ChainFetcher should use configured targetBufferSize",
+    )

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d9a5fae and 77d3f95.

⛔ Files ignored due to path filters (1)

scenarios/erc20_multichain_factory/pnpm-lock.yaml is excluded by !**/pnpm-lock.yaml

📒 Files selected for processing (1)

scenarios/erc20_multichain_factory/test/DynamicContractRecovery_test.res (1 hunks)

🧰 Additional context used

📓 Path-based instructions (2)

**/*.{res,resi}

📄 CodeRabbit inference engine (.cursor/rules/rescript.mdc)

**/*.{res,resi}: Never use [| item |] to create an array. Use [ item ] instead.
Must always use = for setting value to a field. Use := only for ref values created using ref function.
ReScript has record types which require a type definition before hand. You can access record fields by dot like foo.myField.
It's also possible to define an inline object, it'll have quoted fields in this case.
Use records when working with structured data, and objects to conveniently pass payload data between functions.
Never use %raw to access object fields if you know the type.

Files:

scenarios/erc20_multichain_factory/test/DynamicContractRecovery_test.res

**/*.res

📄 CodeRabbit inference engine (.cursor/rules/navigation.mdc)

Prefer reading ReScript .res modules directly; compiled .js artifacts can be ignored

Files:

scenarios/erc20_multichain_factory/test/DynamicContractRecovery_test.res

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build_and_test

JonoPrest · 2025-09-15T14:14:30Z

codegenerator/cli/npm/envio/src/FetchState.res

+let compareBufferItem = (a: Internal.item, b: Internal.item) => {
+  let blockDiff = b->Internal.getItemBlockNumber - a->Internal.getItemBlockNumber
+  if blockDiff === 0 {
+    b->Internal.getItemLogIndex - a->Internal.getItemLogIndex
+  } else {
+    blockDiff
+  }
+}


This looks backwards for a cmp function.

Usually negative return would mean a comes first and positive would mean b comes first

We still store buffer in desc order, so this is correct. I actually didn't change this part + it's covered with tests.

Cool 👍🏼

JonoPrest · 2025-09-15T16:03:29Z

codegenerator/cli/npm/envio/src/db/InternalTable.res

-      mkField(
-        "db_write_timestamp",
-        TimestampWithoutTimezone,
-        ~default="CURRENT_TIMESTAMP",
-        ~fieldSchema=S.int,
-      ),


This is kind of a breaking change because this was public on hasura. I think that's why it was left in.

I don't like it there by default but it is unfortunately breaking.

Yes, but it's not the first time we are breaking undocumented behaviour. I think it should be fine, unless someone complains.

DZakh added 5 commits September 11, 2025 17:03

Remove db_write_timestamp

9bf7149

Prevent OOM for big ranges of onBlock handler

c338b77

Update tests

c8d2650

Improve rollback logic for onBlock handlers

ac600cf

Fix block handler for multichain

2e429c2

DZakh requested a review from JonoPrest September 12, 2025 15:48

coderabbitai bot reviewed Sep 12, 2025

View reviewed changes

Fix nextItemIsNone check

d9a5fae

coderabbitai bot reviewed Sep 13, 2025

View reviewed changes

codegenerator/cli/templates/static/codegen/src/eventFetching/ChainManager.res Show resolved Hide resolved

Fix tests

77d3f95

coderabbitai bot reviewed Sep 15, 2025

View reviewed changes

DZakh enabled auto-merge (squash) September 15, 2025 13:21

JonoPrest reviewed Sep 15, 2025

View reviewed changes

DZakh requested a review from JonoPrest September 16, 2025 08:14

JonoPrest approved these changes Sep 18, 2025

View reviewed changes

DZakh merged commit ed71a78 into main Sep 18, 2025
8 checks passed

DZakh deleted the dz/safe-on-block branch September 18, 2025 13:37

This was referenced Sep 19, 2025

Reproduce duplicate history on rollback bug #745

Closed

Persist only processed DCs #752

Merged

Always include all block events to a batch #753

Merged

Uh oh!

Improve stability of Block Handlers #738

Improve stability of Block Handlers #738

Uh oh!

Conversation

DZakh commented Sep 12, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Sep 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Sep 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

JonoPrest Sep 15, 2025

Choose a reason for hiding this comment

Uh oh!

DZakh Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

JonoPrest Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

JonoPrest Sep 15, 2025

Choose a reason for hiding this comment

Uh oh!

DZakh Sep 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DZakh commented Sep 12, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Sep 12, 2025 •

edited

Loading