chore(trie): fully reveal sparse tries prior to leaf updates/removals #17643

mediocregopher · 2025-07-28T14:32:02Z

Background

The SerialSparseTrie and ParallelSparseTrie have their nodes revealed using the DecodedMultiProofs generated from the previous database state and the changed leaf set. Once the nodes are revealed, the leaf updates/removals are applied to the sparse tries and root hashes are calculated.

There are specific edge cases during leaf addition/removal where a node outside the changeset is required to complete the operation. In these cases we were falling back to a single database lookup to fill in the gap.

By modifying the generation of the DecodedMultiProofs to include those extra required nodes we can simplify the sparse trie implementations significantly, as well as the surrounding engine code which supports these one-off lookups.
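To illustrate the kind of edge case described above, here is a minimal sketch of a toy branch node. It is not reth's sparse trie implementation, and the names `Child`, `RemoveOutcome` and `remove_child` are hypothetical. It shows that removing a leaf from a two-child branch forces the branch to collapse into the remaining sibling, which is only possible if that sibling has been revealed, even though the sibling is not part of the changeset.

```rust
use std::collections::BTreeMap;

/// A child slot of a branch node in a toy sparse trie.
#[derive(Debug, Clone, PartialEq)]
enum Child {
    /// Only the hash is known; the node was not included in the proof.
    Blinded,
    /// The node is revealed; in this sketch it is always a leaf with a value.
    Leaf(u64),
}

/// Result of removing one child from a branch.
#[derive(Debug, PartialEq)]
enum RemoveOutcome {
    /// Two or more children remain, so the branch stays a branch.
    StillBranch,
    /// The branch collapsed into its single remaining, revealed child.
    Collapsed { nibble: u8, value: u64 },
    /// The branch must collapse, but the remaining child is blinded.
    /// This is the situation that previously required a database lookup.
    NeedsReveal { nibble: u8 },
}

/// Removes the child at `nibble`. A valid branch has at least two children,
/// so after one removal at least one child is expected to remain.
fn remove_child(children: &mut BTreeMap<u8, Child>, nibble: u8) -> RemoveOutcome {
    children.remove(&nibble);
    if children.len() != 1 {
        return RemoveOutcome::StillBranch;
    }
    let (&nibble, child) = children.iter().next().expect("exactly one child remains");
    match child {
        Child::Leaf(value) => RemoveOutcome::Collapsed { nibble, value: *value },
        Child::Blinded => RemoveOutcome::NeedsReveal { nibble },
    }
}
```

With a branch holding a revealed leaf at nibble `0x3` and a blinded child at `0xa`, removing `0x3` yields `NeedsReveal { nibble: 0xa }`. Including the sibling in the multiproof up front turns that case into `Collapsed`.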

Changes

This primarily relies on a change to the ProofRetainer in alloy-trie, made here:

alloy-rs/trie#109

These changes are enabled by a flag which the payload validator enables; other places where proofs are generated remain unaffected.

There is a further change here to the trie walker to not skip over branch nodes when they might be involved in a leaf removal. A change to support leaf addition is not required, because that case involves children of extension nodes, and we don't ever skip over extension nodes anyway.
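The walker change can be pictured with the following sketch. It is a deliberately conservative simplification and not the check in `walker.rs`: the type `RemovedLeaves` and the function `can_skip_node` are hypothetical, and the real condition may be narrower. The idea shown is that an unchanged node may normally be skipped (its cached hash is reused), but not when a removed leaf sits under the same parent branch, because that removal may collapse the branch and require the node to be revealed.

```rust
use std::collections::HashSet;

/// A trie path expressed as a sequence of nibbles (values 0..=15).
type Nibbles = Vec<u8>;

/// Hypothetical stand-in for the set of leaf paths removed in a block.
#[derive(Debug, Default)]
struct RemovedLeaves(HashSet<Nibbles>);

impl RemovedLeaves {
    /// Returns true if any removed leaf path starts with `prefix`.
    fn any_under(&self, prefix: &[u8]) -> bool {
        self.0.iter().any(|leaf| leaf.starts_with(prefix))
    }
}

/// Decides whether a walker may skip a node and reuse its cached hash.
///
/// `parent_branch` is the path of the branch that holds the node, and
/// `subtree_changed` says whether the change set touches the node's subtree.
fn can_skip_node(parent_branch: &[u8], subtree_changed: bool, removed: &RemovedLeaves) -> bool {
    if subtree_changed {
        // Changed subtrees always have to be walked.
        return false;
    }
    // An unchanged node is still needed if a sibling under the same parent
    // branch is being removed, since the branch may collapse into this node.
    !removed.any_under(parent_branch)
}
```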

The sparse trie code that falls back to the db for missing nodes remains in place for now, but we emit a warning when it happens. This will let us track whether there are any edge cases we're missing.

Benchmarks - 2k blocks on main

No significant change

Timestamp: 2025-07-28 10:37:12 UTC
Baseline: main
Feature:  origin/mediocregopher/17571-leaf-updates-removals

Performance Changes:
  NewPayload Latency: -1.24%
  FCU Latency:        -1.74%
  Total Latency:      -1.26%
  Gas/Second:         +1.27%
  Blocks/Second:      +1.27%

Baseline Summary:
  Blocks: 2000, Gas: 36118922440, Duration: 74.02s
  Avg NewPayload: 35.69ms, Avg FCU: 1.29ms, Avg Total: 36.99ms
  Started: 2025-07-28 10:48:20 UTC, Ended: 2025-07-28 10:56:39 UTC

Feature Summary:
  Blocks: 2000, Gas: 36118922440, Duration: 73.08s
  Avg NewPayload: 35.25ms, Avg FCU: 1.27ms, Avg Total: 36.52ms
  Started: 2025-07-28 10:58:35 UTC, Ended: 2025-07-28 11:06:55 UTC
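
For readers who want to sanity-check the percentages, here is a small sketch of the arithmetic. The helper names `percent_change` and `gas_per_second` are hypothetical and not part of the benchmark tooling. Recomputing from the rounded averages printed above only approximates the reported figures, which were presumably derived from unrounded measurements; the FCU latency in particular is too small for two decimal places to reproduce the reported change closely.

```rust
/// Relative change from `baseline` to `feature`, in percent.
/// Negative values mean the feature run produced the smaller number.
fn percent_change(baseline: f64, feature: f64) -> f64 {
    (feature - baseline) / baseline * 100.0
}

/// Throughput of a run in gas per second.
fn gas_per_second(total_gas: u64, duration_secs: f64) -> f64 {
    total_gas as f64 / duration_secs
}
```

For example, `percent_change(35.69, 35.25)` gives roughly -1.23 for NewPayload latency, and comparing `gas_per_second` for the two durations (74.02s and 73.08s) with the same total gas gives roughly +1.29.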

Fixes #17571

Rjected · 2025-07-30T02:30:26Z

> The sparse trie code to fallback to the db for missing nodes remains in place for now, but we emit a warning when it happens. This will let us track if there's any edge-cases we're missing.

Assuming there are no warnings from the benches? Are there any changes in the metrics (nodes fetched, for example)?

mediocregopher · 2025-07-30T13:50:38Z

> > The sparse trie code to fallback to the db for missing nodes remains in place for now, but we emit a warning when it happens. This will let us track if there's any edge-cases we're missing.
>
> Assuming there are no warnings from the benches? Are there any changes in the metrics (nodes fetched for ex)

I swear there were no warnings when I ran this against mainnet on Monday, but now I'm seeing two related to leaf removal, so there's a bit more investigation to do here 🤦

Regarding metrics, I just reran the bench (2k blocks on mainnet) and this time it came out 3% faster 🤷. There were indeed more multiproof nodes fetched and more db txs generally, though.

Summary:
Total multiproof account nodes: +20.4118%
Total multiproof storage nodes: +19.1292%
Opened Read-Write Transactions: +13.9773%
Opened Read-Only Transactions: +12.8246%

Rjected

the comments are very helpful, I mainly have one question about empty account edge cases, and a suggestion wrt the span ID

crates/trie/trie/src/walker.rs

Rjected · 2025-08-17T21:59:56Z

crates/trie/common/src/added_removed_keys.rs

+                if account.is_empty() {
+                    self.account.insert_removed(*hashed_address);
+                }
+                continue
+            }
+
+            let storage_removed_keys =
+                self.storages.entry(*hashed_address).or_insert_with(default_added_removed_keys);
+
+            for (key, val) in &storage.storage {
+                if *val == U256::ZERO {
+                    storage_removed_keys.insert_removed(*key);
+                } else {
+                    storage_removed_keys.remove_removed(key);
+                }
+            }
+
+            if !account.is_empty() {
+                self.account.remove_removed(hashed_address);
+            }


hmm, is it possible for these account empty cases to be hit? not suggesting they be removed, just wondering, we might want to just add docs if they can't be

mediocregopher · Aug 18, 2025

I originally thought it wasn't possible, but they do actually get hit. For example, if I comment out these self.account lines I get this warning on block 22729256 of mainnet:

WARN Branch node child not revealed in remove_leaf, falling back to db child_path=Nibbles(0xba982e83) leaf_full_path=Nibbles(0xba982e827371ca20eb274c77c02d21b0acb7e04fed4f6b28a73948060688dc6e)

This corresponds to https://etherscan.io/address/0xecb7ca9ec4cb52536b61227176993216cf4f4154, which gets self-destructed in that tx without ever having sent a tx itself (and therefore has a zero nonce).
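
The bookkeeping discussed in this thread can be sketched as follows. This is a simplified, self-contained approximation of the diff hunk above, not the code in `added_removed_keys.rs`: the types `RemovedKeys`, `AccountUpdate` and `MultiRemovedKeys` are hypothetical stand-ins, `u64` replaces the real hashed keys and `U256` values, and the branch condition that is cut off at the top of the hunk is approximated by the empty-account check alone.

```rust
use std::collections::{HashMap, HashSet};

/// Stand-in for a hashed address or hashed storage slot.
type HashedKey = u64;

/// Tracks keys whose leaves are removed from a trie.
#[derive(Debug, Default)]
struct RemovedKeys {
    removed: HashSet<HashedKey>,
}

impl RemovedKeys {
    fn insert_removed(&mut self, key: HashedKey) {
        self.removed.insert(key);
    }
    fn remove_removed(&mut self, key: &HashedKey) {
        self.removed.remove(key);
    }
    fn is_removed(&self, key: &HashedKey) -> bool {
        self.removed.contains(key)
    }
}

/// Simplified per-account state update: emptiness plus changed slots.
struct AccountUpdate {
    is_empty: bool,
    storage: HashMap<HashedKey, u64>,
}

/// Removed keys for the account trie and for each storage trie.
#[derive(Debug, Default)]
struct MultiRemovedKeys {
    account: RemovedKeys,
    storages: HashMap<HashedKey, RemovedKeys>,
}

impl MultiRemovedKeys {
    fn update_from_state(&mut self, updates: &HashMap<HashedKey, AccountUpdate>) {
        for (hashed_address, update) in updates {
            if update.is_empty {
                // An empty account has no leaf, so its key counts as removed.
                self.account.insert_removed(*hashed_address);
                continue;
            }
            let storage_removed = self.storages.entry(*hashed_address).or_default();
            for (slot, value) in &update.storage {
                if *value == 0 {
                    // A zeroed slot removes its leaf from the storage trie.
                    storage_removed.insert_removed(*slot);
                } else {
                    // A non-zero write means the slot's leaf exists (again).
                    storage_removed.remove_removed(slot);
                }
            }
            // A non-empty account has a leaf, so it is no longer removed.
            self.account.remove_removed(hashed_address);
        }
    }
}
```

In this sketch, an account that is reported empty in one update and non-empty in a later one ends up not marked as removed, which mirrors the `remove_removed` calls in the hunk.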

crates/trie/parallel/src/proof_task.rs

mattsse

I believe I was able to follow along here

all of this made sense, lgtm but also want @Rjected to take a final look at this

crates/trie/common/src/added_removed_keys.rs

mattsse · 2025-08-21T10:38:13Z

crates/trie/trie/src/proof/mod.rs

@@ -183,7 +184,7 @@ where

 /// Generates storage merkle proofs.
 #[derive(Debug)]
-pub struct StorageProof<T, H> {
+pub struct StorageProof<T, H, K = AddedRemovedKeys> {


ah I see, this way this works for both ProofRetainer and AddedRemovedKeys

makes sense
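
The default type parameter in the hunk above can be illustrated with a small sketch. Everything except the `StorageProof<T, H, K = AddedRemovedKeys>` signature is an assumption: the field names, the `AsRef<AddedRemovedKeys>` bound, the methods, and the toy `AddedRemovedKeys` contents are hypothetical and chosen only to show how a defaulted parameter keeps existing `StorageProof<T, H>` users compiling while allowing other key holders such as an `Arc`.

```rust
use std::sync::Arc;

/// Toy stand-in for the real key tracker; it only stores removed keys.
#[derive(Debug, Default)]
struct AddedRemovedKeys {
    removed: Vec<u64>,
}

// `AsRef<T>` is not implemented for `T` automatically, so the sketch adds it.
impl AsRef<AddedRemovedKeys> for AddedRemovedKeys {
    fn as_ref(&self) -> &AddedRemovedKeys {
        self
    }
}

/// Same generic shape as in the diff; the fields are hypothetical.
struct StorageProof<T, H, K = AddedRemovedKeys> {
    trie_cursor: T,
    hashed_cursor: H,
    added_removed_keys: Option<K>,
}

// Written without `K`, so this impl applies to the default `K = AddedRemovedKeys`.
impl<T, H> StorageProof<T, H> {
    fn new(trie_cursor: T, hashed_cursor: H) -> Self {
        Self { trie_cursor, hashed_cursor, added_removed_keys: None }
    }
}

impl<T, H, K> StorageProof<T, H, K> {
    /// Swaps in a different key holder, changing the `K` parameter.
    fn with_added_removed_keys<K2>(self, keys: Option<K2>) -> StorageProof<T, H, K2> {
        StorageProof {
            trie_cursor: self.trie_cursor,
            hashed_cursor: self.hashed_cursor,
            added_removed_keys: keys,
        }
    }
}

impl<T, H, K: AsRef<AddedRemovedKeys>> StorageProof<T, H, K> {
    /// Works for any `K` that can be viewed as `AddedRemovedKeys`.
    fn is_removed(&self, key: u64) -> bool {
        self.added_removed_keys
            .as_ref()
            .is_some_and(|keys| keys.as_ref().removed.contains(&key))
    }
}

/// Builds a shared key tracker that can be handed to several proofs.
fn shared_keys(removed: Vec<u64>) -> Arc<AddedRemovedKeys> {
    Arc::new(AddedRemovedKeys { removed })
}
```

Here `StorageProof<(), ()>` and `StorageProof<(), (), AddedRemovedKeys>` name the same type, while `with_added_removed_keys(Some(shared_keys(vec![7])))` produces a `StorageProof<(), (), Arc<AddedRemovedKeys>>` that supports the same `is_removed` call.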

Co-authored-by: Matthias Seitz <matthias.seitz@outlook.de>

Rjected

LGTM, nice

Rjected · 2025-08-21T18:46:51Z

needs docs fixes

In #17643 we introduced tracking of removed keys within a block in order to fully reveal sparse tries prior to updating/removing leaves. This means we never have to block root calculation to reveal blinded nodes in the sparse tries. This was implemented by assuming that all keys are added (as opposed to being only modified), which results in over-revealing the sparse tries with all extension node children. This isn't logically incorrect, but is a bit wasteful.

This change implements proper tracking of added keys alongside tracking of removed keys. When a key is added we always generate its proof in `on_state_update`, just like key removal. Because we can now enforce that added keys have their proofs generated in `on_state_update`, we no longer need to optimistically retain extra proofs in prewarms.

github-project-automation bot added this to Reth Tracker Jul 28, 2025

github-project-automation bot moved this to Backlog in Reth Tracker Jul 28, 2025

mediocregopher added 2 commits July 28, 2025 16:50

Merge remote-tracking branch 'upstream/main' into mediocregopher/1757…

4a7c73f

…1-leaf-updates-removals

only_sibling -> sole_sibling for better clarity

cf12967

mediocregopher added the C-perf (A change motivated by improving speed, memory usage or disk footprint) and A-trie (Related to Merkle Patricia Trie implementation) labels Jul 28, 2025

mediocregopher added 18 commits July 31, 2025 12:53

Update alloy-trie, add trace span to storage proof calc

37572ea

SerialSparseTrie tracing

51f047e

Perform leaf updates prior to leaf removals

6463e81

Use AddedRemovedKeys to always regenerate proofs for removed branch c…

9040cfc

…hildren

debug

7742426

Pass AddedRemovedKeys into TrieWalker

7e53361

Fix for TrieWalker can skip check

5848cb6

debug

62d583d

Track account removals in AddedRemovedKeys

cc7ef25

Merge remote-tracking branch 'upstream/main' into mediocregopher/1757…

da65ce9

…1-leaf-updates-removals-debug

Cleanup

d33a8b6

Don't track removed leaves as non-targets

b238e17

Pass MultiAddedRemovedKeys using an Arc rather than cloning

3399765

Pass AddedRemovedKeys as references

d3e58b6

Fix extension nodes

dcdeddc

Tests and TODOs

6bfae51

Fix bug in AddedRemovedKeys::update_from_state

30fff28

alloy trie branch

009c2c9

mediocregopher mentioned this pull request Aug 6, 2025

feat: retain proofs of non-target nodes in certain edge-cases. alloy-rs/trie#109

Merged

mediocregopher added 2 commits August 6, 2025 06:17

Clippy and fmt

a34deb5

Update alloy trie dependency

0799c3e

mattsse assigned Rjected Aug 12, 2025

Rjected reviewed Aug 17, 2025

PR feedback

9110060

jenpaff unassigned Rjected Aug 18, 2025

mediocregopher added 3 commits August 19, 2025 11:29

Use merged alloy-trie

d194108

Merge remote-tracking branch 'upstream/main' into mediocregopher/1757…

9f24f5a

…1-leaf-updates-removals

Comment patch

fb518aa

mediocregopher marked this pull request as ready for review August 19, 2025 10:39

mediocregopher requested review from rkrasiuk, shekhirin, mattsse, fgimenez and gakonst as code owners August 19, 2025 10:39

ci

309efa3

mediocregopher requested a review from Rjected August 21, 2025 09:35

mattsse reviewed Aug 21, 2025

mediocregopher and others added 3 commits August 21, 2025 12:47

Update crates/trie/common/src/added_removed_keys.rs

8e12a0a

Co-authored-by: Matthias Seitz <matthias.seitz@outlook.de>

Merge branch 'main' into mediocregopher/17571-leaf-updates-removals

3e1d658

fix unused dep

8b8ad16

Rjected approved these changes Aug 21, 2025

github-project-automation bot moved this from Backlog to In Progress in Reth Tracker Aug 21, 2025

doc fix

5ce7a09

mediocregopher enabled auto-merge August 22, 2025 09:02

mediocregopher added this pull request to the merge queue Aug 22, 2025

Merged via the queue into main with commit 8193fcf Aug 22, 2025
41 checks passed

mediocregopher deleted the mediocregopher/17571-leaf-updates-removals branch August 22, 2025 09:34

github-project-automation bot moved this from In Progress to Done in Reth Tracker Aug 22, 2025
