core/state: trie prefetcher change: calling trie() doesn't stop the associated subfetcher #29035
Conversation
Failed tests... the last commit borks something.
Okay. CI is running successfully locally on two separate machines (mac/linux) for me. Unsure why it fails in GitHub Actions.
Could you please elaborate a bit more on what this PR tries to achieve, and why that is better?
As in, why it's needed for your upcoming work, but also what the effects on the current code are. Does it improve anything? Simplify something? Degrade something?
```go
// repeated.
// 2. Finalize of the main account trie. This happens only once per block.
func (p *triePrefetcher) prefetch(owner common.Hash, root common.Hash, addr common.Address, keys [][]byte) error {
	if p.closed {
```
Is there ever multi-threaded access to prefetch / close? If so, it might be better to make `closed` an `atomic.Bool`. But if we don't access it that way, it's fine.
The prefetcher is manipulated in the statedb, which is not regarded as thread-safe. I believe we never access prefetch from multiple threads.
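For illustration, here is a minimal sketch (hypothetical, not code from this PR) of what the suggested `atomic.Bool` flag would look like if prefetch and close were ever called from different goroutines. Only the `closed` field and `errTerminated` are modeled loosely on the real prefetcher:

```go
package main

import (
	"errors"
	"sync/atomic"
)

var errTerminated = errors.New("prefetcher is closed")

type triePrefetcher struct {
	closed atomic.Bool // race-free flag, instead of a plain bool
}

func (p *triePrefetcher) prefetch(keys [][]byte) error {
	if p.closed.Load() { // atomic read, safe from any goroutine
		return errTerminated
	}
	// ... hand the keys to the subfetcher ...
	return nil
}

func (p *triePrefetcher) close() {
	p.closed.Store(true) // atomic write
}

func main() {
	p := new(triePrefetcher)
	_ = p.prefetch(nil) // succeeds
	p.close()
	_ = p.prefetch(nil) // now returns errTerminated
}
```

As the reply above notes, the statedb is single-threaded, so the plain bool is sufficient in practice; the sketch only shows the reviewer's alternative.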
The biggest difference this pull request introduces: originally, when a trie was requested, the prefetching tasks were suspended, then resumed once the live trie had been copied and returned. This approach could cause problems for witness collection. For instance: if slot A is accessed but not modified, the relevant trie nodes should be included in the witness. Slot A will be scheduled for prefetching and queued in the task list; but if the corresponding storage trie is retrieved before slot A has been preloaded, the relevant trie nodes won't be included in the witness. In this pull request, the prefetching tasks scheduled before retrieving the trie are always executed, which resolves the issue.

Besides, the Copy function is dropped for simplification; the original complexity wasn't worthwhile anyway. statedb.Copy is mostly used in the miner, but after the change made by marius, we no longer copy the statedb in the miner, so it makes no sense to copy the prefetcher for the performance gain.
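To make the new contract concrete, here is a self-contained toy model of the behavior described above. It is an assumption-laden sketch, not geth's actual triePrefetcher: a string slice stands in for the trie, and task batches travel on a channel rather than a locked queue. The point it demonstrates is that trie() only returns after every task scheduled before the call has been resolved, and the fetcher keeps accepting work afterwards:

```go
package main

import "fmt"

type subfetcher struct {
	tasks chan []string      // pending key batches; unbuffered hand-off
	reads chan chan []string // trie() requests, answered with resolved keys
}

func newSubfetcher() *subfetcher {
	sf := &subfetcher{
		tasks: make(chan []string),
		reads: make(chan chan []string),
	}
	go sf.loop()
	return sf
}

// prefetch schedules keys; because the channel is unbuffered, the loop has
// accepted the batch by the time this returns.
func (sf *subfetcher) prefetch(keys []string) { sf.tasks <- keys }

// trie returns everything resolved so far. Every batch accepted before this
// call has already been processed, and the loop keeps running afterwards
// (previously, requesting the trie suspended any still-pending tasks).
func (sf *subfetcher) trie() []string {
	resp := make(chan []string)
	sf.reads <- resp
	return <-resp
}

func (sf *subfetcher) loop() {
	var resolved []string
	for {
		select {
		case batch := <-sf.tasks:
			resolved = append(resolved, batch...) // "load" the keys
		case resp := <-sf.reads:
			resp <- append([]string(nil), resolved...) // copy out a snapshot
		}
	}
}

func main() {
	sf := newSubfetcher()
	sf.prefetch([]string{"slotA"})
	fmt.Println(sf.trie()) // [slotA]: slotA was resolved before trie returned
	sf.prefetch([]string{"slotB"}) // still accepted: the fetcher did not terminate
	fmt.Println(sf.trie()) // [slotA slotB]
}
```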
Looking good, but we need to run it on the benchmarking infra for a bit
Unfortunately, the added overhead to block processing and the increased memory usage mean that this PR is a non-starter.
I think it's maybe a bit premature to close this. Please add the relevant graphs/charts from the benchmark run, so we can discuss and reason about it.
```diff
@@ -237,7 +194,7 @@ func newSubfetcher(db Database, state common.Hash, owner common.Hash, root commo
 		owner: owner,
 		root:  root,
 		addr:  addr,
-		wake:  make(chan struct{}, 1),
+		wake:  make(chan struct{}),
```
This change seems problematic. Whereas the original code allowed scheduling a request and then flying off to continue execution, the new code (along with L217 below) will block until a previous request is done (actually, block on the second request).
Any particular reason you made the async notifier synchronous?
Well spotted!
There's a change in this PR that converted the async wake notifier into a sync one, which can potentially have a really heavy hit on performance, as it will block execution during preloading. Perhaps that's the root of the slowdown.
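A minimal runnable sketch of the difference being discussed (illustrative only; the worker body and timings are made up). With a one-slot buffer plus select/default, a wake-up is fire-and-forget; with an unbuffered channel, the second send stalls until the worker has finished the first task, exactly the "block on the second request" behavior noted above:

```go
package main

import (
	"fmt"
	"time"
)

func main() {
	// Original pattern: a one-slot buffered channel as an async notifier.
	// The send never blocks; if a wake-up is already pending, the two
	// notifications coalesce and the caller moves on immediately.
	wake := make(chan struct{}, 1)
	notify := func() {
		select {
		case wake <- struct{}{}:
		default: // a wake-up is already queued; nothing to do
		}
	}
	notify()
	notify() // returns instantly even though nothing has received yet
	fmt.Println("async notifications done")

	// This PR's pattern: an unbuffered channel. Each send blocks until the
	// worker loop receives, so the caller is paced by the worker.
	wakeSync := make(chan struct{})
	go func() {
		for range wakeSync {
			time.Sleep(10 * time.Millisecond) // simulate prefetch work
		}
	}()
	start := time.Now()
	wakeSync <- struct{}{} // accepted immediately by the idle worker
	wakeSync <- struct{}{} // blocks until the first "task" is finished
	fmt.Println("sync notifications took", time.Since(start))
}
```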
Closing and reopening as my own because the OP didn't permit maintainer changes.
:'( I don't know why this is suddenly something I have to explicitly enable. Until recently, PRs I've made have been maintainer-modifiable by default...
This pulls the trie prefetcher changes from #29029 into a separate PR.
It changes the trie prefetcher s.t. a call to `trie` returns a trie (after any pending tasks have been resolved in the subfetcher). After a call to `trie`, the subfetcher is now available for more work (instead of terminating).