`fsync` more

## Description

In our quest against database corruption, I think we should start calling `fsync` in more places. The problem is that this could cause serious performance degradation if we do it too liberally.

This PR for tree-states (https://github.com/sigp/lighthouse/pull/5144) adds some fsyncs to prevent issues during backfill. I suspect that fsync will be most useful when coordinating changes across multiple databases e.g. hot & freezer, or hot & blobs. On `stable` we currently only have one synchronisation point, which is during the hot -> cold database migration:

https://github.com/sigp/lighthouse/blob/c7e5dd1098a145c0d2174dc65bd23faeb5074249/beacon_node/store/src/hot_cold_store.rs#L2487-L2498

This synchronisation was added by @adaszko, and I reckon it has probably saved us from many more instances of DB corruption.

## Steps to resolve

1. Add fsyncs during all database transactions that involve >1 database. Likely candidates for a re-work are `do_atomically_with_block_and_blobs_cache` and backfill (the tree-states PR).
2. Measure the impact of these `fsync` additions on performance. For backfill the metric is #blocks/sec speed, although this is a little hard to control when network factors also play a role (we could import Era files to remove this variability).
3. (Optional) Consider adding more `fsync`s after database operations involving a single database. The "early attester cache" in block processing should mean that it's OK for us to spend a little more time on the database write (it is _off the hot path_). 


	// Warning: Critical section. We have to take care not to put any of the two databases in an
	// inconsistent state if the OS process dies at any point during the freezing
	// procedure.
	//
	// Since it is pretty much impossible to be atomic across more than one database, we trade
	// losing track of states to delete, for consistency. In other words: We should be safe to die
	// at any point below but it may happen that some states won't be deleted from the hot database
	// and will remain there forever. Since dying in these particular few lines should be an
	// exceedingly rare event, this should be an acceptable tradeoff.

	// Flush to disk all the states that have just been migrated to the cold store.
	store.cold_db.sync()?;

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`fsync` more #5145

Description

Steps to resolve

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

fsync more #5145

Description

Description

Steps to resolve

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`fsync` more #5145