Introduce cache index for searchable snapshots (#60522) #61595

DaveCTurner · 2020-08-26T17:19:14Z

If a searchable snapshot shard fails (e.g. its node leaves the cluster)
we want to be able to start it up again on a different node as quickly
as possible to avoid unnecessarily blocking or failing searches. It
isn't feasible to fully restore such shards in an acceptably short time.
In particular we would like to be able to deal with the can_match
phase of a search ASAP so that we can skip unnecessary waiting on shards
that may still be warming up but which are not required for the search.

This commit solves this problem by introducing a system index that holds
much of the data required to start a shard. Today(*) this means it holds
the contents of every file with size <8kB, and the first 4kB of every
other file in the shard. This system index acts as a second-level cache,
behind the first-level node-local disk cache but in front of the blob
store itself. Reading chunks from the index is slower than reading them
directly from disk, but faster than reading them from the blob store,
and is also replicated and accessible to all nodes in the cluster.

(*) the exact heuristics for what we should put into the system index
are still under investigation and may change in future.

This second-level cache is populated when we attempt to read a chunk
which is missing from both levels of cache and must therefore be read
from the blob store.

We also introduce SearchableSnapshotsBlobStoreCacheIntegTests which
verify that we do not hit the blob store more than necessary when
starting up a shard that we've seen before, whether due to a node
restart or because a snapshot was mounted multiple times.

Co-authored-by: David Turner david.turner@elastic.co

If a searchable snapshot shard fails (e.g. its node leaves the cluster) we want to be able to start it up again on a different node as quickly as possible to avoid unnecessarily blocking or failing searches. It isn't feasible to fully restore such shards in an acceptably short time. In particular we would like to be able to deal with the `can_match` phase of a search ASAP so that we can skip unnecessary waiting on shards that may still be warming up but which are not required for the search. This commit solves this problem by introducing a system index that holds much of the data required to start a shard. Today(*) this means it holds the contents of every file with size <8kB, and the first 4kB of every other file in the shard. This system index acts as a second-level cache, behind the first-level node-local disk cache but in front of the blob store itself. Reading chunks from the index is slower than reading them directly from disk, but faster than reading them from the blob store, and is also replicated and accessible to all nodes in the cluster. (*) the exact heuristics for what we should put into the system index are still under investigation and may change in future. This second-level cache is populated when we attempt to read a chunk which is missing from both levels of cache and must therefore be read from the blob store. We also introduce `SearchableSnapshotsBlobStoreCacheIntegTests` which verify that we do not hit the blob store more than necessary when starting up a shard that we've seen before, whether due to a node restart or because a snapshot was mounted multiple times. Co-authored-by: David Turner <david.turner@elastic.co>

elasticmachine · 2020-08-26T17:19:16Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

DaveCTurner added >enhancement :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs backport v7.10.0 labels Aug 26, 2020

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Aug 26, 2020

pull bot pushed a commit to kp-forks/elasticsearch that referenced this pull request Aug 27, 2020

Disable BWC tests while merging elastic#61595

c291b6d

DaveCTurner merged commit e14d9c9 into elastic:7.x Aug 27, 2020

DaveCTurner deleted the 2020-08-26-backport-60522 branch August 27, 2020 05:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Introduce cache index for searchable snapshots (#60522) #61595

Introduce cache index for searchable snapshots (#60522) #61595

Uh oh!

DaveCTurner commented Aug 26, 2020

Uh oh!

elasticmachine commented Aug 26, 2020

Uh oh!

Uh oh!

Introduce cache index for searchable snapshots (#60522) #61595

Introduce cache index for searchable snapshots (#60522) #61595

Uh oh!

Conversation

DaveCTurner commented Aug 26, 2020

Uh oh!

elasticmachine commented Aug 26, 2020

Uh oh!

Uh oh!