Skip to content

Multi-HNSW graphs per segment? #14341

Open
@benwtrent

Description

@benwtrent

Description

What do we think about clustering or grouping documents by centroids, or potentially in chunks of filters and allow multiple graphs per segment. If segments are random sub-samples, it is conceivable that clusters are of common distributions, consequently we can quit searching clusters early and only search a couple of the clusters at a time.

Seeing the optimistic query changes, makes me think we can maybe cluster, have one graph per cluster (or chunk of the ordered docs) and more eagerly enable early termination of the multi-graph searches.

This is a crazy idea, maybe it ain't worth doing.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions