Skip to content

Decouple within-query concurrency from the index's segment geometry [LUCENE-8675] #9721

@asfimport

Description

@asfimport

Segment search is a single threaded operation today, which can be a bottleneck for large analytical queries which index a lot of data and have complex queries which touch multiple segments (imagine a composite query with range query and filters on top). This ticket is for discussing the idea of splitting a single segment into multiple threads based on mutually exclusive document ID ranges.

This will be a two phase effort, the first phase targeting queries returning all matching documents (collectors not terminating early). The second phase patch will introduce staged execution and will build on top of this patch.

PhraseHighFreqP50.png

PhraseHighFreqP90.png

TermHighFreqP50.png

TermHighFreqP90.png


Migrated from LUCENE-8675 by Atri Sharma (@atris), 1 vote, updated Aug 04 2022
Attachments: PhraseHighFreqP50.png, PhraseHighFreqP90.png, TermHighFreqP50.png, TermHighFreqP90.png

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions