Reduce heap usage in BestBucketsDeferringCollector #105444

iverase · 2024-02-13T10:43:00Z

BestBucketsDeferringCollector delays the creation of sub-aggregations by collecting, on the heap, the document and bucket values received in LeafBucketCollector#collect(int doc, long bucket). Those values are replayed at a later stage if necessary.
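To illustrate the record-then-replay idea, here is a minimal sketch in plain Java. It is not the Elasticsearch implementation: the class `DeferredCollectSketch`, the `BucketConsumer` interface, and the choice to store doc IDs as deltas in boxed lists are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: records (doc, bucket) pairs and replays them later.
public final class DeferredCollectSketch {

    /** Hypothetical callback standing in for a sub-aggregation collector. */
    interface BucketConsumer {
        void accept(int doc, long bucket);
    }

    // Assumption: docs arrive in non-decreasing order, so deltas are small
    // non-negative integers that a packed structure could compress well.
    private final List<Integer> docDeltas = new ArrayList<>();
    private final List<Long> buckets = new ArrayList<>();
    private int lastDoc = 0;

    /** Record the pair instead of running sub-aggregations right away. */
    void collect(int doc, long bucket) {
        docDeltas.add(doc - lastDoc);
        buckets.add(bucket);
        lastDoc = doc;
    }

    /** Replay only the pairs whose bucket was selected. */
    void replay(Set<Long> selectedBuckets, BucketConsumer consumer) {
        int doc = 0;
        for (int i = 0; i < docDeltas.size(); i++) {
            doc += docDeltas.get(i);
            long bucket = buckets.get(i);
            if (selectedBuckets.contains(bucket)) {
                consumer.accept(doc, bucket);
            }
        }
    }

    public static void main(String[] args) {
        DeferredCollectSketch sketch = new DeferredCollectSketch();
        sketch.collect(1, 0L);
        sketch.collect(3, 1L);
        sketch.collect(7, 1L);
        sketch.replay(Set.of(1L), (doc, bucket) -> System.out.println(doc + " -> " + bucket));
    }
}
```

The two growing lists are the part that costs heap, which is why the encoding used for them matters.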

We currently use Lucene's PackedLongValues to compress those integers on heap. In #103624 we added PFOR-delta compression for postings in Elasticsearch, which is probably more space-efficient than the algorithm in PackedLongValues. That algorithm is essentially FOR (without the P), so it is likely more wasteful.
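The difference between FOR and patched FOR can be seen with a rough size estimate. The sketch below is illustrative only: the class `ForVsPforSketch` and its cost model (8 bits of position plus the overflow bits per exception) are assumptions, not the encoding used by Lucene or by this PR.

```java
import java.util.Arrays;

// Hypothetical size estimate for one block of non-negative values.
public final class ForVsPforSketch {

    /** Bits needed to represent a non-negative value (at least 1). */
    static int bitsRequired(long value) {
        return Math.max(1, 64 - Long.numberOfLeadingZeros(value));
    }

    /** Width of the largest value in the block. */
    static int maxWidth(long[] block) {
        int width = 0;
        for (long v : block) {
            width = Math.max(width, bitsRequired(v));
        }
        return width;
    }

    /** FOR: every value is stored with the width of the largest value. */
    static long forBits(long[] block) {
        return (long) maxWidth(block) * block.length;
    }

    /**
     * Patched FOR: try each smaller width and store values that do not fit
     * as exceptions. Assumed cost per exception: 8 bits for its position
     * plus the bits above the chosen width.
     */
    static long pforBits(long[] block) {
        int max = maxWidth(block);
        long best = Long.MAX_VALUE;
        for (int width = 1; width <= max; width++) {
            long cost = (long) width * block.length;
            for (long v : block) {
                if (bitsRequired(v) > width) {
                    cost += 8 + (max - width);
                }
            }
            best = Math.min(best, cost);
        }
        return best;
    }

    public static void main(String[] args) {
        long[] block = new long[128];
        Arrays.fill(block, 3L);   // 127 small values, 2 bits each
        block[17] = 1_000_000L;   // one outlier that needs 20 bits
        System.out.println("FOR bits:  " + forBits(block));   // 2560
        System.out.println("PFOR bits: " + pforBits(block));  // 282
    }
}
```

Under this cost model a single outlier forces FOR to spend 20 bits on every value, while the patched variant keeps 2 bits per value and pays only for the one exception.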

This PR introduces PForLongValues, which follows the PackedLongValues API so that it can easily replace the current usage of PackedLongValues in BestBucketsDeferringCollector.

elasticsearchmachine · 2024-02-13T10:43:24Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

elasticsearchmachine · 2024-02-13T10:43:24Z

Hi @iverase, I've created a changelog YAML for you.

iverase added 2 commits February 13, 2024 11:31

Reduce heap usage in BestBucketsDeferringCollector

00104af

iter

49e8569

iverase added >enhancement :Analytics/Aggregations Aggregations v8.13.0 labels Feb 13, 2024

iverase requested a review from jpountz February 13, 2024 10:43

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Feb 13, 2024

iverase and others added 2 commits February 13, 2024 11:43

Update docs/changelog/105444.yaml

8c22ec6

fix compile error

897cfb1

elasticsearchmachine added v8.14.0 and removed v8.13.0 labels Feb 14, 2024

Merge branch 'main' into PForLongValues

b2f5d94

elasticsearchmachine added v8.15.0 and removed v8.14.0 labels Apr 17, 2024

elasticsearchmachine added v8.16.0 and removed v8.15.0 labels Jul 4, 2024

mark-vieira added v9.0.0 and removed v8.16.0 labels Sep 11, 2024
