[Feature Request] Speed up percentile aggregation by switching implementation

### Is your feature request related to a problem? Please describe

The percentiles aggregation can be very slow. We rely on the t-digest library to get approximate percentiles. While poking around in the code I noticed we use their `AVLTreeDigest` implementation, but the [recommended](https://github.com/tdunning/t-digest/blob/7905f3d2ad18e7d7176811147d1316a3e23d7061/core/src/main/java/com/tdunning/math/stats/TDigest.java#L50) one is now `MergingDigest`. It looks like OpenSearch's `TDigestState` was last meaningfully modified in March 2017, but this new implementation was introduced after that in [April 2017](https://github.com/tdunning/t-digest/commit/353f1283827d8fe14b739567badc7ddcf83fe431#diff-4708ffdc9f1e95b24595629824a237898c3753da5ccad7ee86fef604af837bc7), which explains why we aren't already using it. 

The comments claim this implementation is both faster and also uses ["much less than half"](https://github.com/tdunning/t-digest/blob/7905f3d2ad18e7d7176811147d1316a3e23d7061/core/src/main/java/com/tdunning/math/stats/MergingDigest.java#L59) of the memory of `AVLTreeDigest`. I couldn't find any actual numbers for speed posted online but I did run some benchmarks with OpenSearch that look good.

### Describe the solution you'd like

We should switch to the new implementation. Since these extend the same abstract class it would be a drag-and-drop change. 

I benchmarked this change on http_logs which has 247M docs. I did it for the "@timestamp" field (high cardinality) and the "status" field (low cardinality since it's an HTTP status code). The speedup was especially large for status: 

|Field|Baseline latency (ms)|Modififed latency (ms)|
|--|--|--|
|timestamp|13,085|6,293|
|status|196,794|6,212|

### Related component

Search:Performance

### Describe alternatives you've considered

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature Request] Speed up percentile aggregation by switching implementation #18122

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Related component

Describe alternatives you've considered

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Field	Baseline latency (ms)	Modififed latency (ms)
timestamp	13,085	6,293
status	196,794	6,212

Uh oh!

[Feature Request] Speed up percentile aggregation by switching implementation #18122

Description

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Related component

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions