-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Description
Overview
We will run the benchmarks from the PR using the OpenSearch CDK and OSB and compare the runs for every workload with mainline ( containing Lucene 9 ) to ensure there are no regressions in search or indexing. Lucene 10 has introduced explicit vectorization for comparing vectors and decoding postings, so we should ensure the benchmarks are run on data nodes that have CPU supporting SIM-D capabilities.
We should note down any improvements and try to correlate with changes in Lucene 10 that could have caused them.
We might encounter cases where we are using a older Lucene API which has a faster alternative available in Lucene 10. Any performance regression needs to be fixed before we can go ahead with preparing the RC for OpenSearch Version 3.0
Related component
Other
To Reproduce
Compare performance report of runs with daily runs of mainline.
Expected behavior
There should be no performance regression in search or indexing.
Any improvements should be noted explicitly.
Issues
- [BUG] Time Series Desc Optimisation gets skipped with Lucene 10 Upgrade #17386
- [BUG] Increased latency for Keyword Term operations in Big5 #17387
- [BUG] composite_terms-keyword has increased latency with Lucene 10 index #17388
- [BUG] desc_sort_timestamp is slower with OS 3.0 index compared to OS 2.19 index on OS 3.0 server #17404
- [BUG] Date Histogram aggregation and it's variations have increased latency with OS 3.0/Lucene10 upgrade #17405
- [BUG] Explore adoption for interval change in Boolean queries #17456