Skip to content

Conversation

@vibrantvarun
Copy link
Member

@vibrantvarun vibrantvarun commented Sep 22, 2025

Description

Merge Introduction to HybridQueryCollectorContextSpec into main

Hybrid query with 3 subqueries (term, range and date)
Dataset : noaa_semantic_search
image

image (1) image (2)
  1. We have observed major perf improvements about 20% in hybrid search with lexical search only.
  2. Aggregations show improvements from 3-5% range.
  3. Sorting show improvements from 10-15% range.

Hybrid query with match and neural query: neural search workload
image (3)

During multiple iterations, we have observed that perf difference between 3.2 and 3.2-perf lies in -5% to + 5% range.

Related Issues

1428

Check List

  • New functionality includes testing.
  • New functionality has been documented.
  • API changes companion pull request created.
  • Commits are signed per the DCO using --signoff.
  • Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

vibrantvarun and others added 2 commits September 22, 2025 10:31
…rch performance (opensearch-project#1534)

* QueryCollectorContextSpec implemenation for hybrid query

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* HybridQueryCollectContextSpec implemenation

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Optimizations in Hybrid Query

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Reformat files and add javadoc

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Add changelog

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Update Changelog

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Add HybridSearchCollectorResultUtilTests

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Add unit test of HybridQueryCollectorContextSpec and HybridQueryCollectorContextSpecFactory

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Add isHybridQuery check in postProcess of HybridAggregationProcessor

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Fix unit tests

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

* Modify comments to use release branch code

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>

---------

Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>
Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>
Signed-off-by: vibrantvarun <jainvarun4996@gmail.com>
@vibrantvarun vibrantvarun added the v3.3.0 Issues targeting release v3.3.0 label Sep 23, 2025
@vibrantvarun vibrantvarun changed the title Merge feature/querycollectocontextspec_in_hybrid_query to main Introducing HybridQueryCollectorContextSpec to improve the hybrid search performance Sep 23, 2025
@heemin32 heemin32 merged commit bdefd05 into opensearch-project:main Sep 23, 2025
65 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

v3.3.0 Issues targeting release v3.3.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants