-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Disable scoring of Keyword Term search by default, allow user use old logic using useSimilarity parameter #17889
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disable scoring of Keyword Term search by default, allow user use old logic using useSimilarity parameter #17889
Conversation
|
❌ Gradle check result for 6b60c9c: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
{"run-benchmark-test": "id_1"} |
|
{"run-benchmark-test": "id_4"} |
|
The Jenkins job url is https://build.ci.opensearch.org/job/benchmark-pull-request/2828/ . Final results will be published once the job is completed. |
|
Edit: n/m, code out of sync I'm getting this error when trying to update the mapping: Without that useSimilarity it works: |
|
❌ Gradle check result for 493a1d3: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
server/src/main/java/org/opensearch/index/mapper/KeywordFieldMapper.java
Show resolved
Hide resolved
* will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
* will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
493a1d3 to
ef419d5
Compare
Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
|
❌ Gradle check result for ef419d5: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❌ Gradle check result for 16882dc: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
* also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com>
|
Failure was real. In case I also where we do |
|
❌ Gradle check result for c8e4dcd: FAILURE Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change? |
|
❕ Gradle check result for c8e4dcd: UNSTABLE Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure. |
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #17889 +/- ##
============================================
+ Coverage 72.43% 72.47% +0.03%
- Complexity 66952 66976 +24
============================================
Files 5467 5470 +3
Lines 309605 309707 +102
Branches 45043 45052 +9
============================================
+ Hits 224272 224466 +194
+ Misses 66978 66904 -74
+ Partials 18355 18337 -18 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
4308f4c
into
opensearch-project:main
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Sriram Ganesh <srignsh22@gmail.com>
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Harsh Kothari <techarsh@amazon.com>
… logic using useSimilarity parameter (opensearch-project#17889) * Initial attempt to add constant scorer for term keyword search * will follow up with unit tests * making sure I'm on the right track Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * will follow up with manually testing the new mapping parameter Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Add changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit test Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix changelog entry Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Apply style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix unit tests * since term is a basic query type, it affects many that build on top * none of these should cause a regression, infact this change should fix any possible regression these query types, which may not be copatured in big5 datasets Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix more unit tests Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix flakly test * due to the random nature of org.opensearch.index.query.SimpleQueryStringBuilderTests.testToQuery, this can sometimes fail with `but: was <ConstantScore(mapped_string_2:za)>` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting * Fix style check Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Change parameter to use snake_case: use_similarity Signed-off-by: Asim Mahmood <asim.seng@gmail.com> * Fix integ tests * also check all places where we do `X instanceof BoostQuery` to also check for `ConstantScoreQuery` Signed-off-by: Asim Mahmood <asim.seng@gmail.com> --------- Signed-off-by: Asim Mahmood <asim.seng@gmail.com> Signed-off-by: Asim Mahmood <asim.seng@gmail.com> # Please enter the commit message for your changes. Lines starting Signed-off-by: Asim M <asim.seng@gmail.com> Signed-off-by: Harsh Kothari <techarsh@amazon.com>
Description
Related Issues
Resolves #17823
Check List
Testing with useSimilarity:true, took time 10ms, score 1.0
Testing with useSimilarity:true, took time 200ms, score 0.8...