-
Notifications
You must be signed in to change notification settings - Fork 132
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] Hybrid search using keyword matching and kNN #717
Comments
@rhvaz |
@rhvaz On doing some deep-dive on the proposed solution, the way ES has implement the K-NN search and the way OpenSearch has implemented the K-NN search is very different. The ES K-NN clause is outside the main query clause and they uses search type as DFS search type to first gather the results for K-NN Query at Coordinator Node level and then pass them along for text search at shard level. But in OpenSearch, as we have implemented the K-NN clause inside the query the clause the full query object is passed to shards and filtering happens at shard level. New Proposed Solution: In the meanwhile please add some more details about your use case which will help us prioritizing the work. |
Sure @navneet1v! I would say one of the main use cases is to provide good search results even for small queries. When a consumer starts typing the name of a product or category if we simply do semantic search using kNN the results are poor. If we combine that with keyword matching we can provide good recommendations for any query length. I think this has applications in multiple e-commerce sites if people are looking to build a search product using OpenSearch. |
Hi @rhvaz thanks for the support. Have you tried using this type of query:
Example:
If yes, I want to know problem did you saw by using above query, where the scores of K-NN and text-match queries are getting combined via bool query at per shard level. Also as per your comments you can change the function score via sigmoid function. Link: opensearch-project/OpenSearch#4557 (comment) |
Hi @rhvaz I am closing this issue as the work to do the hybrid search which includes:
is being taken care via this feature request. opensearch-project/neural-search#123 . I will add more details on that issue going forward. Please do a +1 on the feature. |
Is your feature request related to a problem?
I would like to search against multiple text fields and a kNN vector. Currently I can only filter the results based on keyword matching against the text fields and then re-rank them using vector metrics such as cosine similarity or l2 norm. This means I miss out on many relevant candidates.
What solution would you like?
This solution from ES https://www.elastic.co/guide/en/elasticsearch/reference/master/knn-search.html#_combine_approximate_knn_with_other_features
What alternatives have you considered?
I could do two distinct queries to the same index and add logic to rank the final results, but this is quite messy and not good for latency.
The text was updated successfully, but these errors were encountered: