Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle fuzzy search #85

Closed
fulmicoton opened this issue Feb 20, 2017 · 5 comments
Closed

Handle fuzzy search #85

fulmicoton opened this issue Feb 20, 2017 · 5 comments

Comments

@fulmicoton
Copy link
Collaborator

Handle matching with a given levenshtein distance of at most k.
k should be a function of the length of the query though.

It might be nice to contribute to the fst crate with the original implementation of levenshtein automaton.

@drusellers
Copy link
Contributor

https://github.com/BurntSushi/fst/blob/master/fst-levenshtein/src/lib.rs has the fst code.

@fulmicoton would this be a collector?

@fulmicoton
Copy link
Collaborator Author

No, probably a query...

We want something that intersects the term dictionary with the levenshtein automaton like what you see in BurntSushi code.

Maybe we should just pre-compute the full docset of documents that match the fuzzy query, and then have the scorer iterate over those, I am unsure.

@fulmicoton fulmicoton added this to the 0.7.0 milestone Jun 13, 2018
@fulmicoton fulmicoton modified the milestones: 0.7.0, 0.8 Sep 16, 2018
@fulmicoton fulmicoton modified the milestones: 0.8, 0.9 Dec 26, 2018
@fulmicoton fulmicoton removed their assignment Jan 25, 2019
@rhanka
Copy link

rhanka commented Jan 23, 2021

Is this feature officially out of scope ?

@fulmicoton
Copy link
Collaborator Author

No. It is not out of scope. It will happen eventually.

If you want fuzzy search on one term, the FuzzyTermQuery does it.
The levenshtein automata stuff has been implemented, and is compatible with the fst crate.

@shikhar
Copy link
Collaborator

shikhar commented Feb 1, 2022

Dupe or at least related: #563

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants