Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Limited search capabilities in CJK languages #777

Open
quiple opened this issue Oct 18, 2024 · 2 comments
Open

Limited search capabilities in CJK languages #777

quiple opened this issue Oct 18, 2024 · 2 comments

Comments

@quiple
Copy link

quiple commented Oct 18, 2024

The current search feature only allows you to search for words separated by spaces or symbols, but Japanese and Chinese don't use spaces, and Korean don't use spaces before postpositions, making it very difficult to get the results I want.

@bnewbold
Copy link
Collaborator

Hi! we do some special processing and indexing for Japanese text specifically (we had a large early Japanese user community). have you tested in that language specifically? we could potentially do similar indexing for other languages in the future.

@quiple
Copy link
Author

quiple commented Oct 21, 2024

Hi! we do some special processing and indexing for Japanese text specifically (we had a large early Japanese user community). have you tested in that language specifically? we could potentially do similar indexing for other languages in the future.

I just checked and it does seem to separate words when searching in Japanese as you said, maybe it has a built-in dictionary?

And Chinese search is weird, probably because of the mix of Hanzi and Kanji, and Korean search is useless.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants