Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Japanese, Chinese and other non-ASCII searching not quite working #901

Open
matrixbot opened this issue Dec 15, 2023 · 2 comments
Open

Japanese, Chinese and other non-ASCII searching not quite working #901

matrixbot opened this issue Dec 15, 2023 · 2 comments

Comments

@matrixbot
Copy link
Collaborator

matrixbot commented Dec 15, 2023

This issue has been migrated from #901.


Hi - on a synapse homeserver running on postgresql 9.4, and connecting via vector, I'm having trouble searching Japanese.

If I enter:

バッタと鈴虫

(grasshopper and cricket)
I get a search hit when I search バッタ but, not when I search 鈴虫.

It appears that the beginning of a post is ok, but, anywhere in the middle of the post is not searchable. I tried this on my own homeserver, and on the main matrix homeserver. Same result. Kent on the main forum also reproduced.

Is there a possibility to search using regex?

@matrixbot matrixbot changed the title Dummy issue Japanese, Chinese and other non-ASCII searching not quite working Dec 21, 2023
@matrixbot matrixbot reopened this Dec 21, 2023
@code-gal
Copy link

code-gal commented Aug 4, 2024

Here is a temporary solution for Chinese:给 Matrix Synapse 添加中文搜索

@fofwisdom
Copy link

Korean is also an issue. CJK attaches an postpositions to the end of a word, so searching for a phrase is pointless. For Asians, Synapse seems useless for work or community.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants