
[Question]: Multilingual support between embedding knowledge base, retrieval testing, search, and assistant chat #4503

Open
@predoctech


Describe your problem

As this project has a Chinese/English focus, I experimented with a bilingual test case.
The source document is in Chinese:
Screenshot from 2025-01-16 12-31-16
Embedding is done with maidalun1020/bce-embedding-base_v1, which I understand to be a bilingual and crosslingual embedding model.
My assumption is that, although the source document is in Chinese, I should be able to run retrieval testing, search, and chat in English whenever the query matches a chunk semantically. The LLM deployed (Gemini) obviously needs to be bilingual as well, which it is.
However, that is not what I have experienced:
Retrieval testing: Always returns "no data"
Search: No result
Screenshot from 2025-01-16 12-39-39
Chat: Knowledge base is empty
Screenshot from 2025-01-16 12-41-21
Please advise whether multilingual support is available in RAGFlow, or whether what I attempted was not the correct approach for this purpose. Thanks.
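
To make the assumption above concrete, here is a minimal sketch of what "an English query matches a Chinese chunk semantically" would mean in terms of vector similarity. It is not RAGFlow's retrieval code. The names `rank_chunks`, `toy_embed`, `CONCEPTS`, and the `threshold` value are all hypothetical, and `toy_embed` is only a stand-in so the snippet runs without downloading a model. In a real check, the `embed` argument would be replaced by a function that calls the actual embedding model.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_chunks(
    query: str,
    chunks: List[str],
    embed: Callable[[str], Vector],
    threshold: float = 0.2,  # hypothetical cut-off, chosen only for illustration
) -> List[Tuple[float, str]]:
    """Return (score, chunk) pairs scoring at least `threshold`, best first."""
    query_vec = embed(query)
    scored = [(cosine_similarity(query_vec, embed(c)), c) for c in chunks]
    hits = [(score, chunk) for score, chunk in scored if score >= threshold]
    return sorted(hits, key=lambda pair: pair[0], reverse=True)


# Stand-in embedder (an assumption, NOT a real model): one dimension per
# concept, set to 1.0 when the text contains the English or Chinese term.
CONCEPTS = {
    "refund": ("refund", "退款"),
    "shipping": ("shipping", "运费"),
    "warranty": ("warranty", "保修"),
}


def toy_embed(text: str) -> List[float]:
    lowered = text.lower()
    return [
        float(any(term in lowered for term in terms))
        for terms in CONCEPTS.values()
    ]


chunks = ["退款将在七个工作日内处理。", "运费由买家承担。"]
results = rank_chunks("How long does a refund take?", chunks, toy_embed)
for score, chunk in results:
    print(f"{score:.2f}  {chunk}")
```

If a real crosslingual model scores the English query well against the Chinese chunk in a check like this, but the tool still returns nothing, the cause is more likely somewhere else in the pipeline (for example, a keyword-matching step or a similarity threshold) than in the embedding model itself.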
