Skip to content

[Bug]: Keywords and questions should not be left out, especially the last dozen or so chunks. #5522

Open
@lonrencn

Description

@lonrencn

Is there an existing issue for the same bug?

  • I have checked the existing issues.

RAGFlow workspace code commit ID

commit 5fdfb8d

RAGFlow image version

v0.16.0-177-g5fdfb8d4 slim

Other environment information

ubuntu 24
ollama qwen2.5:32b
bgm-m3

Actual behavior

I have been conducting tests for several days. When using academic monographs to create a knowledge base, I noticed that in the process of assigning keywords and questions to the chunks, the first 20 chunks are often omitted. That is, when I open the document and check the first and second pages of the chunks, there are no keywords or questions. The parsing settings are such that everything is enabled except for the knowledge graph.

Expected behavior

https://567.daoson.top:8443/#s/_U6PL34A ---> test document

Steps to reproduce

normal control

Additional information

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    🐞 bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions