Skip to content

[Bug]: v0.15.1-46 ERROR Book chunk / Pdf object has no attribute page_chars #4345

Open
@dromeuf

Description

@dromeuf

Is there an existing issue for the same bug?

  • I have checked the existing issues.

RAGFlow workspace code commit ID

e24af69e96304a0767129b98af3e929cf1c06930

RAGFlow image version

v0.15.1-46-g8674156d slim

Other environment information

Linux 5.15.167.4-microsoft-standard-WSL2

Actual behavior

Hi, When I try to import this file with Book chunking I get error PDF :

My solution at this time is to convert PDF to DOCX...

Kinds regards, David.

09:19:51 Page(121~133): Done (0.45s)
09:19:51 Task has been received.
09:19:51 Page(133~145): OCR started
09:19:54 Page(133~145): [**ERROR]Internal server error while chunking: Pdf object has no attribute page_chars
09:19:54 [ERROR][Exception]: 'Pdf' object has no attribute 'page_chars'**
09:19:54 Task has been received.
09:19:54 Page(145~157): OCR started
09:20:00 Page(145~157): OCR finished (6.40s)
09:20:10 Page(145~157): Layout analysis (9.44s)

Expected behavior

No response

Steps to reproduce

idem

Additional information

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    🐞 bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions