Skip to content

unitable表格解析错误 #82

@jylong91

Description

@jylong91

请提供下述完整信息以便快速定位问题
(Please provide the following information to quickly locate the problem)

  • 系统环境/System Environment
  • 使用的是哪门语言的程序/Which programing language
  • 所使用语言相关版本信息/Version
  • OnnxRuntime版本/OnnxRuntime Version
  • 可复现问题的demo/Demo of reproducible problems
  • 完整报错/Complete Error Message
  • 可能的解决方案/Possible solutions:

mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | [2025-04-17 14:58:57,726] [ ERROR] pdf2md.py:120 - pdf file parser error: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions