Skip to content

Comparison Score #4

Closed
Closed
@hreikin

Description

@hreikin

Check extracted content against other extraction techniques to create a comparison score:

  • Target elements in json
  • OCR PDF pages converted to images
  • Extract text from PDF files with PyMuPDF
  • Compare JSON, OCR, PyMuPDF extracted text and give ratio/score
  • Handle zero division errors

Metadata

Metadata

Assignees

No one assigned

    Labels

    developmentImprovements or features being worked on.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions