Skip to content

[BUG] Omnisearch Fails to Extract from 15% of PDFs #7

Closed as not planned
@LloydThinks

Description

@LloydThinks

Problem description:

Omnisearch appears to fail on many PDFs. I understand that some PDFs, particularly from decades ago, will not be able to be indexed. However, I am finding that ~15% of my PDFs are not being indexed, which concerns me.

I am well aware that this is likely not a bug with Omnisearch itself, but rather a limitation of the PDF indexing library, or something similar. However, I would like to know if there is anything I can do to understand what is going on. The Developer Console logs provide no additional information.

Your environment:

  • Omnisearch version: 1.9.1
  • Obsidian version: 1.1.9
  • Operating system: macOS Ventura 13.0
  • Number of notes in your vault (approx.): 134 MD files, 481 PDFs in a Resources folder. Omnisearch says it has 689 files total before indexing.
  • Other plugins that may be related to the issue: Full plugin list: Advanced Tables, Linter, and Omnisearch

Thank you,
Lloyd

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions