You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I've been struggling with the import of multiple pdfs. I need to create a corpus, but for some reason I continue getting the same error while using pdftools as a method to extract the texts using the tm package. It works if I try to import just one pdf however.
This is what I do:
PDF error: Invalid Font Weight
PDF error: Invalid Font Weight
PDF error: Invalid Font Weight
PDF error: Invalid Font Weight
[...]
PDF error (218): Illegal character <2f> in hex string
PDF error: Couldn't find trailer dictionary
PDF error: Couldn't find trailer dictionary
PDF error: Couldn't read xref table
Error in poppler_pdf_text(loadfile(pdf), opw, upw) : PDF parsing failure.
Hi,
I've been struggling with the import of multiple pdfs. I need to create a corpus, but for some reason I continue getting the same error while using pdftools as a method to extract the texts using the tm package. It works if I try to import just one pdf however.
This is what I do:
This is what I get
My sessioninfo
This is an example of the PDFs I'm using. It's this entire batch that doesn't work, also from different sources.
12.pdf
The text was updated successfully, but these errors were encountered: