order of Text extraction from pdf is difference when the pdf files are generated in different platform(win and linux) #3816
Unanswered
kvrameshreddy
asked this question in
Looking for help
Replies: 1 comment 8 replies
-
I believe you that the files are different - without looking at your comparison script. What I am actually interested in why you believe they should be identical. Have they been created by the same program, running once on each of the platforms? With identical input data each? |
Beta Was this translation helpful? Give feedback.
8 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi @JorjMcKie ,
i am trying to extract text from two pdf files, which are generated on different platforms (windows and linux) and compare them to identify the differences.
In this use case, even though the two files are similar i see few words come in a different order and failing while comparing.
i am attaching the pdf files and compare logic, can you help me out in solving this case
Customers95.pdf
Customers92.pdf
Customers95.pdf_vs_Customers92.pdf_text.pdf
Beta Was this translation helpful? Give feedback.
All reactions