This script is designed to convert particular PDF content into Excel sheet.
Process :
- The program will decode the "full_pay.pdf" present in assests folder, into readable format.
- This will scrap the required text from the pdf into excel based on different columns.
- Then save the excel file as "full_pay.xlsx" in assests folder.
Note :
- Paste the pdf to convert into excel into assests folder. And make sure the name of the pdf is set to "full_pay.pdf".
- The converted Excel will be created and will be available in the folder assests with name "full_pay.xlsx"
- The program will take approximately 20-30 seconds for a page in pdf to fetch details, depending on the processing speed of your system.
Technical info - Modules used :
- pdfminer.six
- numpy
- openpyxl
- io
- sys