A Python crawler script that automatically downloads documents from Baidu Wenku (百度文库). It can download 'txt/doc/ppt' files. For doc files it restores the original formatting as closely as possible and places each image at its original position within the text.
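As a rough illustration of what placing images at their position in the text can look like, here is a minimal sketch. It is not taken from this script: the `Block` type, its `page`/`top` fields, and the `render` helper are all hypothetical, and it assumes each extracted text or image fragment comes with a page number and a vertical offset.

```python
from dataclasses import dataclass


@dataclass
class Block:
    """Hypothetical fragment of a page (not this script's real data model)."""
    page: int      # 1-based page number (assumed to be available)
    top: float     # vertical offset from the top of the page (assumed)
    kind: str      # "text" or "image"
    content: str   # paragraph text, or an image file name


def merge_in_reading_order(blocks):
    """Order text and image blocks by page, then by vertical position."""
    return sorted(blocks, key=lambda b: (b.page, b.top))


def render(blocks):
    """Emit text as-is and mark the spot where each image belongs."""
    lines = []
    for block in merge_in_reading_order(blocks):
        if block.kind == "image":
            lines.append(f"[image: {block.content}]")
        else:
            lines.append(block.content)
    return "\n".join(lines)


if __name__ == "__main__":
    sample = [
        Block(1, 300.0, "text", "Second paragraph."),
        Block(1, 150.0, "image", "figure1.png"),
        Block(1, 20.0, "text", "First paragraph."),
    ]
    print(render(sample))
```

Sorting on `(page, top)` keeps the image between the two paragraphs it originally sat between. A real document writer would insert the picture itself rather than a placeholder line.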
A Python 3 environment is required to run it. When time permits, it will be packaged as an exe file and released.
This script is for learning and exchange only; the author assumes no legal responsibility.