A capture tool used to capture pictures from web like Sina, LOFTER and huaban.
BeautifulSoup
-bs4
requests
scrapy
pillow
- install
python3
and libs - update your Parameters of each kind of web
- run
main.py
or***_spider.py
to capture images fromsina
orlofter
- run
huaban/run.py
to capture images fromhuaban
USERNAME
: username of huaban which you want to captureROOT_DIR
: directories where to store the images
uid
: user-id(10 numbers) of sina weibo that you want to capturecookies
: your cookies after login the sina weibopath
: directory to save the pictures
username
: username of lofter that you want to capturepath
: directory to save the pictures, see the function_get_path
inlofter_spider.py
query_number
: number of blogs in each query packet, default value is 40