A scraper focused on the website booktoscrape.com, it scrapes every categories of the websites, and search for a next page if it exists. The script download every images and sort them by their categories, same for the book data extracted
This repository is a clean, updated version of this one. For commit history, please refer to the original repository.
Create virtual environment.
python -m venv venv
Activate venv (the virtual environment)
.\Scripts\activate.bat
Import the libraries
pip install -r requirements.txt
Run the project
python run_scraper.py
Ulterior research idea : Visual representation of the data
