run_scraper.py for scraping purposes. | OpenClassroom Project #2

MaximeJB/BooktoScrape

P2 - Use Python basics for market analysis

A scraper for the website booktoscrape.com. It scrapes every category of the site and follows the link to the next page whenever one exists. The script downloads every book image and sorts the images by category; the extracted book data is sorted the same way.
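The two ideas described above (following a "next" link until none remains, and grouping output by category) can be sketched as follows. This is a minimal illustration, not the code in run_scraper.py: it uses only the standard library, all function and class names are hypothetical, and it assumes the site marks its pagination link as an `<a>` inside `<li class="next">`. The `data/<category>/` folder layout is likewise an assumption.

```python
import re
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin
from urllib.request import urlopen


class NextLinkParser(HTMLParser):
    """Record the href of the first <a> found inside <li class="next">."""

    def __init__(self):
        super().__init__()
        self._in_next = False
        self.next_href = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li" and "next" in (attrs.get("class") or "").split():
            self._in_next = True
        elif tag == "a" and self._in_next and self.next_href is None:
            self.next_href = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_next = False


def find_next_page(html, current_url):
    """Return the absolute URL of the next page, or None on the last page."""
    parser = NextLinkParser()
    parser.feed(html)
    if parser.next_href is None:
        return None
    # Pagination links are assumed to be relative to the current page.
    return urljoin(current_url, parser.next_href)


def fetch_page(url):
    """Download a page and decode it as UTF-8 (assumed encoding)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def iter_pages(start_url, fetch=fetch_page):
    """Yield (url, html) for a category page and every page that follows it."""
    url = start_url
    while url is not None:
        html = fetch(url)
        yield url, html
        url = find_next_page(html, url)


def category_paths(output_dir, category):
    """Return (image folder, CSV file) for one category (hypothetical layout)."""
    slug = re.sub(r"[^a-z0-9]+", "-", category.lower()).strip("-")
    folder = Path(output_dir) / slug
    return folder / "images", folder / f"{slug}.csv"
```

Passing `fetch` as a parameter keeps the pagination loop testable without network access: any callable that maps a URL to an HTML string can stand in for the real download.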

This repository is a clean, updated version of this one. For commit history, please refer to the original repository.

Logo image

Create a virtual environment

python -m venv venv

Activate the virtual environment (Windows)

.\venv\Scripts\activate.bat

Install the dependencies

pip install -r requirements.txt

Run the project

python run_scraper.py

Idea for future work: a visual representation of the data
