Zalando Beauty New Product Scraper is a Python script for scraping product information from the Zalando website and saving it to a CSV file. It allows you to collect and monitor new product arrivals in a specific category over time.
Before using the Zalando Product Scraper, ensure you have the following prerequisites installed:
- Python 3.x
- pip (Python package manager)
- Clone the repository to your local machine:
git clone https://github.com/SilentJMA/Zalando-Beauty-New-Product-Scraper.git
- Change to the project directory:
cd Zalando-Beauty-New-Product-Scraper
- Install the required Python packages:
pip install -r requirements.txt
To run the scraper, simply execute the script using Python:
python zalando_beauty_scraper.py
The script will scrape the product information, save it to the specified CSV file, and wait for the next scheduled run.
Each import creates a new CSV file with a timestamp in the filename. Previous CSV files are retained in the specified directory. The CSV files contain data for new products found during each import.
The scraper extracts the following product information:
The script checks for new products by comparing product names with previous imports.
Only new products are saved to the CSV files.
You can monitor product arrivals over time and track changes.
The script runs on a schedule defined by the import_interval variable. It will automatically update the product data at the specified interval.
Contributions are welcome! If you have suggestions, improvements, or bug fixes, please open an issue or create a pull request.