Welcome to the Pages24 Scraper. This repository houses a powerful scraper designed to extract valuable data from the Pages24 website. As a platform providing diverse information, Pages24 offers a range of listings and details. This scraper streamlines the process of collecting data from Pages24, creating a valuable dataset for various applications.
- Efficiently extracts data from the Pages24 website.
- Provides a comprehensive dataset including listing details.
- Facilitates analysis, database creation, and other creative projects using the extracted data.
Follow these steps to set up the Pages24 Scraper:
- Python 3.9
- Install required Python packages:
- BeautifulSoup4:
pip install beautifulsoup4
- Requests:
pip install requests
- BeautifulSoup4:
-
Clone this repository to your local machine using Git:
git clone https://github.com/adil6572/pages24-scraper.git cd pages24-scraper
-
Install the required Python packages:
pip install beautifulsoup4 requests
To utilize the Pages24 Scraper, follow these steps:
- Open the main file (
main.py
)
# Input Parameters for the Scraper
FILENAME = 'data.csv'
scrape_url = 'https://www.pages24.com/somerset-tx/san-antonio'
-
Open the main file (
main.py
) and ReplaceFILENAME = data.csv
with your preferred filename. -
Rpelace
scrape_url = 'YOUR URL'
to the url that you want to scrape from pages24 website
-
Run the scraper:
python main.py
-
The scraped data will be saved in a CSV file with items structured as follows:
name,url,street_address,locality,postal_code,phone_number,description,keywords
San Antonio River Authority,https://www.pages24.com/somerset-tx/1814639-san-antonio-river-authority,20334 S Payne Rd,Somerset,78069,(830) 429-2160,Providers of in Somerset,
You can now use this CSV file for various purposes, such as analysis, database creation, or any other creative project you have in mind.
We welcome contributions from the community to enhance the Pages24 Scraper project. If you have ideas for improvements, encounter issues, or want to add new features, feel free to open an issue or submit a pull request. Your contributions are valuable and play a crucial role in the ongoing development of this project. Thank you for considering being a part of our open-source community!
This project is licensed under the MIT License - see the LICENSE file for details.