Welcome to the GrabFood Web Scraper project! This Python-based web scraping tool extracts and analyzes data from the GrabFood website. The scraper collects information about restaurants, recommended merchants, and related details from the GrabFood platform.
- Web Scraping: Extracts data from the GrabFood website, focusing on restaurant information and recommended merchants.
- Data Processing: Parses the extracted data and organizes it into structured formats.
- API Requests: Fetches detailed information about specific merchants using Grab's API.
To get started with this project, follow the installation instructions and explore the various features available.
- Web Scraping: The initial script (1_webScraper.py) uses BeautifulSoup to scrape HTML content from the GrabFood website.
- Data Processing: The extracted data is then processed and organized, and relevant information is stored in a JSON file.
- API Requests: The RestListFetch script (2_RestListFetch.py) uses the collected IDs to make API requests and fetch detailed information about specific merchants.
- Data Combining: The ListMaking script (3_listMaking.py) combines the collected data into a structured format.
- Response Grab: The final script (4_responseGrab.py) uses the collected IDs to fetch responses and extract specific details about merchants.
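The first two steps above (scrape HTML, then store the relevant information as JSON) can be sketched roughly as follows. This is an illustration only, not the project's actual code: the project uses BeautifulSoup, while this sketch swaps in the standard library's `html.parser` so it runs without extra dependencies. The URL pattern, the `id`/`name` fields, and all function and class names are assumptions made for the example.

```python
import json
import re
from html.parser import HTMLParser

# Assumed URL shape for a merchant page: ".../restaurant/<slug>/<merchant-id>".
# This pattern is illustrative; check it against the live site's markup.
MERCHANT_HREF = re.compile(r"/restaurant/[^/]+/([^/?#]+)")


class MerchantLinkParser(HTMLParser):
    """Collects merchant IDs and link text from <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.merchants = {}      # merchant ID -> link text, in document order
        self._current_id = None  # ID of the merchant link currently open, if any

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        match = MERCHANT_HREF.search(href)
        # Record each merchant ID only once; repeated links are ignored.
        if match and match.group(1) not in self.merchants:
            self._current_id = match.group(1)
            self.merchants[self._current_id] = ""

    def handle_data(self, data):
        # Accumulate text only while inside a newly seen merchant link.
        if self._current_id is not None:
            self.merchants[self._current_id] += data.strip()

    def handle_endtag(self, tag):
        if tag == "a":
            self._current_id = None


def extract_merchants(html):
    """Return a list of {"id", "name"} records found in the HTML string."""
    parser = MerchantLinkParser()
    parser.feed(html)
    parser.close()
    return [{"id": mid, "name": name} for mid, name in parser.merchants.items()]


def save_merchants(merchants, path):
    """Write the extracted records to a JSON file."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(merchants, fh, indent=2, ensure_ascii=False)
```

The later scripts could then read the saved JSON file to obtain the merchant IDs they need for API requests.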
- Web Scraping: Utilize BeautifulSoup for efficient extraction of data from the GrabFood website, focusing on restaurants and recommended merchants.
- Data Processing: Process and organize the extracted data into a structured format for further analysis and presentation.
- API Requests: Make API requests to Grab's servers using the collected merchant IDs to fetch detailed information about specific merchants.
- Data Combination: Combine the collected data into a cohesive and structured format using the ListMaking script for comprehensive analysis.
- Response Grab: Fetch responses using the collected IDs, extracting specific details about merchants for a deeper understanding.
- Ease of Use: Designed to be user-friendly, allowing seamless execution of scripts to gather and process data without unnecessary complexity.
- Customizable: Easily adapt the scripts to accommodate changes in the GrabFood website structure or modify them for specific use cases.
- Contributions Welcome: We encourage contributions from the community to enhance and improve the functionality of the web scraper.
- Documentation: Detailed documentation to guide users on installation, usage, and contributing to the project.
- Open Source: Released under an open-source license, fostering collaboration and transparency within the developer community.
- Scalability: Designed with scalability in mind to handle a large volume of data efficiently.
- Modular Architecture: Scripts are organized in a modular manner, making it easy to understand, extend, or modify specific functionalities.
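As a rough illustration of the Data Combination feature listed above, the sketch below merges scraped listing records with detail records that share the same merchant ID. It is not taken from 3_listMaking.py; the function name and the `id`, `name`, and `rating` fields are hypothetical and only serve the example.

```python
def combine_merchant_data(listings, details):
    """Merge detail records into listing records that share the same "id".

    Both arguments are lists of dicts; the field names are assumed
    for illustration and may differ from the project's real data.
    """
    # Index the detail records by ID for constant-time lookup.
    details_by_id = {d["id"]: d for d in details}
    combined = []
    for listing in listings:
        merged = dict(listing)  # copy so the input records are not mutated
        # Listings without a matching detail record are kept unchanged.
        merged.update(details_by_id.get(listing["id"], {}))
        combined.append(merged)
    return combined
```

Keeping unmatched listings in the output, instead of dropping them, makes it easier to spot merchants whose detail request failed.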
Before getting started with the GrabFood Web Scraper project, make sure you have the following prerequisites installed:
- Python: Ensure you have Python installed on your machine. You can download and install it from python.org.
- Dependencies: Install the required Python libraries using the following command:
pip install -r requirements.txt
Follow these steps to set up and run the GrabFood Web Scraper:
- Clone the Repository:
git clone https://github.com/your-username/grabfood-web-scraper.git
- Change Directory:
cd grabfood-web-scraper
- Install Dependencies:
pip install -r requirements.txt
- Make the Script Executable:
chmod +x run_scripts.sh
- Execute the Script:
./run_scripts.sh
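The contents of run_scripts.sh are not shown here. Assuming it simply runs the four scripts in order, a cross-platform equivalent in Python might look like the sketch below. The `run_pipeline` function and its stop-on-first-failure behavior are assumptions for illustration; only the script file names come from this README.

```python
import subprocess
import sys

# Script names as listed in this README, in their numbered order.
SCRIPTS = [
    "1_webScraper.py",
    "2_RestListFetch.py",
    "3_listMaking.py",
    "4_responseGrab.py",
]


def run_pipeline(scripts=SCRIPTS, cwd=None):
    """Run each script with the current interpreter; stop at the first failure.

    Returns the list of scripts that completed successfully.
    """
    completed = []
    for script in scripts:
        result = subprocess.run([sys.executable, script], cwd=cwd)
        if result.returncode != 0:
            # Later scripts depend on earlier output, so do not continue.
            raise RuntimeError(f"{script} exited with status {result.returncode}")
        completed.append(script)
    return completed
```

Calling `run_pipeline()` from the repository root would run all four scripts in sequence. This can be handy on systems where `chmod` and shell scripts are not available.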