OpenSooq Data Miner is a web application that scrapes data from OpenSooq, an online marketplace, based on user-defined parameters: the category URL and the number of pages to scrape. The web application sends these parameters to an API server, which scrapes the requested pages using Python scripts built on the BeautifulSoup and requests libraries. The scraped data is then stored in a CSV file, which users can preview and download.
- Customizable Scraping: Users can specify the category URL and the number of pages to scrape data from.
- API Integration: The web application sends each scraping request to an API server, which performs the scraping.
- Scraping with BeautifulSoup: Python scripts use the BeautifulSoup library to extract data from the fetched web pages.
- CSV Export: Scraped data is stored in a CSV file for easy download and preview.
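
As an illustration of the customizable scraping parameters above, the following minimal sketch turns a category URL and a page count into a list of page URLs. It is not taken from this repository: the function name `build_page_urls`, the `page` query parameter, and the `example.com` URL are assumptions made for the example, and the real pagination scheme on OpenSooq may differ.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def build_page_urls(category_url, num_pages, page_param="page"):
    """Return one URL per page, numbered 1..num_pages.

    Assumption: pages are selected with a `?page=N` query parameter.
    """
    if num_pages < 1:
        raise ValueError("num_pages must be at least 1")

    parts = urlsplit(category_url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError("category_url must be an absolute http(s) URL")

    # Keep existing query parameters, but drop any page number already present.
    base_query = [(k, v) for k, v in parse_qsl(parts.query) if k != page_param]

    urls = []
    for page in range(1, num_pages + 1):
        query = urlencode(base_query + [(page_param, str(page))])
        urls.append(urlunsplit((parts.scheme, parts.netloc, parts.path, query, "")))
    return urls


# Placeholder URL, not a real OpenSooq category.
for url in build_page_urls("https://example.com/category", 3):
    print(url)
```

Validating both inputs up front lets a bad URL or a page count below 1 be rejected before any HTTP request is made.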
1. Input Parameters: Users provide the category URL and specify the number of pages to scrape on the web page.
2. Scraping Process: When the user presses the button, the web application sends these parameters to the API server.
3. Data Scraping: Python scripts on the API server use the BeautifulSoup and requests libraries to scrape data from OpenSooq.
4. CSV Generation: The scraped data is then stored in a CSV file.
5. Download and Preview: Users can download the CSV file containing the scraped data and preview it.
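
The scraping and CSV generation steps above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the HTML snippet, the CSS class names (`listing`, `title`, `price`), and the function names are all hypothetical, since the real OpenSooq markup is not described here. The HTML is parsed from an inline string so the example runs without network access; in practice the HTML would come from a `requests.get(url).text` call.

```python
import csv
import io

from bs4 import BeautifulSoup

# Made-up markup for illustration only; real OpenSooq pages will differ.
SAMPLE_HTML = """
<div class="listing"><h2 class="title">Example item A</h2><span class="price">100</span></div>
<div class="listing"><h2 class="title">Example item B</h2><span class="price">2,500</span></div>
"""


def parse_listings(html):
    """Extract one dict per listing card from a page of HTML."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for card in soup.select("div.listing"):
        title = card.select_one(".title")
        price = card.select_one(".price")
        rows.append({
            # Fall back to an empty string when a field is missing.
            "title": title.get_text(strip=True) if title else "",
            "price": price.get_text(strip=True) if price else "",
        })
    return rows


def rows_to_csv(rows, fieldnames=("title", "price")):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(fieldnames))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = parse_listings(SAMPLE_HTML)
csv_text = rows_to_csv(rows)
print(csv_text)
```

Using `csv.DictWriter` rather than joining strings by hand means values that contain commas, such as `2,500`, are quoted automatically.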
- Python: Used for scripting and backend data processing.
- FastAPI: Web framework used to build the application's API.
- BeautifulSoup: Python library for web scraping.
- Requests: Python library for making HTTP requests.
- CSV: File format used for storing scraped data.
- HTML/CSS/JavaScript: Frontend technologies for building the user interface.
To get started with OpenSooq Data Miner, follow these steps:
- Clone the repository to your local machine.
- Install the necessary dependencies by running `pip install -r requirements.txt`.
- Run the FastAPI application using `python -m uvicorn os_scarper_fastapi:app --reload`.
- Access the application through your web browser and start scraping data from OpenSooq.
Contributions to this repository are welcome!