Welcome to the Py Web Scraper repository! This project is a web scraping tool written in Python. This README guides you through the prerequisites and explains how to clone and run the project.
Before cloning and running this project, ensure you have the following installed:
- Python: The project is written in Python. Ensure you have Python 3.x installed; you can download it from python.org.
- pip: The package installer for Python. If you installed Python from python.org, you likely already have pip; otherwise, follow the official pip installation instructions.
- Virtual Environment (Optional): It's good practice to run Python projects inside a virtual environment to manage dependencies; see the official `venv` documentation for details.
- ChromeDriver: This project uses ChromeDriver to interact with the Chrome web browser. Ensure the version of ChromeDriver matches your installed version of the Chrome browser.
  - Download ChromeDriver from the official ChromeDriver downloads page, then place the `chromedriver` executable in the root folder of this project.
  - Alternatively, add the path of the `chromedriver` binary to your system's `PATH` environment variable.
  - A quick way to verify your ChromeDriver setup is sketched just after this list.
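If you want to confirm that Chrome and ChromeDriver can find each other before running the project, a short check along the following lines can help. This is a minimal sketch that assumes Selenium 4 is among the project's dependencies; it is not code taken from this repository.

```python
# Minimal ChromeDriver smoke test (illustrative sketch, not part of the project).
# Assumes Selenium 4 is installed: pip install selenium
from pathlib import Path

from selenium import webdriver
from selenium.webdriver.chrome.service import Service

# Prefer a chromedriver binary sitting in the current directory (the project root),
# otherwise fall back to whatever is on the system PATH.
local_driver = Path("chromedriver")  # "chromedriver.exe" on Windows
service = Service(executable_path=str(local_driver)) if local_driver.exists() else Service()

driver = webdriver.Chrome(service=service)
try:
    driver.get("https://example.com")
    print("Page title:", driver.title)
finally:
    driver.quit()
```

If the script prints a page title and exits cleanly, ChromeDriver and Chrome are compatible and correctly located.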
Follow these steps to get the project up and running:
- Clone the Repository: `git clone https://github.com/ynevet/py-web-scraper.git`
- Navigate to the Project Directory: `cd py-web-scraper`
- Set Up a Virtual Environment (Optional): `python3 -m venv venv`, then `source venv/bin/activate` (on Windows, use `venv\Scripts\activate`)
- Install Required Packages: `pip install -r requirements.txt`. A quick sanity check of the installation is sketched just after this list.
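After installing the requirements, a few lines of Python can confirm the environment looks sane. The package names checked below (selenium, pandas) are assumptions for illustration; the authoritative list is in requirements.txt.

```python
# Environment sanity check (illustrative sketch; package names are assumptions,
# see requirements.txt for the real dependency list).
import importlib
import sys

# True when a virtual environment is active.
print("Virtual environment active:", sys.prefix != sys.base_prefix)

for name in ("selenium", "pandas"):  # assumed dependencies
    try:
        module = importlib.import_module(name)
        print(f"{name} {getattr(module, '__version__', '(version unknown)')} is installed")
    except ImportError:
        print(f"{name} is missing; re-run `pip install -r requirements.txt`")
```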
After setting up, you can run the main script: `python main.py`

Upon successful execution of the script, you should expect two generated files:
- A `CSV` file.
- A `parquet` file.

Both files will contain the scraped data.
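For context, writing the same table to both formats is typically a two-liner with pandas. The snippet below is only an illustration; the column names and output file names are made up and may not match what main.py actually produces, and the Parquet write needs pyarrow or fastparquet installed.

```python
# Illustration of producing CSV and Parquet outputs from scraped rows with pandas.
# Column names and file names are assumptions, not the project's actual schema.
import pandas as pd

rows = [
    {"title": "Example item", "url": "https://example.com/item/1"},
    {"title": "Another item", "url": "https://example.com/item/2"},
]

df = pd.DataFrame(rows)
df.to_csv("scraped_data.csv", index=False)
df.to_parquet("scraped_data.parquet", index=False)  # requires pyarrow or fastparquet
```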
Feel free to fork this repository and submit pull requests. If you encounter any issues or have suggestions, please open an issue in the repository.