Web-Scraping-Starter-Kit

A starter repository that helps beginners grasp the basics of web scraping through simple guides and runnable examples.

Repository Contents

This repository includes four essential Python scripts for web scraping:

Web.py
This script introduces the basics of web scraping. It captures and prints data from a website to the terminal.
WebDataToExcel.py
This script extracts data from a website and saves it to an Excel sheet, with two columns: Heading and Content.
WebImgToFolder.py
This script retrieves image source paths via web scraping and downloads the images, saving them to a specified folder.
PaginatedDataSetToExcel.py
This script scrapes data from a paginated site and saves it to an Excel sheet with seven separate columns, organized page by page.
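
The Heading and Content idea behind the first two scripts can be sketched without any installs. The example below is only an illustration, not the code in Web.py or WebDataToExcel.py: it uses the standard library's html.parser as a stand-in for BeautifulSoup, and the class name, function name, sample HTML, and the choice of h2 and p tags are all assumptions made for the sketch.

```python
from html.parser import HTMLParser


class HeadingContentParser(HTMLParser):
    """Collect rows: each <h2> heading paired with the <p> text that follows it."""

    def __init__(self):
        super().__init__()
        self.rows = []            # one [heading, content] pair per <h2>
        self._current_tag = None  # "h2" or "p" while inside that tag

    def handle_starttag(self, tag, attrs):
        if tag in ("h2", "p"):
            self._current_tag = tag
            if tag == "h2":
                # A new heading starts a new row with empty content.
                self.rows.append(["", ""])

    def handle_endtag(self, tag):
        if tag == self._current_tag:
            self._current_tag = None

    def handle_data(self, data):
        text = data.strip()
        # Ignore whitespace, text outside h2/p, and paragraphs before any heading.
        if not text or self._current_tag is None or not self.rows:
            return
        column = 0 if self._current_tag == "h2" else 1
        # Join multiple text chunks with a single space.
        self.rows[-1][column] = (self.rows[-1][column] + " " + text).strip()


def extract_rows(html):
    """Return a list of (heading, content) tuples found in the HTML string."""
    parser = HeadingContentParser()
    parser.feed(html)
    parser.close()
    return [tuple(row) for row in parser.rows]


# Hypothetical page content, used so the sketch runs offline.
SAMPLE_HTML = """
<html><body>
  <h2>First topic</h2><p>Text about the first topic.</p>
  <h2>Second topic</h2><p>Text about the second topic.</p>
</body></html>
"""

if __name__ == "__main__":
    for heading, content in extract_rows(SAMPLE_HTML):
        print(f"{heading}: {content}")
```

Printing the rows matches what the first script is described as doing; writing the same rows to a spreadsheet is the step the second script adds.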

How to Use

Clone the Repository

git clone https://github.com/gayanukabulegoda/Web-Scraping-Starter-Kit.git

Navigate to the Project Directory
```
cd Web-Scraping-Starter-Kit
```
Run the Scripts

For Web.py:
```
python Web.py
```
For WebDataToExcel.py:
```
python WebDataToExcel.py
```
For WebImgToFolder.py:
```
python WebImgToFolder.py
```
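
Image source paths on a page are often relative, so they usually need to be resolved against the page URL before downloading. The sketch below shows one way to do that with the standard library. It is not taken from WebImgToFolder.py: the function names, the example.com URLs, and the file naming rule are assumptions for illustration.

```python
import os
from urllib.parse import urljoin, urlparse
from urllib.request import urlretrieve


def resolve_image_urls(page_url, src_values):
    """Turn raw src attribute values into absolute URLs, skipping empty ones."""
    return [urljoin(page_url, src) for src in src_values if src]


def local_filename(image_url, folder):
    """Build a local path from the last segment of the URL path (query ignored)."""
    name = os.path.basename(urlparse(image_url).path) or "image"
    return os.path.join(folder, name)


def download_images(page_url, src_values, folder):
    """Download every image into folder. Needs network access."""
    os.makedirs(folder, exist_ok=True)
    for image_url in resolve_image_urls(page_url, src_values):
        urlretrieve(image_url, local_filename(image_url, folder))


if __name__ == "__main__":
    # Hypothetical page URL and src values; nothing is downloaded here.
    page = "https://example.com/gallery/index.html"
    for url in resolve_image_urls(page, ["img/a.png", "/static/b.jpg"]):
        print(url, "->", local_filename(url, "images"))
```

Two images that share a file name would overwrite each other under this naming rule, so a real script may want to add a counter or a hash to the name.
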
For PaginatedDataSetToExcel.py:
```
python PaginatedDataSetToExcel.py
```
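
Scraping a paginated site comes down to a loop that requests one page at a time and stops when a page has no more rows. The sketch below is a minimal illustration of that loop, not the code in PaginatedDataSetToExcel.py. The function names, the stop-on-empty-page rule, the page limit, and the two-column fake data (the real script writes seven columns) are all assumptions.

```python
def scrape_all_pages(fetch_page, first_page=1, max_pages=50):
    """Collect rows page by page until a page returns no rows.

    fetch_page is a caller-supplied function: page number -> list of rows.
    max_pages is a safety limit so the loop always terminates.
    """
    all_rows = []
    for page in range(first_page, first_page + max_pages):
        rows = fetch_page(page)
        if not rows:
            break  # an empty page is treated as the end of the data set
        all_rows.extend(rows)
    return all_rows


# Stand-in for a real request-and-parse step, so the sketch runs offline.
FAKE_SITE = {
    1: [("Ada", "Engineer"), ("Grace", "Admiral")],
    2: [("Linus", "Developer")],
}


def fake_fetch_page(page):
    """Return the fake rows for a page, or an empty list past the last page."""
    return FAKE_SITE.get(page, [])


if __name__ == "__main__":
    for row in scrape_all_pages(fake_fetch_page):
        print(row)
```

Passing the fetch step in as a function keeps the loop testable without network access; a real version would replace fake_fetch_page with a function that downloads and parses each page.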

Dependencies

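Make sure the required Python libraries are installed. pandas may additionally need an Excel engine such as openpyxl to write .xlsx files, so install that too if a script reports it missing. The core libraries can be installed with pip: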

pip install requests beautifulsoup4 pandas
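
A quick way to confirm the install worked is to check that each library can be found by Python. The sketch below is an optional helper, not part of the repository. The function name and the mapping are assumptions, though the one naming quirk it encodes is real: beautifulsoup4 is imported as bs4.

```python
import importlib.util

# pip name -> import name. Only beautifulsoup4 differs (imported as bs4).
REQUIRED = {"requests": "requests", "beautifulsoup4": "bs4", "pandas": "pandas"}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose import name cannot be found."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```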

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

For any questions or inquiries, please contact me via LinkedIn.
