Wikipedia Web Scraping Python Project

Description: This Python script uses the BeautifulSoup and Requests libraries to scrape the Wikipedia page listing the largest companies in the United States by revenue. The scraped table is converted into a structured DataFrame with the pandas library, and the final step exports this data to a CSV file.
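The flow described above (parse an HTML table, build a DataFrame, export to CSV) can be sketched as below. This is a minimal illustration, not the notebook's exact code: it parses an inline HTML fragment standing in for the Wikipedia table, since the live page's markup and column names may differ.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Small HTML fragment standing in for the Wikipedia "largest companies"
# table (the live page's exact markup and figures may differ).
SAMPLE_HTML = """
<table class="wikitable">
  <tr><th>Rank</th><th>Name</th><th>Revenue (USD millions)</th></tr>
  <tr><td>1</td><td>Walmart</td><td>611,289</td></tr>
  <tr><td>2</td><td>Amazon</td><td>513,983</td></tr>
</table>
"""

def table_to_dataframe(html: str) -> pd.DataFrame:
    """Parse the first 'wikitable' in the given HTML into a DataFrame."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", class_="wikitable")
    headers = [th.text.strip() for th in table.find_all("th")]
    rows = []
    for tr in table.find_all("tr")[1:]:  # skip the header row
        cells = [td.text.strip() for td in tr.find_all("td")]
        if cells:
            rows.append(cells)
    return pd.DataFrame(rows, columns=headers)

df = table_to_dataframe(SAMPLE_HTML)
df.to_csv("companies.csv", index=False)

# Against the live page, the same parsing applies after fetching the HTML,
# e.g. with requests:
#   html = requests.get(
#       "https://en.wikipedia.org/wiki/"
#       "List_of_largest_companies_in_the_United_States_by_revenue"
#   ).text
```

Pulling the header cells separately from the body rows keeps the DataFrame columns labeled, so the exported CSV is self-describing.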

Usage:

  1. Clone the repository: git clone https://github.com/SaiSurajMatta/Wikipedia-Web-Scraping-Python-Project
  2. Install the required dependencies: pip install beautifulsoup4 requests pandas
  3. Open and run the notebook: Wikipedia_Web_Scraping_Project.ipynb

Requirements:

  • Python 3
  • BeautifulSoup
  • Requests
  • Pandas

How to Contribute:

  1. Fork the repository.
  2. Create a new branch: git checkout -b feature/new-feature.
  3. Make your changes and commit them: git commit -m 'Add new feature'.
  4. Push to the branch: git push origin feature/new-feature.
  5. Create a pull request.
