- Beautiful Soup
- Pandas
- re (Regular Expressions)
- getData(): This method takes in a single parameter i.e. the url to gather/scrape data and store it in excel file after undergoing some cleaning operations using regular expressions.
- Then the above method is looped so as to gather data from all available pages.
- Now a deep cleaning of data is performed using pandas.
- Finally the cleaned data is stored as an excel file.
@author: Obed Junias