Skip to content

A program to parsing and querying user lists in IMDB.

Notifications You must be signed in to change notification settings

Bekci/IMDBQuerier

Repository files navigation

IMDBQuerier

This project is written to parsing films from IMDB user lists based on some attributes. It uses Selenium and BeautifulSoup to obtain and parse the film data.

Until now, the project can parse films based on their:

  • Runtime
  • Score
  • Year
  • Genre
  • Type (TV show or film)

Currently, one can make the exact queries on the refine section at the bottom of each user list. However, it is hard to apply your selections to all lists.

Requirements

Selenium and BeautifulSoup modules are necessary for the project. Other than that, you will need a WebDriver. The project is using ChromeDriver but you can change it to the other supported browsers easily.

If you have changed the driver, make sure to change the below code accordingly.

# main.py line 16
driver = webdriver.Chrome()

Here is a link for the Firefox driver.

Usage

First of all, change the values in the parse_options dictionary in the parser_config.py.

Then, change the value of list_url variable in the main.py code to the list wanted to be parsed.

Run the code, the output html will apear in list_htmls folder.

About

A program to parsing and querying user lists in IMDB.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published