Skip to content

Python module to scrape data from IMDb Top 250 and parse xml of movie data from OMDb API (www.omdbapi.com)

License

Notifications You must be signed in to change notification settings

AnanyaSharma22/pymdb

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

pymdb

Python script to scrape data from IMDb Top250 (www.imdb.com/chart/top) and parse xml of movie data from OMDb API (www.omdbapi.com)

##Installation Download the pymdb.py script into your Python27 folder and import it in your scripts.

##Usage Put the source in the folder where you want to use it and then import as import pymdb.

##Functions top250_id() Pulls out ImDb Title ID's from the Imdb Top 250 webpage and adds them to a list. Returns List

top250_yearcount()
    Creates a dictionary of number of movies of a particular year in the Imdb Top 250.
    Returns a dictionary

top_250()
    Pulls out movie names from the Imdb Top 250 webpage and adds them to a dictionary with its position.
    Returns dictionary

years_top250()
        Creates a dictionary of year of the movies of the top 250 in the order of ranking.
    Returns  dictionary

##Using the Movie and MovieId Class Initialise: m = pymdb.Movie("The Shawshank Redemption") m = pymdb.Movie("The Shawshank Redemption",1994) m = pymdb.MovieId("tt0111161")

Use Class functions:
  m.year()
  m.director()
  m.getposter()
  .
  .
  type help(pymdb.Movie) for more.

##Example script: Download posters of all movies in IMDb Top250.

import pymdb
ids = pymdb.top250_id()
for id in ids:
  m = pymdb.MovieId(id)
  m.getposter()

P.S. The Movie class works for any movie. It doesn't have to be in the Top 250.

About

Python module to scrape data from IMDb Top 250 and parse xml of movie data from OMDb API (www.omdbapi.com)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%