charangajjala/IMDB_Scraper

IMDB Scraper


A web scraper project that extracts details of movies, TV shows, and TV episodes from the official IMDb website.

Demo

Basic details

  • [Image: API demo, 2022-10-18]
  • [Image: Database demo, 2022-10-18]

Reviews sample

  • [Image: API demo, 2022-10-18]
  • [Image: Database demo, 2022-10-18]

API Reference

Get the basic details of a movie, TV show, or TV episode.

  POST https://imdb--scraper.herokuapp.com/search
  Body:
      kwd : search keyword
      type: movie, tv_show, or tv_episode

Body        Type     Description
JSON        string   Required

Get reviews of a movie or TV show.

  POST https://imdb--scraper.herokuapp.com/reviews
  Body:
      num : number of reviews to get

Parameter   Type     Description
kwd, type   string   Required
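
As a rough illustration of the two endpoints above, the sketch below builds the POST requests using only the Python standard library. It assumes the body is sent as JSON (as the table suggests) and that `num` is sent alongside `kwd` and `type` for reviews; the helper names (`build_request`, `search_request`, `reviews_request`, `send`) are hypothetical, and the response format is not documented here, so `send` simply decodes whatever JSON comes back.

```python
import json
import urllib.request

BASE_URL = "https://imdb--scraper.herokuapp.com"
ALLOWED_TYPES = {"movie", "tv_show", "tv_episode"}


def build_request(path, payload):
    """Build (but do not send) a JSON POST request for the given path."""
    data = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}{path}",
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def search_request(kwd, type_):
    """Request for /search: basic details of a title."""
    if type_ not in ALLOWED_TYPES:
        raise ValueError(f"type must be one of {sorted(ALLOWED_TYPES)}")
    return build_request("/search", {"kwd": kwd, "type": type_})


def reviews_request(kwd, type_, num):
    """Request for /reviews; sending num as an integer is an assumption."""
    if type_ not in ALLOWED_TYPES:
        raise ValueError(f"type must be one of {sorted(ALLOWED_TYPES)}")
    return build_request("/reviews", {"kwd": kwd, "type": type_, "num": num})


def send(request, timeout=30):
    """Send a prepared request and decode the JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `send(search_request("Inception", "movie"))` would perform the actual lookup, provided the hosted app is still reachable.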

Web Scraping

The scraper collects the following details for each title.

Basic details

  • authors, description, genres, popularity, rating, release date, etc.

Review details

  • date, name, rating, title, helped_votes, total_votes, spoiler
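
To make the review fields listed above concrete, here is a minimal sketch of how one scraped review could be represented in Python. The field names come from the list; the types, the `Review` class itself, and the sample values are assumptions for illustration, not the project's actual schema.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class Review:
    """One scraped review; field types are assumed, not confirmed."""
    date: str               # review date as shown on the page
    name: str               # reviewer's display name
    rating: Optional[int]   # None when the reviewer gave no rating
    title: str              # review headline
    helped_votes: int       # readers who found the review helpful
    total_votes: int        # all readers who voted on the review
    spoiler: bool           # True when the review is flagged as a spoiler


# Made-up sample values, only to show the shape of a record.
sample = Review(
    date="18 October 2022",
    name="example_user",
    rating=8,
    title="Worth watching",
    helped_votes=12,
    total_votes=15,
    spoiler=False,
)

# asdict() gives a plain dict, which is convenient for JSON responses
# or for inserting into a document database.
record = asdict(sample)
```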

Setup and Installation

After cloning or downloading the code, create a virtual environment with Python > 3.6 as explained here.

Client (React) Setup

  cd client
  npm install
  npm start

Server (Flask) Setup

  cd server
  pip install -r requirements.txt
  py app.py    # starts the server

Database (MongoDB) Setup

1. Follow this MongoDB Atlas [tutorial](https://www.mongodb.com/basics/mongodb-atlas-tutorial) to create your own cluster.
2. Create a .env file and copy the contents of the env sample file into it. Then replace the placeholder with your cluster password.
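
One detail worth illustrating for step 2: a cluster password that contains characters such as `@` or `/` must be percent-encoded before it goes into a MongoDB connection string. The sketch below shows one way to assemble an Atlas-style URI from environment variables. The variable names (`MONGO_USER`, `MONGO_PASSWORD`, `MONGO_HOST`) and the helper `build_mongo_uri` are hypothetical and may not match the project's env sample file; the code also assumes the values from `.env` have already been loaded into the environment.

```python
import os
from urllib.parse import quote_plus


def build_mongo_uri(env=None):
    """Assemble a mongodb+srv URI from (hypothetical) environment variables."""
    env = os.environ if env is None else env
    # Percent-encode the credentials so reserved characters do not break the URI.
    user = quote_plus(env["MONGO_USER"])
    password = quote_plus(env["MONGO_PASSWORD"])
    host = env["MONGO_HOST"]  # e.g. the cluster host name shown by Atlas
    return f"mongodb+srv://{user}:{password}@{host}/?retryWrites=true&w=majority"
```

The resulting string can then be passed to whichever MongoDB client the server uses.
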

Deployment on Heroku

Log in or sign up on Heroku to create an app. You can either connect your GitHub account or install the Heroku CLI to deploy this project manually.

  • Next, follow the instructions in the Heroku documentation to deploy a web app.
  • Also, refer to this to set up the environment variables for Selenium and ChromeDriver on Heroku.
