Skip to content

A fun little bot that scrapes Wikiquote.org for movie quotes and persists or prints them at random.

Notifications You must be signed in to change notification settings

JamesSauer/Go-Movie-Quote-Bot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Movie Quote Bot

  • Finds all movie entries on WikiQuote.org
  • Scrapes quotes off the pages
  • Optionally persists quotes in a Postgres database
  • Picks one at random and prints it to the console

Usage

$ go get github.com/JamesSauer/Go-Movie-Quote-Bot
$ cd $GOPATH/src/github.com/JamesSauer/Go-Movie-Quote-Bot
$ go install
$ mqbot
> That's part of your problem: you haven't seen enough movies. All of life's riddles are answered in the movies.
>   - Davis, Grand Canyon (1991 film)

By default, mqbot attempts to retrieve a quote from the database before scraping Wikiquote.
It checks the MQBOT_POSTGRES environment variable for a connection string.

To test the database connection:

$ mqbot testdb
> Successfully connected to DB!

To set up the schema:

$ mqbot initdb
> Successfully set up database schema.

To force scraping a fresh quote, use the --fresh or -f flag:

$ mqbot -f
> ...

To avoid using scraping as a fallback, use the --database or -db flag:

$ mqbot -db
> ...

To scrape a random movie entry and persist all its quotes to the database:

$ mqbot scrape1
> Successfully scraped and saved the entry for the movie "Angels with Dirty Faces"!

To scrape ALL movie entries:

$ mqbot scrapeall
> This command will attempt to scrape the entirety of wikiquote.org's movie quotes.
> This might take more than 10 minutes. Do you want to proceed? (yes/y/no/n)
$ yes
> Scraped 2378 pages in 12m10.7716143s!

To check some stats:

$ mqbot stats
> MQBot has collected and stored 24809 quotes from 6281 characters and 2377 movies this far.

About

A fun little bot that scrapes Wikiquote.org for movie quotes and persists or prints them at random.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published