This is the basic Python code behind the @big_cases Twitter bot.
The bot uses PACER RSS feeds to gather the latest filings from 74 U.S. District Courts and five federal courts of appeals and stores the docket entries in a database. It matches new filings against a preselected list of major cases, scrapes matching documents from PACER, uploads them to a DocumentCloud project and posts the results on Twitter.
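For illustration, here is a minimal sketch of that matching step, assuming a single court feed and a watched-case list; the feed URL, docket number and title-parsing regex below are examples, not the bot's actual configuration:

```python
# A hedged sketch of the RSS-matching step; the real scripts may differ.
import re
import feedparser

FEED_URL = "https://ecf.dcd.uscourts.gov/cgi-bin/rss_outside.pl"  # example: D.D.C. feed
WATCHED_CASES = {"1:17-cr-00201"}  # hypothetical docket numbers to follow

feed = feedparser.parse(FEED_URL)
for entry in feed.entries:
    # PACER entry titles typically begin with the docket number,
    # e.g. "1:17-cr-00201 USA v. MANAFORT et al"
    match = re.match(r"(\d+:\d+-\w+-\d+)", entry.title)
    if match and match.group(1) in WATCHED_CASES:
        print(entry.title, entry.link)  # a new filing in a watched case
```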
Note that some federal district courts -- including those in the Eastern District of Virginia, the District of Maryland and the District of Hawaii -- have elected not to publish an RSS feed from their dockets. As a result, Big-Cases can't follow cases in those districts. Most federal courts of appeals also do not publish RSS feeds of new docket entries.
Big-Cases uses three main Python scripts to gather docket entries, scrape documents and tweet the results. This is mostly a vestige of the fact that the bot was built as an add-on to a larger scraping tool. The scripts run in order, every four minutes (an example crontab follows the list):
- pacer_rss.py - Scrapes PACER's RSS feeds, matches new entries against a list of selected cases and stores the results in a MySQL database.
- bigcases_scrape_docs.py - Attempts to scrape documents from PACER based on new matches identified in the RSS feeds, then uploads them to DocumentCloud (a sketch of the upload step follows the list).
- bigcases.py - Tweets the results (a sketch of the tweeting step follows the list).
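A crontab entry like the following would run the three scripts in order every four minutes; the install path here is a placeholder:

```
*/4 * * * * cd /path/to/big-cases && python pacer_rss.py && python bigcases_scrape_docs.py && python bigcases.py
```

Chaining with && ensures the tweeting step only runs after the scraping steps have finished.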
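The upload step can be done with the python-documentcloud library; this is a sketch under that assumption, and the credentials, project id and file path are placeholders:

```python
# A hedged sketch of the DocumentCloud upload step.
from documentcloud import DocumentCloud

client = DocumentCloud("user@example.com", "password")  # placeholder credentials
document = client.documents.upload(
    "filing.pdf",                     # PDF scraped from PACER
    title="USA v. Manafort: Motion",  # e.g. the docket-entry description
    project="12345",                  # placeholder DocumentCloud project id
    access="public",
)
print(document.canonical_url)         # public link that can go in the tweet
```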
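The tweeting step could look like the following tweepy sketch; the repository's actual Twitter client and credential handling are assumptions here, and all keys and URLs are placeholders:

```python
# A hedged sketch of the tweeting step using tweepy.
import tweepy

auth = tweepy.OAuthHandler("CONSUMER_KEY", "CONSUMER_SECRET")
auth.set_access_token("ACCESS_TOKEN", "ACCESS_TOKEN_SECRET")
api = tweepy.API(auth)

# Compose a status from the docket entry and the DocumentCloud link.
status = ("New filing in USA v. Manafort: Motion to Dismiss "
          "https://www.documentcloud.org/documents/12345-example")
api.update_status(status)
```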
The list of cases Big-Cases is following is stored in bigcases_list.py; the list of available PACER RSS feeds is stored in pacer_rss_feeds.py.
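For orientation, those two files might be shaped roughly like this; the field names and values below are assumptions for illustration, not the repository's actual schema:

```python
# Hypothetical shape of bigcases_list.py -- the watched-case list.
cases = [
    {"court": "dcd", "case_number": "1:17-cr-00201", "name": "USA v. Manafort"},
]

# Hypothetical shape of pacer_rss_feeds.py -- the available feeds.
feeds = [
    "https://ecf.dcd.uscourts.gov/cgi-bin/rss_outside.pl",  # D.D.C.
]
```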
Big-Cases has only been tested on a machine running CentOS Linux 7.