Skip to content
Change the repository type filter

All

    Repositories list

    • An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
      Python
      MIT License
      212000Updated Feb 24, 2020Feb 24, 2020
    • Example of how to use the new Census API to programmatically scrape data by a variety of geographies
      Python
      0300Updated May 18, 2016May 18, 2016
    • ca-warn

      Public
      A collection and conversion of WARN notices from California
      01200Updated May 13, 2016May 13, 2016
    • A how-to do a mass collection of FEC data using the command-line and regular expressions
      02900Updated Mar 18, 2016Mar 18, 2016
    • Unpacking the U.S. Senate Lobbying Disclosure databases
      0000Updated Mar 8, 2016Mar 8, 2016
    • Storing, archiving, and transcribing the State of Michigan emails released in the wake of the Flint water poisoning
      Python
      0200Updated Mar 1, 2016Mar 1, 2016
    • Download and search the raw text of Secretary Hillary Clinton's released emails
      Python
      72000Updated Feb 22, 2016Feb 22, 2016
    • leso_1033

      Public
      Data from the Pentagon's surplus-equipment-to-local-law-enforcement program
      HTML
      1800Updated Dec 17, 2015Dec 17, 2015
    • Python scripts to compile the NYPD's historical precinct level data into one CSV
      Python
      1200Updated Nov 17, 2015Nov 17, 2015
    • A snapshot of the FAA Section 333 PDFs and their text extracts as of Oct. 1, 2015
      1010Updated Oct 2, 2015Oct 2, 2015
    • A snapshot of Crunchbase's data from October 2013, before its licensing change.
      Other
      82400Updated Sep 11, 2015Sep 11, 2015
    • USGS Earthquake Archive Data
      Python
      0000Updated Aug 16, 2015Aug 16, 2015
    • A repo to produce the Tutorial data because I don't have a good idea on how to coordinate things
      Python
      1000Updated Aug 16, 2015Aug 16, 2015
    • FEC data as gathered from its API
      Python
      0000Updated Aug 10, 2015Aug 10, 2015
    • U.S. Congress member data
      Python
      0100Updated Aug 10, 2015Aug 10, 2015
    • Congress + Twitter
      Python
      1000Updated Aug 10, 2015Aug 10, 2015
    • U.S. Social Security Administration's set of baby names
      Python
      1100Updated Aug 8, 2015Aug 8, 2015
    • Homepage for the stash where I hoard my computational journalism datasets
      0200Updated Aug 3, 2015Aug 3, 2015