This repository provides Python code and tools to obtain and use publicly available data on federal rulemakings. The purpose of this repository is to facilitate research on regulation using government big data.
This repository will be expanded and updated regularly.
The current version of the repository contains the following folders:
This folder contains Python code to obtain and organize data from the semiannual Unified Agenda of Regulatory and Deregulatory Actions since Fall 1995. The code generates a clean CSV file covering the rulemaking actions published in the Unified Agendas that a user specifies. For each action, the CSV file contains most of the information available on a Unified Agenda page (see an example here), including RIN, rule title, abstract, agency, priority (e.g., economically significant or significant), RIN status, rule stage, CFR citation, legal authority, legal deadlines, and timetable (i.e., action, date, and FR citation).
Unified Agenda Data | |
---|---|
Source: | Reginfo.gov Unified Agenda XML Reports |
Timeframe: | Fall 1995 - The latest available date |
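
The XML-to-CSV step described for this folder can be illustrated with a minimal sketch. It is not the repository's actual code: the tag names (`AGENDA`, `RULE`, `RIN`, `TITLE`, `STAGE`, `TIMETABLE`, `ITEM`, `ACTION`, `DATE`), the sample values, and the helper names `rules_to_rows` and `rows_to_csv` are all hypothetical, and the real Reginfo.gov XML reports should be inspected for their actual schema. The sketch shows one way to flatten a nested timetable into a single CSV cell.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Made-up sample document; the tag names are assumptions, not the
# real Reginfo.gov schema.
SAMPLE_XML = """<AGENDA>
  <RULE>
    <RIN>0000-AA00</RIN>
    <TITLE>Example Rule One</TITLE>
    <STAGE>Proposed Rule Stage</STAGE>
    <TIMETABLE>
      <ITEM><ACTION>NPRM</ACTION><DATE>2020-01</DATE></ITEM>
      <ITEM><ACTION>Final Rule</ACTION><DATE>2021-06</DATE></ITEM>
    </TIMETABLE>
  </RULE>
  <RULE>
    <RIN>0000-AA01</RIN>
    <TITLE>Example Rule Two</TITLE>
    <STAGE>Final Rule Stage</STAGE>
  </RULE>
</AGENDA>"""

FIELDS = ["rin", "title", "stage", "timetable"]


def rules_to_rows(xml_text):
    """Turn each (hypothetical) <RULE> element into a flat dict."""
    root = ET.fromstring(xml_text)
    rows = []
    for rule in root.iter("RULE"):
        # Join all timetable entries into one cell, e.g. "NPRM (2020-01); ..."
        items = [
            f"{item.findtext('ACTION', default='')} "
            f"({item.findtext('DATE', default='')})"
            for item in rule.iter("ITEM")
        ]
        rows.append({
            "rin": rule.findtext("RIN", default=""),
            "title": rule.findtext("TITLE", default=""),
            "stage": rule.findtext("STAGE", default=""),
            "timetable": "; ".join(items),
        })
    return rows


def rows_to_csv(rows):
    """Serialize the flat dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(rules_to_rows(SAMPLE_XML)))
```

Missing elements fall back to empty strings, so a rule without a timetable still produces a complete row.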
This folder contains Python code to obtain and organize data on rulemaking actions reviewed by the Office of Information and Regulatory Affairs (OIRA) since 1981. The code generates a clean CSV file covering all the actions reviewed by OIRA during the time period that a user specifies. For each action, the CSV file includes RIN, rule title, agency code, agency name, date and year received, date and year completed, rule stage, economic significance, major rule status, legal deadline, and decision.
OIRA Review Data | |
---|---|
Source: | Reginfo.gov Regulatory Review XML Reports |
Timeframe: | 1981 - The latest available year |
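
As an illustration of how the received and completed dates in such a CSV file might be used, the sketch below computes the length of each review for actions received within a chosen range of years. The column names (`rin`, `date_received`, `date_completed`), the ISO date format, the sample rows, and the function name `review_days` are assumptions made for this example; the generated file may name and format these fields differently.

```python
import csv
import io
from datetime import date

# Made-up sample rows; column names and date format are assumptions.
SAMPLE_CSV = """rin,date_received,date_completed
0000-AA00,2020-01-10,2020-02-09
0000-AA01,2021-03-01,2021-03-15
"""


def review_days(csv_text, start_year, end_year):
    """Map each RIN to its review length in days.

    Only actions received between start_year and end_year (inclusive)
    are kept.
    """
    results = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        received = date.fromisoformat(row["date_received"])
        completed = date.fromisoformat(row["date_completed"])
        if start_year <= received.year <= end_year:
            results[row["rin"]] = (completed - received).days
    return results


if __name__ == "__main__":
    print(review_days(SAMPLE_CSV, 2020, 2021))
```

If a RIN appears more than once in the file, this sketch keeps only the last review; collecting the values in a list per RIN would preserve all of them.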
This folder contains Python code for basic data analysis, such as generating page, word, and character counts and readability scores for a batch of PDF or Word documents.
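
The kind of counts and scores mentioned above can be sketched for plain text as follows. This is an assumption-laden illustration, not the repository's method: it skips PDF and Word text extraction entirely, uses the Flesch Reading Ease formula as one possible readability score, and estimates syllables with a naive vowel-group heuristic. The function names `count_syllables` and `text_stats` are hypothetical.

```python
import re


def count_syllables(word):
    """Rough syllable estimate: count runs of vowels (at least 1)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_stats(text):
    """Return character, word, and sentence counts plus Flesch Reading Ease."""
    words = re.findall(r"[A-Za-z']+", text)
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = len(words)
    n_sentences = max(1, len(sentences))
    n_syllables = sum(count_syllables(w) for w in words)

    flesch = None
    if n_words:
        # Flesch Reading Ease:
        # 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)
        flesch = (
            206.835
            - 1.015 * (n_words / n_sentences)
            - 84.6 * (n_syllables / n_words)
        )
    return {
        "characters": len(text),
        "words": n_words,
        "sentences": len(sentences),
        "flesch_reading_ease": flesch,
    }


if __name__ == "__main__":
    documents = {"example.txt": "The cat sat. The dog ran!"}
    for name, text in documents.items():
        print(name, text_stats(text))
```

For a batch of documents, the same function can be applied to the text extracted from each file, with one output row per document.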