Skip to content

MPEDS/mpeds-coder

Repository files navigation

MPEDS Annotation Interface

The MPEDS Annotation Interface helps facilitate the creation of protest event datasets. While applied to the specific task of coding for protest events, this could also plausibly be used for the development of other types of event datasets or other types of text annotations tasks.

This system is built in Python using the Flask microframework. It can source articles parsed from Lexis-Nexis (using the split-ln.py script), Apache Solr, or XML files formatted in News Industry Text Format, such as the LDC's New York Times Annotated Corpus.

It also uses Bootstrap for CSS and jQuery for JavaScript. It only works in Firefox (for now).

Setup

To populate the database with example information, first run the setup script.

python setup.py

This will add five users: an admin (admin), two first-pass coders (coder1p_1, coder1p_2), and two second-pass coders (coder2p_1, coder2p_2). They will all have the password default). It will add a variable hierarchy for second-pass coding. It will also enter metadata for all the articles in the example-articles directory, and queue them up for the first-pass coders.

Then run the Flask test server with the following.

python mpeds_coder.py

Publications

  1. Hanna, Alex. 2017. MPEDS: Automating the Generation of Protest Event Data. SocArXiv. DOI: 10.31235/osf.io/xuqmv.
  2. Oliver, Pamela, Chaeyoon Lim, Morgan Matthews and Alex Hanna. 2022. "Black Protests in the United States, 1994-2010." Sociological Science 9(May):275-312. DOI: 10.15195/v9.a12.
  3. Oliver, Pamela, Alex Hanna, Chaeyoon Lim. 2022. “Constructing Relational and Verifiable Protest Event Data: Four Challenges and Some Solutions” Forthcoming in Mobilization. Preprint available https://osf.io/preprints/socarxiv/d89g7/

DOI

The DOI for this repository has been created with Zenodo.

DOI

You can cite this software as:

Hanna, Alex and David Skalinder. 2018. MPEDS Annotation Interface (v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.6953975

Acknowledgments

Development of this interface has been supported by a National Science Foundation Graduate Research Fellowship and National Science Foundation grants SES-1423784 and SES-1918342. Thanks to Emanuel Ubert, Katie Fallon, and David Skalinder for working with this system since its inception, and to countless annotators who have put a significant time working with and refining this system.