This repository contains two Jupyter Notebooks of Python code:
- download.ipynb - downloads Chronicling America's Title Essay content from the Library of Congress API and saves it as a dataset. It describes through the code necessary to perform these operations and gives the reader pointers to customize their own datasets.
- analysis.ipynb - transforms this dataset using basic Natural Language Processing techniques.
To run this code in your browser, click the Binder button below:
This project was created by Daniel Evans while working as a Pathways Intern with the Division of Preservation and Access at the National Endownment for the Humanities.
These notebooks are meant to assist in drawing out research questions from American print culture. They act as starting points through which researchers can download all title essays and associated metadata, but also allow researchers to curate their own data sets through the selection of specific time period, location, or metadata fields. Furthermore, the code within them provides points for researchers to ask further questions from Chronicling America that are not immediately available in the current metadata or full-text search.