danieljohnevans/chronamdatasets

chronamdatasets

This repository contains two Jupyter Notebooks of Python code:

  • download.ipynb - downloads Chronicling America's Title Essay content from the Library of Congress API and saves it as a dataset. It walks through the code needed to perform these operations and gives readers pointers for customizing their own datasets.
  • analysis.ipynb - transforms this dataset using basic Natural Language Processing techniques.
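
As a rough illustration of the kind of steps the two notebooks describe, the sketch below fetches JSON from a URL and computes simple word counts over essay text. It is not the notebooks' actual code: the helper names (`fetch_json`, `strip_html`, `word_counts`) and the tiny stopword list are assumptions made for this example, and the endpoint URL is left as a parameter because the exact API route is not stated here.

```python
import json
import re
from collections import Counter
from urllib.request import urlopen

# Hypothetical, deliberately tiny stopword list for illustration only.
STOPWORDS = frozenset({"the", "a", "an", "of", "and", "in", "to"})


def fetch_json(url, timeout=30):
    """Fetch a URL and decode its JSON body. The caller supplies the URL."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def strip_html(text):
    """Replace anything that looks like an HTML tag with a space."""
    return re.sub(r"<[^>]+>", " ", text)


def word_counts(text, stopwords=STOPWORDS):
    """Lowercase the text, keep alphabetic tokens, and count non-stopwords."""
    tokens = re.findall(r"[a-z]+", strip_html(text).lower())
    return Counter(token for token in tokens if token not in stopwords)
```

For example, `word_counts(essay_text).most_common(10)` would list the ten most frequent remaining words in a single essay, where `essay_text` is whatever string the download step produced.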

To run this code in your browser, click the Binder button below:

Binder

This project was created by Daniel Evans while working as a Pathways Intern with the Division of Preservation and Access at the National Endowment for the Humanities.

These notebooks are meant to help researchers draw out research questions about American print culture. They serve as starting points: researchers can download all title essays and their associated metadata, or curate their own datasets by selecting specific time periods, locations, or metadata fields. The code also gives researchers a basis for asking questions of Chronicling America that the current metadata and full-text search cannot readily answer.
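
Curating a dataset by time period, location, or metadata field might look something like the sketch below. The record layout (`title`, `state`, `start_year`, `essay`) and the sample rows are placeholders invented for this example, not the actual fields or contents of the downloaded dataset, and `curate` and `to_csv` are hypothetical helper names.

```python
import csv
import io

# Made-up placeholder records; real field names and values may differ.
SAMPLE = [
    {"title": "Example Gazette", "state": "Ohio", "start_year": 1860, "essay": "..."},
    {"title": "Sample Herald", "state": "Ohio", "start_year": 1910, "essay": "..."},
    {"title": "Placeholder Times", "state": "Texas", "start_year": 1870, "essay": "..."},
]


def curate(records, state=None, year_range=None, fields=None):
    """Keep records matching the state and inclusive year range, then
    optionally reduce each record to the requested metadata fields."""
    selected = []
    for record in records:
        if state is not None and record.get("state") != state:
            continue
        if year_range is not None:
            year = record.get("start_year")
            if year is None or not (year_range[0] <= year <= year_range[1]):
                continue
        if fields:
            selected.append({key: record.get(key) for key in fields})
        else:
            selected.append(dict(record))
    return selected


def to_csv(rows):
    """Serialize a list of dicts to CSV text, using the first row's keys as the header."""
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Calling `curate(SAMPLE, state="Ohio", year_range=(1850, 1900), fields=["title", "state"])` keeps only the placeholder Ohio record that starts within the range, and `to_csv` turns the result into text that can be written to a file.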
