chronamdatasets

This repository contains two Jupyter Notebooks of Python code:

download.ipynb - downloads Chronicling America's Title Essay content from the Library of Congress API and saves it as a dataset. It describes through the code necessary to perform these operations and gives the reader pointers to customize their own datasets.
analysis.ipynb - transforms this dataset using basic Natural Language Processing techniques.

To run this code in your browser, click the Binder button below:

This project was created by Daniel Evans while working as a Pathways Intern with the Division of Preservation and Access at the National Endownment for the Humanities.

These notebooks are meant to assist in drawing out research questions from American print culture. They act as starting points through which researchers can download all title essays and associated metadata, but also allow researchers to curate their own data sets through the selection of specific time period, location, or metadata fields. Furthermore, the code within them provides points for researchers to ask further questions from Chronicling America that are not immediately available in the current metadata or full-text search.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analysis.ipynb		analysis.ipynb
chronam.csv		chronam.csv
download.ipynb		download.ipynb
final.csv		final.csv
lc_output.csv		lc_output.csv
raw.csv		raw.csv
requirements.txt		requirements.txt
user_agents.txt		user_agents.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

chronamdatasets

About

Releases

Packages

Contributors 2

Languages

License

danieljohnevans/chronamdatasets

Folders and files

Latest commit

History

Repository files navigation

chronamdatasets

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages