This repository contains the code and datasets used in the paper "Citation Amnesia: NLP and Other Academic Fields Are in a Citation Age Recession". Our work analyzes citation patterns across various academic fields to investigate trends in citation ages, highlighting the growing focus on recent literature at the expense of older, foundational works.
- data/: Contains datasets by field, including citation relationships, grant information, institutional affiliations, and more.
- figures/: Directory containing generated figures and plots.
- output/: Contains generated tables and other outputs from the analysis.
- analysis.ipynb: Jupyter notebook with main plots and analysis from the paper.
- helpers.py: Python module with functions for calculations and postprocessing.
- dataset.py: Python script for downloading and handling the dataset.
To replicate the analysis or explore the datasets, follow these steps:
- Clone the repository to your local machine.
- Ensure you have the required dependencies installed. A list of dependencies can be found in requirements.txt.
- Run dataset.py to download and prepare the datasets.
- Use helpers.py for specific calculations and data postprocessing tasks.
- Open and execute the analysis.ipynb notebook for visualizations and further analysis.
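Before running the later steps, it can help to confirm that the packages listed in requirements.txt are actually installed. The following is a minimal sketch, not part of this repository: the function names `requirement_names` and `missing_packages` are hypothetical, and it assumes requirements.txt uses plain `name` or `name==version` style lines.

```python
import re
from importlib import metadata
from pathlib import Path


def requirement_names(text):
    """Extract bare package names from requirements-style text (hypothetical helper)."""
    names = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line or line.startswith("-"):  # skip blank lines and pip options
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names for which no installed distribution is found."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    req_file = Path("requirements.txt")
    if req_file.exists():
        missing = missing_packages(requirement_names(req_file.read_text()))
        print("Missing packages:", missing if missing else "none")
```

The check only looks at whether a distribution is installed, not whether its version satisfies the pinned constraint, so it complements rather than replaces a regular `pip install -r requirements.txt`.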
To download and prepare the dataset, run:
python dataset.py
This script will download the necessary data files and prepare them for analysis.
helpers.py contains functions for various calculations and data processing tasks. The analysis notebook uses these functions, and they can also be imported into other scripts.
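The functions in helpers.py are not listed here, so the example below does not use its API. It is only an illustrative sketch of the kind of calculation involved, assuming that the age of a citation is the publication year of the citing paper minus the publication year of the cited paper. The names `citation_ages` and `mean_citation_age` and the sample years are hypothetical.

```python
from statistics import mean


def citation_ages(citing_year, cited_years):
    """Age of each citation: citing year minus cited year (assumed definition)."""
    return [citing_year - year for year in cited_years]


def mean_citation_age(papers):
    """Average citation age over (citing_year, cited_years) pairs.

    Returns None when there are no citations at all.
    """
    ages = [
        age
        for citing_year, cited_years in papers
        for age in citation_ages(citing_year, cited_years)
    ]
    return mean(ages) if ages else None


# Made-up example data: two papers and the publication years of their references.
papers = [
    (2020, [2019, 2015, 1998]),
    (2023, [2022, 2021]),
]
print(mean_citation_age(papers))
```

A lower mean citation age for a field or year would indicate that papers cite more recent work on average, which is the kind of trend the analysis notebook examines.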
Open analysis.ipynb in a Jupyter environment with R (for example, with GitHub Codespaces) to see the key plots and analysis performed in the study.
This project is licensed under the MIT License - see the LICENSE file for details.
If you use the code or data from this repository in your research, please cite our paper:
@article{wahle2024citationamnesia,
title={Citation Amnesia: NLP and Other Academic Fields Are in a Citation Age Recession},
author={Wahle, Jan Philip and Ruas, Terry and Abdalla, Mohamed and Gipp, Bela and Mohammad, Saif M.},
journal={arXiv preprint arXiv:2402.12046},
year={2024}
}
Contributions to this project are welcome! Please submit issues and pull requests with any suggestions, corrections, or enhancements.