CoAID (Covid-19 heAlthcare mIsinformation Dataset) is a diverse COVID-19 healthcare misinformation dataset, including fake news on websites and social platforms, along with users' social engagement about such news. It includes 5,216 news, 296,752 related user engagements, 958 social platform posts about COVID-19, and ground truth labels.
If you use this dataset, we appreciate it if you cite the following paper:
@misc{cui2020coaid,
title={CoAID: COVID-19 Healthcare Misinformation Dataset},
author={Limeng Cui and Dongwon Lee},
year={2020},
eprint={2006.00885},
archivePrefix={arXiv},
primaryClass={cs.SI}
}
Version 0.1 (05/17/2020)
- initial version corresponding to arXiv paper
Version 0.2 (08/03/2020)
- added data from May 1, 2020 through July 1, 2020
Version 0.3 (11/03/2020)
- added data from July 1, 2020 through September 1, 2020
Version 0.4 (01/08/2021)
- added data from September 1, 2020 through November 1, 2020