This repository contains supplementary code for the following paper:
Bots as Active News Promoters: A Digital Analysis of COVID-19 Tweets
DOI: 10.3390/info11100461Ahmed Al-Rawi, Vishal Shukla
This code is being used to perform analyses on COVID19 Twitter dataset collected by Twitter Capture and Analysis Toolset (TCAT). It generates files for the identified active users, Botometer scores for the user handles, terms, hashtags, emojis and sentiment for the tweets.
This code has the following listed requirements to run successfully.
- Python 3.x
- Apache Spark
To install Python package dependencies, use pip install -r requirements.txt
command.
The collected Twitter data is expected to be in data
directory. Same directory will be used by the script to write generated output files. The entire dataset will be made available upon request.
.
├── data
│ ├── tcat_COVID-20200403-20200405------------fullExport--a04e82a9e6.csv # Collected data files
│ ├── tcat_COVID-20200404-20200405------------fullExport--a04e82a9e6.csv
│ ├── tcat_COVID-20200412-20200414------------fullExport--ecc23817f6.csv
. .
. .
. .
│ └── tcat_COVID-20200414-20200416------------fullExport--ecc23817f6.csv
├── LICENSE
├── README.md
├── requirements.txt
└── src
├── bots.py
├── main.py
└── udfs.py
python ./src/main.py