The goal of this project is to build a community driven cyberbullying detection system.
- Cyberbullying datasets - Mendeley Data
- TweetBLM: A Hate Speech Dataset and Analysis of BlackLivesMatter-related Microblogs on Twitter | Zenodo
- Hate Speech Identification - dataset by crowdflower | data.world
- Hate Speech Twitter annotations by Waseem and Hovy
- A Large-Scale English Multi-Label Twitter Dataset for Cyberbullying and Online Abuse Detection - ACL Anthology
- Cyber Bullying Types Datasets | IEEE DataPort
- Mendeley Data - Cyberbullying Datasets - sourced from Kaggle, Twitter, Wikipedia Talk pages and YouTube
- Training BERT for Cyberbullying Detection - HF Trainer Baseline
- A Large-Scale English Multi-Label Twitter Dataset for Online Abuse Detection
- Wikipedia Detox (used by the Mendeley Data)
- Unsupervised Cyberbullying Detection via Time-Informed Gaussian Mixture Model
- Towards Understanding and Detecting Cyberbullying in Real-world Images
- Detection of Cyberbullying Incidents on the Instagram Social Network
- XBully: Cyberbullying Detection within a Multi-Modal Context