GitHub - crystalyue/multi-view-rumor-detection

Rumor Detection on Social Media: A Multi-View Model using Self-Attention Mechanism

This repository is the source code of paper: Rumor Detection on Social Media: A Multi-View Model using Self-Attention Mechanism

Dependencies

Python 3.6 PyTorch 0.4.1 Numpy 1.15.4

Preparation

Download Weibo dataset at http://alt.qcri.org/~wgao/data/rumdect.zip
Download Tencent AI Lab Embedding Corpus at https://ai.tencent.com/ailab/nlp/embedding.html and place it at path 'embedding_data/tencent/Tencent_AILab_ChineseEmbedding.txt'
Download Baidu Senta Corpus at https://github.com/baidu/Senta
Download Bert pre-trained model at https://github.com/google-research/bert
Fine-tuning Bert pre-trained model with Baidu Senta Corpus. You can refer Bert readme file to finish this work.
Extract source post and replies of Weibo dataset(save the source post and all replies of each event in a file), get sentiment view input embbedding using the fine-tuned Bert model and pack the result file into 'embedding_data/bert/weibo_sentiment.tar'
Use Jieba to process the content and replies of each event file(for each post in each event file, cut 'text' into a word list through Jieba) and save the corpus in directory 'text_data/weibo/dataset', place the label file at path 'text_data/weibo/label/weibo.txt'

Running

Run the command to train content view model and get prediction in test dataset:

python train.py -corpus weibo -content

Run the command to train reply view model and get prediction in test dataset:

python train.py -corpus weibo -reply

Run the command to train sentiment view model and get prediction in test dataset:

python train.py -corpus weibo -bert_vector -bert_vector_type sentiment

The results are saved at directory 'text_data/weibo/result'

After getting three views' predictions of test dataset(the train and test dataset are the same for three views), please deploy a vote program to get final prediction.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.idea		.idea
embedding_data		embedding_data
text_data		text_data
.DS_Store		.DS_Store
attention.py		attention.py
config.py		config.py
dataset.py		dataset.py
embedding.py		embedding.py
event.py		event.py
model.py		model.py
preview.py		preview.py
readme.md		readme.md
train.py		train.py
vocab.py		vocab.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rumor Detection on Social Media: A Multi-View Model using Self-Attention Mechanism

Dependencies

Preparation

Running

About

Releases

Packages

Languages

crystalyue/multi-view-rumor-detection

Folders and files

Latest commit

History

Repository files navigation

Rumor Detection on Social Media: A Multi-View Model using Self-Attention Mechanism

Dependencies

Preparation

Running

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages