Source code and dataset for "ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning".
The code is based on Hugging Face's transformers. The trained models and pre-training data can be downloaded from Google Drive.
Run the following script to install dependencies.
pip install -r requirement.txt
You need to install transformers and apex manually.
transformers
We use Hugging Face transformers (version 2.5.0) to implement BERT and RoBERTa, so you need to install transformers manually by cloning or downloading the transformers repo. For convenience, we have already placed a copy of transformers in code/pretrain/ so you can import it easily; we have modified some lines in the class BertForMaskedLM in src/transformers/modeling_bert.py while keeping the other code unchanged.
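As a quick sanity check, the following minimal sketch (not part of the repo) shows how to confirm that the bundled, modified copy is the one being imported; the exact import path depends on where the transformers package directory sits under code/pretrain/.

```python
# Minimal sanity check (illustrative, not from the repo): run this from
# code/pretrain/ so the local, modified transformers copy shadows any
# pip-installed version.
import transformers
print(transformers.__version__)   # should report 2.5.0
print(transformers.__file__)      # should point inside code/pretrain/

# BertForMaskedLM is the class that was modified in src/transformers/modeling_bert.py.
from transformers import BertForMaskedLM, BertTokenizer
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
```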
apex
Install apex following the official guidance.
In the folder prepare_pretrain_data, we provide the code for processing the pre-training data.
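Once the data has been processed, a small snippet like the following can be used to take a first look at one output file. The file name and structure shown here are assumptions for illustration; check the actual output of prepare_pretrain_data for the real field names.

```python
# Hypothetical inspection snippet -- the path and field names below are
# placeholders and may not match the real output of prepare_pretrain_data.
import json

with open("data/none.json") as f:   # illustrative path; see --dataset_name in the pre-training command
    docs = json.load(f)

print(type(docs), len(docs))
# Peek at the structure of the first document (keys depend on the processing script).
first = docs[0] if isinstance(docs, list) else next(iter(docs.values()))
print(list(first.keys()) if isinstance(first, dict) else first)
```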
You can use this repo to pretrain a new model. To pretrain ERICA_bert:
cd code/pretrain
python -m torch.distributed.launch --nproc_per_node 8 main.py \
--model DOC --lr 3e-5 --batch_size_per_gpu 16 --max_epoch 105 \
--gradient_accumulation_steps 16 --save_step 500 --temperature 0.05 \
--train_sample --save_dir ckpt_doc_dw_f_alpha_1_uncased --n_gpu 8 --debug 1 --add_none 1 \
--alpha 1 --flow 0 --dataset_name none.json --wiki_loss 1 --doc_loss 1 \
--change_dataset 1 --start_end_token 0 --bert_model bert \
--pretraining_size -1 --ablation 0 --cased 0
Some explanations for the hyper-parameters:
- temperature: the \tau used in the contrastive learning loss (see the sketch after this list).
- debug: whether to debug (we provide an example_debug file for pre-training).
- add_none: whether to add the no_relation pair in the RD loss.
- alpha: the proportion of masking (1 means no masking; in experiments we find masking is not helpful, as described in the main paper, so we do not mask in the pre-training phase for any model, but we leave this option here for further research explorations).
- flow: if masking, whether to use a linear decay.
- wiki_loss: whether to add the ED loss.
- doc_loss: whether to add the RD loss.
- start_end_token: whether to use another entity encoding method.
- cased: whether to use the cased version of BERT.
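To make the role of temperature concrete, here is a minimal PyTorch sketch of a temperature-scaled contrastive (InfoNCE-style) loss over entity/relation representations. It illustrates the general technique only and is not the exact loss implemented in code/pretrain; the function name, tensor shapes and cosine similarity are assumptions.

```python
import torch
import torch.nn.functional as F

def contrastive_loss(query, positives, negatives, temperature=0.05):
    """Temperature-scaled contrastive loss (illustrative sketch only).

    query:     (d,)   representation of one entity/relation instance
    positives: (p, d) representations to pull close to the query
    negatives: (n, d) representations to push away from the query
    """
    # Similarities between the query and all candidates.
    all_reps = torch.cat([positives, negatives], dim=0)       # (p + n, d)
    sims = F.cosine_similarity(query.unsqueeze(0), all_reps)  # (p + n,)
    logits = sims / temperature  # smaller tau -> sharper distribution over candidates

    # Each positive is scored against all candidates (softmax over the full set).
    log_probs = F.log_softmax(logits, dim=0)
    num_pos = positives.size(0)
    return -log_probs[:num_pos].mean()

# Toy usage with random vectors.
q = torch.randn(768)
pos = torch.randn(2, 768)
neg = torch.randn(10, 768)
print(contrastive_loss(q, pos, neg, temperature=0.05).item())
```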
Fine-tuning code for ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning. Please enter the folder for each downstream task (document-level / sentence-level relation extraction, entity typing and question answering) to fine-tune. Before fine-tuning, we assume you have already pre-trained an ERICA model. Execute the bash script in each folder.