Implementation of MLM_transfer
Environment:
- python==3.6
- pytorch==0.4.1
- theano==1.0.4
- nltk==3.0.0b2 (included)
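If it is unclear whether these pinned versions are the ones actually active, a quick check like the following (assuming the import names match the packages above) prints what the interpreter sees:

```python
# Sanity check for the pinned environment; expected values are taken from
# the list above (python 3.6, pytorch 0.4.1, theano 1.0.4, nltk 3.0.0b2).
import sys
import torch
import theano
import nltk

print("python :", sys.version.split()[0])
print("pytorch:", torch.__version__)
print("theano :", theano.__version__)
print("nltk   :", nltk.__version__)
```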
Procedures:
- Three mask methods (each is sketched below, after its corresponding commands):
  - attention_based method
  - frequency_ratio method
  - fusion_method method
- Preprocessing command (results are already included, so there is no need to run it again; a sketch of the masking step follows): bash run_preprocess.sh
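Preprocessing builds the masked corpora for the three methods. As a rough illustration of the masking step itself (not the repo's actual code; the `style_words` set and the `[MASK]` convention here are illustrative assumptions), identified style words are replaced so the fine-tuned MLM can later infill them:

```python
# Hypothetical illustration of the masking step: every token identified as
# a style marker is replaced by BERT's [MASK] token; the fine-tuned MLM
# then infills target-style words at those positions.
def mask_style_words(tokens, style_words, mask_token="[MASK]"):
    return [mask_token if t in style_words else t for t in tokens]

tokens = "the food was terrible and the staff was rude".split()
print(mask_style_words(tokens, {"terrible", "rude"}))
# ['the', 'food', 'was', '[MASK]', 'and', 'the', 'staff', 'was', '[MASK]']
```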
Two steps (the second step has two variants; a sketch of the soft-sampling idea follows this list, and policy gradient is sketched in the note at the end):
- MLM -> fine_tune_cbert.py
- MLM-SS (soft-sampling) -> fine_tune_cbert_w_cls.py
- MLM-PG (policy gradient) -> fine_tune_cbert_w_cls_pg.py
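The difference between the two second-step variants is how the classifier gradient reaches the MLM. A minimal sketch of the soft-sampling idea (names, shapes, and the stand-in classifier are assumptions, not the repo's code):

```python
import torch
import torch.nn.functional as F

# Hedged sketch of the soft-sampling idea behind MLM-SS: instead of
# sampling a hard token at each masked position (non-differentiable), take
# the softmax over the MLM logits and feed the expected embedding to the
# classifier, so the classifier loss can back-propagate into the MLM.
vocab_size, emb_dim = 100, 16
embedding = torch.nn.Embedding(vocab_size, emb_dim)

logits = torch.randn(1, 5, vocab_size, requires_grad=True)  # fake MLM output
probs = F.softmax(logits, dim=-1)              # soft "samples" over the vocab
soft_embs = probs @ embedding.weight           # expected embeddings, (1, 5, emb_dim)

cls_loss = soft_embs.mean()   # stand-in for a differentiable classifier loss
cls_loss.backward()           # gradient reaches the MLM logits
print(logits.grad is not None)  # True
```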
Commands:
Corresponds to the attention_based mask method (sketched below):
- bash scripts/attention_based/fine_tune_yelp_attention_based.sh
- bash scripts/attention_based/fine_tune_amazon_attention_based.sh
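The attention_based method uses a style classifier's attention weights to decide which tokens are style markers. A toy sketch (the weights and the thresholding rule are illustrative assumptions):

```python
# Hypothetical sketch of attention-based style-word selection: mark tokens
# whose classifier attention weight exceeds the (scaled) average weight.
def attention_mask_positions(attn_weights, factor=1.0):
    mean_w = sum(attn_weights) / len(attn_weights)
    return [i for i, w in enumerate(attn_weights) if w > factor * mean_w]

tokens = "the food was terrible".split()
attn = [0.05, 0.10, 0.05, 0.80]   # made-up attention weights from a classifier
print([tokens[i] for i in attention_mask_positions(attn)])  # ['terrible']
```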
Corresponds to the frequency_ratio mask method (sketched below):
- bash scripts/frequency_ratio/fine_tune_yelp_frequency_ratio.sh
- bash scripts/frequency_ratio/fine_tune_amazon_frequency_ratio.sh
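The frequency_ratio method instead flags words whose smoothed frequency ratio between the two style corpora is strongly skewed. A toy sketch (corpora, smoothing, and threshold are illustrative assumptions):

```python
from collections import Counter

# Hypothetical sketch of the frequency_ratio idea: a word is treated as a
# style marker when it occurs much more often in one style corpus than in
# the other, in either direction.
def style_ratio(word, pos_counts, neg_counts, smooth=1.0):
    return (pos_counts[word] + smooth) / (neg_counts[word] + smooth)

pos = Counter("great great delicious food food".split())
neg = Counter("terrible terrible rude food food".split())

threshold = 2.0
for w in sorted(set(pos) | set(neg)):
    r = style_ratio(w, pos, neg)
    if r >= threshold or r <= 1.0 / threshold:   # strongly skewed either way
        print(w, round(r, 2))   # prints the style-marker candidates
```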
Corresponds to the fusion_method mask method (sketched below):
- bash scripts/fusion_method/fine_tune_yelp_fusion_method.sh
- bash scripts/fusion_method/fine_tune_amazon_fusion_method.sh
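The fusion_method combines the two signals above. One simple combination rule (an assumption for illustration, not necessarily the repo's rule) is to mask any position that either method flags:

```python
# Hypothetical fusion of the two mask signals: a position is masked if the
# attention-based method or the frequency-ratio method marks it.
def fused_positions(attn_flags, freq_flags):
    return [i for i, (a, f) in enumerate(zip(attn_flags, freq_flags)) if a or f]

attn_flags = [False, False, False, True]   # from the attention-based method
freq_flags = [False, False, True, True]    # from the frequency-ratio method
print(fused_positions(attn_flags, freq_flags))  # [2, 3]
```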
Note: The accuracy results produced here are lower than in the original paper, but the BLEU scores are higher; this is a trade-off between accuracy and BLEU. To match the paper's results, modify line 153 of fine_tune_cbert_w_cls.py, changing `if lm_loss.item() > 1.5:` to `if lm_loss.item() > 1.7:` (or a higher threshold). A hedged sketch of this gate follows.
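For orientation, one plausible shape of that gate (only the `lm_loss.item() > 1.5` test is from the repo; the loss names and branch bodies here are assumptions): while the MLM loss is still high the model trains on fluency alone, and the classifier loss, which drives transfer accuracy, is added only once generation is fluent enough.

```python
import torch

# Hedged sketch of the threshold gate (branch bodies are assumptions):
lm_loss = torch.tensor(1.2, requires_grad=True)   # stand-in MLM loss
cls_loss = torch.tensor(0.4, requires_grad=True)  # stand-in classifier loss

threshold = 1.5   # raising this towards 1.7+ trades BLEU for accuracy

if lm_loss.item() > threshold:
    loss = lm_loss              # not fluent yet: optimize the MLM alone
else:
    loss = lm_loss + cls_loss   # fluent enough: add the style signal

loss.backward()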
We also tried using policy gradient instead of soft-sampling to back-propagate the gradient, and we encourage you to implement it yourself; a minimal sketch of the idea follows.
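A minimal REINFORCE-style sketch of that idea (all names, shapes, and the stand-in reward are assumptions, not the MLM-PG code): sample hard tokens from the MLM, treat the style classifier's score as a reward, and scale the samples' log-probabilities by it so a gradient flows despite the discrete sampling.

```python
import torch
import torch.nn.functional as F

# Minimal REINFORCE sketch: reward-weighted log-likelihood of hard samples.
vocab_size = 100
logits = torch.randn(5, vocab_size, requires_grad=True)  # fake MLM output
log_probs = F.log_softmax(logits, dim=-1)

samples = torch.multinomial(log_probs.exp(), 1)   # hard tokens, shape (5, 1)
sampled_logp = log_probs.gather(1, samples).sum()

reward = 0.8                       # stand-in: classifier prob of target style
pg_loss = -reward * sampled_logp   # REINFORCE objective
pg_loss.backward()
print(logits.grad.abs().sum() > 0)  # gradient reached the logits
```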