Skip to content

Commit

Permalink
update nlu
Browse files Browse the repository at this point in the history
  • Loading branch information
logicwong committed Mar 2, 2022
1 parent 0ec5dd9 commit 8c38a15
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 6 deletions.
14 changes: 11 additions & 3 deletions datasets.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,20 @@

We provide links to download our preprocessed dataset. If you would like to process the data on your own, we will soon provide scripts for you to do so.

## Finetuning

## Finetuning for Vision & Language Tasks
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/caption_data/caption_data.zip"> Dataset for Caption </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/refcoco_data/refcoco_data.zip"> Dataset for RefCOCO </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/refcocoplus_data/refcocoplus_data.zip"> Dataset for RefCOCO+ </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/refcocog_data/refcocog_data.zip"> Dataset for RefCOCOg </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/vqa_data/vqa_data.zip"> Dataset for VQAv2 </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/snli_ve_data/snli_ve_data.zip"> Dataset for SNLI-VE </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/coco_image_gen_data/coco_image_gen.zip"> Dataset for Text-to-Image Genearion </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/coco_image_gen_data/coco_image_gen.zip"> Dataset for Text-to-Image Genearion </a>

## Finetuning for Language Tasks
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/cola_data.zip"> Dataset for COLA </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/mnli_data.zip"> Dataset for MNLI </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/mrpc_data.zip"> Dataset for MRPC </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/qnli_data.zip"> Dataset for QNLI </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/qqp_data.zip"> Dataset for QQP </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/rte_data.zip"> Dataset for RTE </a>
* <a href="https://ofa-beijing.oss-cn-beijing.aliyuncs.com/datasets/glue_data/sst2_data.zip"> Dataset for SST2 </a>
4 changes: 2 additions & 2 deletions run_scripts/glue/train_cola.sh
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ user_dir=../../ofa_module
data_dir=../../dataset/glue_data
data=${data_dir}/cola_train.tsv,${data_dir}/cola_dev.tsv
restore_file=../../checkpoints/ofa_large.pt
selected_cols=3,1
selected_cols=1,2

arch=ofa_large
criterion=adjust_label_smoothed_cross_entropy
Expand All @@ -35,7 +35,7 @@ max_tgt_length=30
num_bins=1000
prompt_type="src"

for max_epoch in {7,10}; do
for max_epoch in {5,7,10}; do
echo "max_epoch "${max_epoch}
for lr in {1e-4,7e-5,6e-5,5e-5,3e-5}; do
echo "lr "${lr}
Expand Down
2 changes: 1 addition & 1 deletion run_scripts/glue/train_mrpc.sh
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ user_dir=../../ofa_module
data_dir=../../dataset/glue_data
data=${data_dir}/mrpc_train.tsv,${data_dir}/mrpc_dev.tsv
restore_file=../../checkpoints/ofa_large.pt
selected_cols=1,2,0
selected_cols=0,1,2

arch=ofa_large
criterion=adjust_label_smoothed_cross_entropy
Expand Down

0 comments on commit 8c38a15

Please sign in to comment.