Code for ARCH: Efficient Adversarial Regularized Training with Caching, Findings of EMNLP 2021.
- The most convenient way to run the code is to use the Docker image `tartarusz/adv-train:azure-pytorch-apex-v1.7.0`. The image supports running on Microsoft Azure; a minimal launch command is sketched below.
- Our implementation is modified from the Fairseq code base.
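A minimal sketch of pulling the image and starting a container, assuming Docker with the NVIDIA Container Toolkit is installed; the mount point and working directory are illustrative:

```bash
# Pull the published image.
docker pull tartarusz/adv-train:azure-pytorch-apex-v1.7.0

# Start an interactive GPU container with this repository mounted at /workspace
# (the --gpus flag requires the NVIDIA Container Toolkit).
docker run --gpus all -it --rm \
    -v "$(pwd)":/workspace -w /workspace \
    tartarusz/adv-train:azure-pytorch-apex-v1.7.0 bash
```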
- Please refer to the Fairseq examples for dataset pre-processing; a typical invocation is sketched below.
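For instance, following Fairseq's translation example, binarizing a tokenized dataset looks roughly like this (the IWSLT14 De-En paths are placeholders for your own data):

```bash
# Binarize a tokenized parallel corpus into data-bin/ for training.
TEXT=examples/translation/iwslt14.tokenized.de-en
fairseq-preprocess --source-lang de --target-lang en \
    --trainpref $TEXT/train --validpref $TEXT/valid --testpref $TEXT/test \
    --destdir data-bin/iwslt14.tokenized.de-en \
    --workers 20
```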
- Run `pip install -e .` to install locally.
- Use `bash get_nearest_samples.sh [path-to-checkpoint]` to pre-compute a neighbor file. Here, `path-to-checkpoint` is any pre-trained model.
- Use `bash run.sh` to run the code. To use random neighbors instead of pre-computed ones, remove the `--neighbor-file` argument and add a `--prop-neighbors [prop]` argument to randomly select `prop` indices. Both modes are sketched below.
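A sketch of the two modes; the checkpoint path is a placeholder, and the exact training command inside `run.sh` may differ from what is assumed here:

```bash
# Pre-compute the neighbor file from any pre-trained model
# (checkpoints/pretrained.pt is a hypothetical path).
bash get_nearest_samples.sh checkpoints/pretrained.pt

# Train with the pre-computed neighbors (run.sh is assumed to pass
# --neighbor-file to the training command).
bash run.sh

# For random neighbors instead, edit the training command in run.sh:
# drop --neighbor-file and add, e.g., --prop-neighbors 0.1.
```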
- The major modifications from the original Fairseq code base are the following:
  - `fairseq/criterions/cache_loss.py` is the main file that handles caching.
  - `adv_dataset.py` stores and constructs the adversarial perturbations.
  - `fairseq/models/transformer.py` modifies the embedding to include adversarial perturbations.
  - `fairseq/tasks/fairseq_task.py` contains the adversarial training procedure.
Please cite the following paper if you use this code.
```bibtex
@article{zuo2021arch,
  title={ARCH: Efficient Adversarial Regularized Training with Caching},
  author={Zuo, Simiao and Liang, Chen and Jiang, Haoming and He, Pengcheng and Liu, Xiaodong and Gao, Jianfeng and Chen, Weizhu and Zhao, Tuo},
  journal={arXiv preprint arXiv:2109.07048},
  year={2021}
}
```