Guozheng Ma · Linrui Zhang · Haoyu Wang · Lu Li · Zilin Wang
Zhen Wang · Li Shen · Xueqian Wang · Dacheng Tao
To investigate this issue and further explore the potential of data augmentation (DA), we conduct comprehensive experiments to assess how DA's attributes affect its efficacy, and provide the following insights and improvements:
- For individual DA operations, we reveal that both ample spatial diversity and slight hardness are indispensable. Building on this finding, we introduce Random PadResize (Rand PR), a new DA operation that offers abundant spatial diversity with minimal hardness.
- For multi-type DA fusion schemes, increased DA hardness and an unstable data distribution prevent current fusion schemes from achieving higher sample efficiency than their corresponding individual operations. Taking the non-stationary nature of RL into account, we propose an RL-tailored multi-type DA fusion scheme called Cycling Augmentation (CycAug), which cycles periodically through different DA operations to increase type diversity while maintaining data distribution consistency.
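
The periodic cycling idea behind CycAug can be sketched in a few lines. This is only an illustrative sketch, not the code in this repository: the class name `CyclingAugmentation`, the `cycle_length` parameter, and the two toy operations are hypothetical, and the actual cycle schedule and DA operations used in the paper may differ.

```python
from typing import Any, Callable, Sequence


class CyclingAugmentation:
    """Apply one DA operation at a time, switching every `cycle_length` steps.

    Hypothetical helper for illustration; not the repository's API.
    """

    def __init__(self, ops: Sequence[Callable[[Any], Any]], cycle_length: int) -> None:
        if not ops:
            raise ValueError("at least one augmentation operation is required")
        if cycle_length <= 0:
            raise ValueError("cycle_length must be positive")
        self.ops = list(ops)
        self.cycle_length = cycle_length

    def op_index(self, step: int) -> int:
        # Within one cycle the same operation is used, so the data
        # distribution stays fixed; it changes only at cycle boundaries.
        return (step // self.cycle_length) % len(self.ops)

    def __call__(self, obs: Any, step: int) -> Any:
        return self.ops[self.op_index(step)](obs)


# Toy stand-ins for real image augmentations (assumptions for the demo only).
def toy_shift(obs: str) -> str:
    return f"shift({obs})"


def toy_pad_resize(obs: str) -> str:
    return f"pad_resize({obs})"


cyc = CyclingAugmentation([toy_shift, toy_pad_resize], cycle_length=3)
schedule = [cyc.op_index(step) for step in range(8)]
print(schedule)  # [0, 0, 0, 1, 1, 1, 0, 0]
```

In a training loop, `step` would typically be the environment or update step counter, so that every sample drawn during one cycle is augmented by the same operation.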
The implementations of CycAug on DMC tasks and CARLA tasks can be found in the subfolders `CycAug_on_DMC` and `CycAug_on_CARLA`, respectively.
If you encounter any issues during the implementation process, please feel free to reach out to me via email at mgz21@mails.tsinghua.edu.cn.
If this repository is useful to you, please consider citing our paper:
@article{ma2023learning,
title={Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning},
author={Ma, Guozheng and Zhang, Linrui and Wang, Haoyu and Li, Lu and Wang, Zilin and Wang, Zhen and Shen, Li and Wang, Xueqian and Tao, Dacheng},
journal={arXiv preprint arXiv:2305.16379},
year={2023}
}