Skip to content

umd-huang-lab/COPlanner

 
 

Repository files navigation

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

ICLR 2024 Poster

Xiyao Wang · Ruijie Zheng · Yanchao Sun · Ruonan Jia · Wichayaporn Wongkamjan · Huazhe Xu · Furong Huang

Logo

🛠️ Usage

We provide scripts to train and evaluate policies of different backbones (MBPO, DreamerV2, and DreamerV3) in separate folders.

For MBPO, in the MuJoCo environment, we implement COplanner based on mbpo_pytorch. In DeepMind Control Suite, we utilize MBRL-lib.

🏷️ License

This repository is released under the MIT license. See LICENSE for additional details.

🙏 Acknowledgement

Our code is built upon mbpo_pytorch, MBRL-lib, DreamerV2-pytorch, and DreamerV3. We thank all these authors for their nicely open sourced code and their great contributions to the community.

📝 Citation

If you find our work useful, please consider citing:

@article{wang2023coplanner,
  title={COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL},
  author={Wang, Xiyao and Zheng, Ruijie and Sun, Yanchao and Jia, Ruonan and Wongkamjan, Wichayaporn and Xu, Huazhe and Huang, Furong},
  journal={arXiv preprint arXiv:2310.07220},
  year={2023}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.6%
  • Other 0.4%