This repository is the official PyTorch implementation of "Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations".
Please see the installation instructions of ManiSkill and ManiSkill-Learn. We build our method on top of ManiSkill-Learn, a framework for training agents on the SAPIEN Open-Source Manipulation Skill Challenge.
To use this project with the ManiSkill benchmark, first overwrite the original ManiSkill-Learn folder with this one, and then install ManiSkill and ManiSkill-Learn.
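A minimal sketch of that setup order, assuming a local ManiSkill-Learn checkout at a placeholder path and an editable pip install (neither is specified in this README; follow the two projects' own install docs for the authoritative steps):

# Overwrite the original ManiSkill-Learn folder with this repository.
cp -r ./* /path/to/ManiSkill-Learn/
# Then install ManiSkill (per its own instructions), followed by ManiSkill-Learn.
cd /path/to/ManiSkill-Learn
pip install -e .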
Please download the ManiSkill demonstration dataset from here and store it in the folder "full_mani_skill_data", as instructed in ManiSkill-Learn.
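The expected location is simply a folder named full_mani_skill_data at the repository root; the per-task layout inside it follows ManiSkill-Learn and is assumed in this sketch:

# Create the data folder and extract the downloaded demonstrations into it.
mkdir -p full_mani_skill_data
# e.g. full_mani_skill_data/OpenCabinetDoor/... after extraction (assumed layout)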
The training scripts are provided under scripts/train_rl_agent. We give examples of using our methods to train on the "OpenCabinetDoor" environment; a launch example follows the list below.
Method I (GAIL): run the shell command scripts/train_rl_agent/run_GAIL_baseline_door.sh
Method II (GAIL + Progressive Growing of Discriminator): run the shell command scripts/train_rl_agent/run_GAIL_progressive_door.sh
Method III (GAIL + Self-Imitation Learning from Demonstrations): run the shell command scripts/train_rl_agent/run_GAIL_GASILfD_door.sh
Method IV (GAIL + Self-Imitation Learning from Demonstrations + CLIB Expert Buffer): run the shell command scripts/train_rl_agent/run_GAIL_CLIB_door.sh
Method V (GAIL + Progressive Growing of Discriminator + Self-Imitation Learning from Demonstrations + CLIB Expert Buffer): run the shell command scripts/train_rl_agent/run_GAIL_use_all_door.sh
SAC: run the shell command scripts/train_rl_agent/run_SAC_door.sh
GAIL + Dense Reward: first temporarily modify mani_skill_learn/methods/mfrl/gail.py at line 119 and set env_r to 0.5 to enable the environment's dense reward (a sketch of this edit follows the list), then run the shell command scripts/train_rl_agent/run_GAIL_baseline_door.sh
Method V + Dense Reward: make the same env_r modification at line 119 of mani_skill_learn/methods/mfrl/gail.py, then run the shell command scripts/train_rl_agent/run_GAIL_use_all_door.sh
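The env_r edit above is easiest to make by hand in an editor. The one-liner below is only a sketch, assuming line 119 is an assignment of the form env_r = <value> (the actual line is not reproduced in this README):

# In-place edit of line 119; revert it after training, since the change is temporary.
sed -i '119s/env_r *=.*/env_r = 0.5/' mani_skill_learn/methods/mfrl/gail.py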
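Each of the runs above is launched the same way; for example, Method I from the repository root (invoking the script through bash is an assumption, the scripts may also be directly executable):

# Train the GAIL baseline (Method I) on OpenCabinetDoor.
bash scripts/train_rl_agent/run_GAIL_baseline_door.sh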
The evaluation command for a single checkpoint is given below. Replace the config path, work-dir path, and checkpoint path with those of your own model:
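# Evaluates 100 episodes over 2 processes, logs results, and saves videos but not trajectories.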
python -m tools.run_rl {config_path} --evaluation --gpu-ids=0 \
--work-dir={work-dir_path} \
--resume-from {checkpoint_path} \
--cfg-options "env_cfg.env_name=OpenCabinetDoor-v0" "eval_cfg.num=100" "eval_cfg.num_procs=2" "eval_cfg.use_log=True" "eval_cfg.save_traj=False" "eval_cfg.save_video=True"
You can either split the training dataset to construct a validation set, or submit your solutions to the SAPIEN Open-Source Manipulation Skill Challenge to test performance.
If you find our work useful in your research, please consider citing:
@article{shen2022learning,
  title={Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations},
  author={Shen, Hao and Wan, Weikang and Wang, He},
  journal={arXiv preprint arXiv:2203.02107},
  year={2022}
}
This work and the dataset are licensed under CC BY-NC 4.0.
