forked from facebookresearch/mbrl-lib
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Change SAC library to use the implementation written by pranz24 (facebookresearch#142)

* Added pytorch_sac_pranz24 dependency.
* Changes to pytorch_sac_pranz24 to make it easier to integrate with MBRL-Lib.
* Changed SACAgent to use pranz24's SAC.
* Added test for new add_batch of pranz24's SAC.
* Changed MBPO to use pranz24's SAC.
* Update to MBPO config files.
* Fixed broken MBPO test.
* Added batched= option to SAC.select_action (pranz24's).
* Fixed bug in complete_agent_cfg caused by 'action_space' key.
* Fixed incompatibility between pranz24's GaussianPolicy and hydra.
* Added logger to pranz24.SAC.
* Added option to change target entropy in pranz24.SAC.
* Added add_batch method to mbrl.util.ReplayBuffer.
* Changed pranz24's SAC to use mbrl.util.ReplayBuffer.
* Added mbrl.util.Logger to pranz24's SAC.
* Added --target_entropy arg to pranz24's SAC.
* Changed MBPO to use mbrl.util.ReplayBuffer both for model and SAC.
* Added option to tell pranz24's SAC.update_parameters() to use ~masks.
* Added mbrl.planning.load_agent implementation for new SAC.
* Updated config files for new SAC.
* Added mbrl.Logger to pytorch_sac_pranz24 and removed tensorboard.
* Added option to use real data for SAC in MBPO, with some small probability.
* [bug-fix] Fixed incorrect termination function for humanoid in make_env.
* Updated MBPO config for inverted pendulum.
* Updated config files for hopper and humanoid.
* Updated configs for MBPO on ant, humanoid and walker.
* Run black and update CHANGELOG.
Showing 38 changed files with 1,169 additions and 234 deletions.