WIP: PETS algorithm from facebook/mbrl #531

albheim · 2021-10-13T07:47:43Z

PR Checklist

Update NEWS.md?

Base implementation of PETS (see also facebookresearch/mbrl-lib). It is not working yet. I might not have much time for this in the near future, so I thought I would at least put it up here in case anyone is interested.

There are a few things we could discuss regarding how best to implement this, for example:

- How should we allow checking the reward and termination of states estimated by the model?
  - Currently this is implemented as a custom wrapper around the env that overrides reward and is_terminated with kwargs for state/action, which default to the env's current values if not set. It seems reasonably clean and was the most general way I could come up with, but I'm very open to other suggestions.
- Should the clamp be done in GaussianNetwork rather than implemented separately, and if so, should we allow setting the desired clamping method?
  - Currently I changed it to allow setting the clamping method, but I'm leaning towards thinking that having the user set it as the last layer in the network definition might be clearer.
- Do we want some overarching optimizer abstraction to allow for connecting general optimizers?
  - The facebook mbrl repo has, I think, both a general optimizer for a single step and a trajectory optimizer for full trajectories. That seemed like overkill for a first version, but maybe it would be nice to aim for something more general along those lines?
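
For readers following the first point, here is a minimal sketch of the wrapper idea. It is written in Python only for illustration and is not the code in this PR. The toy environment, the class names (`ToyPoleEnv`, `ModelQueryWrapper`), the reward formula, and the angle threshold are all hypothetical. It only shows how `reward` and `is_terminated` can take optional state/action arguments that fall back to the env's current values.

```python
class ToyPoleEnv:
    """Hypothetical toy environment with a (position, angle) state."""

    def __init__(self):
        self.state = (0.0, 0.0)
        self.last_action = 0.0

    def step(self, action):
        # Made-up linear dynamics, just enough to change the state.
        pos, angle = self.state
        self.state = (pos + 0.1 * action, angle + 0.05 * action)
        self.last_action = action


class ModelQueryWrapper:
    """Wraps an env so reward/termination can be evaluated on arbitrary states.

    With no arguments the methods use the env's current state and last action;
    a planner can instead pass states predicted by a learned model.
    """

    def __init__(self, env, max_angle=0.2):
        self.env = env
        self.max_angle = max_angle  # hypothetical termination threshold

    def step(self, action):
        self.env.step(action)

    def reward(self, state=None, action=None):
        state = self.env.state if state is None else state
        action = self.env.last_action if action is None else action
        _, angle = state
        # Hypothetical reward: penalise pole angle and control effort.
        return 1.0 - angle ** 2 - 0.01 * action ** 2

    def is_terminated(self, state=None):
        state = self.env.state if state is None else state
        return abs(state[1]) > self.max_angle


env = ModelQueryWrapper(ToyPoleEnv())
env.step(1.0)
real_reward = env.reward()                                 # uses the env's own state
imagined_reward = env.reward(state=(0.0, 0.3), action=0.0)  # model-predicted state
imagined_done = env.is_terminated(state=(0.0, 0.3))
```

The same object then serves both the real rollout and the planner's imagined rollouts, at the cost of requiring that reward and termination be computable from state and action alone.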

jeremiahpslewis · 2023-05-22T19:06:38Z

Closing this for now, feel free to reopen at a later point in time.

albheim and others added 11 commits September 28, 2021 17:57

extend cartpole env

53318b3

add softclamp and option to gaussiannet

40dd946

initial pets

79e47cb

update cartpole continuous

dc3e362

updates

0e6e99b

Merge branch 'master' into albheim/pets

8e4c52a

rename file

cd1742b

fix state space name

13e04ee

add logging and termination check

95b0754

add note

d9cd4ea

Merge branch 'master' into albheim/pets

ea1f60e

albheim mentioned this pull request Oct 25, 2021

Model based reinforcement learning #262

Closed

findmyway mentioned this pull request Apr 4, 2022

Next Release Plan (v0.11) #614

Closed

52 tasks

jeremiahpslewis closed this May 22, 2023

jeremiahpslewis deleted the albheim/pets branch May 23, 2023 13:38
