This repository was archived by the owner on May 6, 2021. It is now read-only.

Optimizer setting in PPO experiments #118

@takasho777

Description

Hi! I'm using the PPO implementation for my custom environment with a continuous action space. I built my custom experiments on the PPO pendulum experiment template, where the actor and critic are each defined explicitly with optimizer=ADAM(3e-4). After experimenting with it for a while, I realized that if I want to change the learning rate (or other optimizer settings), I have to change the optimizer defined as part of the ActorCritic type. It looks like the optimizers defined for the actor and critic individually are never used, so it would be less confusing if the template specified the optimizer only in the ActorCritic call.
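
For reference, here is a rough sketch of the setup I mean (keyword names and layer sizes are from the template version I'm on, so treat them as approximate, not the exact template values):

```julia
using Flux
using ReinforcementLearning

# Rough sketch of the approximator setup from the pendulum template,
# with placeholder layer sizes:
ac = ActorCritic(
    actor = NeuralNetworkApproximator(
        model = GaussianNetwork(
            pre = Chain(Dense(3, 64, relu)),
            μ = Chain(Dense(64, 1, tanh)),
            logσ = Chain(Dense(64, 1)),
        ),
        optimizer = ADAM(3e-4),  # changing this had no effect for me
    ),
    critic = NeuralNetworkApproximator(
        model = Chain(Dense(3, 64, relu), Dense(64, 1)),
        optimizer = ADAM(3e-4),  # changing this had no effect either
    ),
    optimizer = ADAM(3e-4),  # only this one actually changes the learning rate
)
```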
