Open

Description
Has anyone got DDPG with ou_0.2 noise parameter to converge in MountainCarContinuous-v0 environment? The rollout/return_history stays around -10 after 1 million steps. In the ddpg paper, MountainCarContinuous converges to full score way before it hits 1 million steps.
Any suggestions on how to tune it would be great.
Metadata
Metadata
Assignees
Labels
No labels