Skip to content

DDPG never converge #764

Closed
Closed
@iswaverly

Description

@iswaverly

I trained DDPG with command refered in README, which is "python -m baselines.run --alg=ddpg --env=HalfCheetah-v2 --num_timesteps=1e6".
After 1000000 steps, the reward is still negative.
I tried other games, such as "popper", none of this can I get the correct result.

I use the master branch with latest code, and my tensorflow-gpu version is 1.8.0.

Have anyone train DDPG success?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions