Skip to content
This repository was archived by the owner on May 6, 2021. It is now read-only.

Conversation

@norci
Copy link
Member

@norci norci commented Aug 31, 2020

  1. added more tensorboard logs in rl experiments.
  2. added Loss values for DDPG policy
  3. adjusted stop conditions, for faster testing.
  4. refactor.

Copy link
Member

@findmyway findmyway left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great thanks!

I never made the time to do this kind of refactoring. 😂

1. use single @info in each step.
2. added reward log in some experiments.
3. increased step number for BasicDQN MountainCar,
due to 10000 steps is not enough for it.
@findmyway findmyway merged commit 87a63f0 into JuliaReinforcementLearning:master Sep 1, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants