Calculation of scores for each episode #1

ZeratuuLL · 2019-02-23T20:34:49Z

Hi, thank you for this wonder example! Your network design was wonderful!

I am just somehow confused about the reward. It seems that you just let each agent continues to move for max_t steps and gathered all the rewards? I think this might be an overestimate for the reward since it's still accumulating reward signals even after a fall. What do you think?

Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calculation of scores for each episode #1

Calculation of scores for each episode #1

ZeratuuLL commented Feb 23, 2019

Calculation of scores for each episode #1

Calculation of scores for each episode #1

Comments

ZeratuuLL commented Feb 23, 2019