You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, thank you for this wonder example! Your network design was wonderful!
I am just somehow confused about the reward. It seems that you just let each agent continues to move for max_t steps and gathered all the rewards? I think this might be an overestimate for the reward since it's still accumulating reward signals even after a fall. What do you think?
Thank you!
The text was updated successfully, but these errors were encountered:
Hi, thank you for this wonder example! Your network design was wonderful!
I am just somehow confused about the reward. It seems that you just let each agent continues to move for max_t steps and gathered all the rewards? I think this might be an overestimate for the reward since it's still accumulating reward signals even after a fall. What do you think?
Thank you!
The text was updated successfully, but these errors were encountered: