Closed
Description
Is there a list of evaluations for each environment, like this one: https://gym.openai.com/evaluations/eval_aqTWbALwQEKrLIyU9ZzmLw/ ? I ask since most environment pages did show evaluation results, for example https://gym.openai.com/envs/Reacher-v2/ .
Also, where it says:
Reacher-v1 is considered "solved" when the agent obtains an average reward of at least -3.75 over 100 consecutive episodes.
Does "average reward" here mean the mean of the cumulative rewards (each being the sum of the per-step rewards within one episode) over 100 episodes?
Thanks!
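To make the reading I am asking about concrete, here is a minimal sketch of it. This is only my interpretation, not a confirmed definition from Gym: the helper names `episode_return` and `is_solved` are hypothetical, and the threshold of -3.75 and the window of 100 are taken from the quoted sentence.

```python
from collections import deque


def episode_return(step_rewards):
    """Cumulative reward of one episode: the sum of its per-step rewards."""
    return sum(step_rewards)


def is_solved(episode_returns, threshold=-3.75, window=100):
    """Return True if some run of `window` consecutive episodes has a
    mean episode return of at least `threshold`.

    Assumption: "average reward" means the mean of per-episode cumulative
    rewards, which is the interpretation asked about above.
    """
    recent = deque(maxlen=window)  # keeps only the last `window` returns
    for ret in episode_returns:
        recent.append(ret)
        if len(recent) == window and sum(recent) / window >= threshold:
            return True
    return False
```

Under this reading, an agent is checked by collecting one `episode_return` per episode and passing the resulting list to `is_solved`.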