-
Notifications
You must be signed in to change notification settings - Fork 181
Open
Description
Hello, I have a question about how the scores in the maze environments are calculated.
In the maze2d branch, we can get the reward and the normalized score.
The score is about 1.xx, but in the paper the score are 100.x. So did we just the original score to times 100?
Also, the variance is very small, even in the multi tasks, the variance is less than 1. I cannot reproduce these results. Can you help explain how to calculate the score in the maze environments? Thanks.
Metadata
Metadata
Assignees
Labels
No labels