Closed
Description
Describe the bug
Try to run the original notebook failed.
- The
env.reset()
returns a tuple of two items forCartPole-v1
only. We expect only the state. - The
env.step(action)
returns a tuple of 5 items forCartPole-v1
only. We expect only 4 items in the tuple. - The softmax function has a dimension of 1 which is wrong, it should be 0.
- The
record_video()
function fails forCartPole-v1
environment only for some reason.
Material
I didn't use Google Colab and here is my OS info:
- Windows 10
Metadata
Metadata
Assignees
Labels
No labels