
[Question] ValueError: the actions value are set to NaN after 10 steps #359

Open
@DavidLudl

Description

Required prerequisites

Questions

Hello. Thank you for your work on this interesting project.

Currently, I am working with my customized environment. I followed the tutorial [Environment Customization From Scratch](url). My customized env has a fixed number of steps per epoch: 96 steps per epoch.

I have run the single-step check of my env, and the results looked fine. However, when I run the training loop, the action values become NaN after 10 steps and the following error is raised:
BugScreenShot
After reading the existing discussions, bugs, and questions, I found that my question is very similar to the [https://github.com//issues/165 bug](url), in that the problem may lie in the configuration. However, when I checked my configuration, I found no value that controls early termination after 10 steps. Here is my configuration:
config
config.json
Therefore, I would like to ask whether the algorithms have any internal parameters that, for example, truncate the env after 10 steps.
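
To help narrow this down, the single-step check could be extended to a full 96-step rollout that reports the first step at which an observation or reward stops being finite, since NaN actions are often a downstream symptom of a non-finite value reaching the policy. The sketch below is only an illustration under assumptions: `ToyEnv`, its `bad_step` argument, and `first_non_finite_step` are hypothetical names, the env here returns plain Python floats, and none of this is taken from the library's API or from my actual environment.

```python
import math
import random


class ToyEnv:
    """Hypothetical stand-in for a custom env with a fixed 96-step episode."""

    def __init__(self, horizon=96, bad_step=None):
        self.horizon = horizon
        self.bad_step = bad_step  # step at which a NaN is injected (demo only)
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0, 0.0]

    def step(self, action):
        self.t += 1
        obs = [float(self.t), float(action)]
        reward = -abs(action)
        if self.bad_step is not None and self.t == self.bad_step:
            obs[0] = float("nan")  # simulate a bug inside the env
        truncated = self.t >= self.horizon
        return obs, reward, truncated


def first_non_finite_step(env, n_steps, seed=0):
    """Roll out random actions; return the 1-based step index at which the
    observation or reward first becomes NaN/inf, or None if all are finite."""
    rng = random.Random(seed)
    env.reset()
    for step in range(1, n_steps + 1):
        action = rng.uniform(-1.0, 1.0)
        obs, reward, truncated = env.step(action)
        values = list(obs) + [reward]
        if not all(math.isfinite(v) for v in values):
            return step
        if truncated:
            env.reset()
    return None


if __name__ == "__main__":
    print(first_non_finite_step(ToyEnv(), n_steps=96))
    print(first_non_finite_step(ToyEnv(bad_step=10), n_steps=96))
```

If a check like this passes on the real env for all 96 steps, the NaN more likely originates on the algorithm side (for example in normalization or the policy update) than in the env itself.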

Thank you for your help.
