Skip to content

Alternative handling of max steps in environment #140

Closed
@oysteinsolheim

Description

@oysteinsolheim

Is it possible to have an alternative way of handling max_steps in continuing environments? As of now the terminal field is set to 'true' when the environment reaches the max_steps even though it's state is random and not a terminal one. In practice it probably doesn't matter much but as it is now the update of the value function is incorrect when reaching the max_steps, I think. Agree?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions