[Question] ValueError: the actions value are set to NaN after 10 steps

### Required prerequisites

- [X] I have read the documentation <https://omnisafe.readthedocs.io>.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/omnisafe/issues) and [Discussions](https://github.com/PKU-Alignment/omnisafe/discussions) that this hasn't already been reported. (+1 or comment there if it has.)
- [ ] Consider asking first in a [Discussion](https://github.com/PKU-Alignment/omnisafe/discussions/new).

### Questions

Hello. Thank you for your effort of this interesting project.

Currently, I am working with my costomized environment. I followed the tutorial of [[Environment Customization From Scratch](https://colab.research.google.com/github/PKU-Alignment/omnisafe/blob/main/tutorials/English/3.Environment%20Customization%20from%20Scratch.ipynb)](url) . My costomized env has a fixed steps for each epochs, 96 steps per epoch.

I have run the single step check of my env, and the results wents well. However, when I run the training loop, I found that after 10 steps, the actions values became NaN and raise the error:
![BugScreenShot](https://github.com/user-attachments/assets/a147616a-edd1-4d90-bfb5-563f3f024205)
After reading the exising discussions, bugs and questions, I found my question is very similar to the [https://github.com/PKU-Alignment/omnisafe/issues/165 [bug](https://github.com/PKU-Alignment/omnisafe/issues?q=ValueError+label%3Abug)](url), that the problems may lies in the configuration. However, after I checked my configuration, there are no value that control the early termination after 10 steps. Here is my configuration:
![config](https://github.com/user-attachments/assets/d1b1cd6d-adb7-477d-9a08-f63b114bd770)
[config.json](https://github.com/user-attachments/files/18113185/config.json)
Therefore, I would like to ask if there is any internal parameters of the algorithms that, for example, trunicate the env after 10 steps?

Thank you for your help.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] ValueError: the actions value are set to NaN after 10 steps #359

Required prerequisites

Questions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development