Skip to content

Reset policy together with the environment during evaluation. #1024

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

yongjincho
Copy link
Contributor

@yongjincho yongjincho commented Apr 24, 2025

What this does

When evaluating the trained policy, the environment is reset at the beginning of each episode. However, since the policy is not being reset, the action queue is not cleared, and an action from the previous episode is used as the first action of the next episode.

To prevent this, I reset the policy together with the environment.

How it was tested

I tested the pi0 model on the Aloha robot using this code.

@yongjincho yongjincho changed the title Reset policy and env together Reset policy together with the environment during evaluation. Apr 24, 2025
@imstevenpmwork imstevenpmwork added bug Something isn’t working correctly policies Items related to robot policies labels Apr 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn’t working correctly policies Items related to robot policies
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants