Reset policy together with the environment during evaluation. #1024

yongjincho · 2025-04-24T04:10:28Z

What this does

When evaluating the trained policy, the environment is reset at the beginning of each episode. However, since the policy is not being reset, the action queue is not cleared, and an action from the previous episode is used as the first action of the next episode.

To prevent this, I reset the policy together with the environment.

How it was tested

I tested the pi0 model on the Aloha robot using this code.

Reset policy and env together

69a20ab

yongjincho changed the title ~~Reset policy and env together~~ Reset policy together with the environment during evaluation. Apr 24, 2025

imstevenpmwork added bug Something isn’t working correctly policies Items related to robot policies labels Apr 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reset policy together with the environment during evaluation. #1024

Reset policy together with the environment during evaluation. #1024

yongjincho commented Apr 24, 2025 •

edited

Loading

Reset policy together with the environment during evaluation. #1024

Are you sure you want to change the base?

Reset policy together with the environment during evaluation. #1024

Conversation

yongjincho commented Apr 24, 2025 • edited Loading

What this does

How it was tested

yongjincho commented Apr 24, 2025 •

edited

Loading