SAILOR is an inverse RL algorithm that learns world and reward models to search at test-time and recover from mistakes.
-
Updated
Nov 2, 2025 - Python
SAILOR is an inverse RL algorithm that learns world and reward models to search at test-time and recover from mistakes.
Active Imitation Learing with Noisy Guidance
Integrating π₀-Droid into SAILOR for robust robotic imitation learning, combining pretrained vision-language-action priors with recovery-based search on RoboMimic.
Add a description, image, and links to the learning-to-search topic page so that developers can more easily learn about it.
To associate your repository with the learning-to-search topic, visit your repo's landing page and select "manage topics."