[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.
flow robotics rl manipulation locomotion vla robot-learning fine-tuning post-training actorcritic pi0 policygradient finetuning-rl visuomotor finetuning-vision-models flowmatching onlinerl
-
Updated
Dec 23, 2025 - Python