A student–teacher framework for image understanding using step-wise reasoning, reward scoring, and Gemini 2.5 Flash as the expert evaluator.
deep-reinforcement-learning cognitive-architecture visual-reasoning image-understanding gemini-2-5 step-wise-reasoning reward-scoring student-teacher-framework
-
Updated
Dec 4, 2025 - Python