Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
reinforcement-learning paper semi-supervised-learning bandits bandit contextual-bandits contextual-bandit self-supervised-learning nonstationary-environments
-
Updated
Sep 21, 2020 - MATLAB