Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
machine-learning reinforcement-learning deep-learning deep-reinforcement-learning neural-networks neural-program-synthesis neural-programmer-interpreter
-
Updated
Oct 3, 2023 - Python