Closed
Description
See: https://arxiv.org/abs/1602.01783 .
It described a RL method without replay memory. such as n-step Q-learning, A3C.
See: https://arxiv.org/abs/1602.01783 .
It described a RL method without replay memory. such as n-step Q-learning, A3C.