Closed
Description
In current design of distributed rl, each worker creates an independent model and make predictions separately. A better solution might be that workers on the same node share some common models. The potential benefits are :
- Fewer models to update once received the LoadParamsMsg
- Batch evaluation (on GPU) will be much faster
- This module will be useful to any general Flux models (especially MCTS and DeepCFR related algorithms in RL)