Description
Per discussion with several users, there are some possible renames we could do to clarify the internal architecture:
- Rename `rllib.evaluation.PolicyGraph` to `rllib.policy.Policy`. Similarly, `rllib.evaluation.TFPolicyGraph` becomes `rllib.policy.TFPolicy`, and so on. The files are moved to a new `rllib/policy` dir.
- Move the `rllib/agents` directory to `rllib/train`.
- Add TF qualifier for consistency with Torch policies:
  - `A3CPolicyGraph` => `A3CTFPolicy`
  - `A3CTorchPolicyGraph` => `A3CTorchPolicy`
- Rename `PolicyEvaluator` to `RolloutWorker`:
  - deprecate `compute_gradients()` / `apply_gradients()`
- Consolidate `[local_evaluator, remote_evaluators]` to a single `WorkerSet` object, `workers = WorkerSet(...)`:
  - `local_evaluators` => `workers.local_worker()`
  - `remote_evaluators` => `workers.remote_workers()`
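To make the `WorkerSet` consolidation concrete, here is a minimal sketch of the shape such an object could take. It is only an illustration: the constructor arguments (`make_worker`, `num_workers`) and the `worker_index` attribute are assumptions made for this example, and the remote workers are plain objects here rather than Ray actors.

```python
from typing import Callable, List


class RolloutWorker:
    """Hypothetical stand-in for the renamed PolicyEvaluator."""

    def __init__(self, worker_index: int):
        self.worker_index = worker_index


class WorkerSet:
    """Illustrative container grouping one local worker with N remote workers."""

    def __init__(self, make_worker: Callable[[int], RolloutWorker], num_workers: int):
        # Assumption for this sketch: index 0 is the local worker,
        # indices 1..num_workers are the remote ones.
        self._local_worker = make_worker(0)
        self._remote_workers = [make_worker(i + 1) for i in range(num_workers)]

    def local_worker(self) -> RolloutWorker:
        return self._local_worker

    def remote_workers(self) -> List[RolloutWorker]:
        return self._remote_workers


# Call sites that used to pass two separate arguments now pass one object.
workers = WorkerSet(RolloutWorker, num_workers=2)
```

With this shape, code that previously took `local_evaluator` and `remote_evaluators` as separate parameters would take a single `workers` argument and call the two accessors as needed.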
For the most part, I think we can do these renames without breaking backwards compatibility by leaving aliases behind. I'm not sure how easy it is to move an entire directory, though.
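One way the "leave aliases behind" idea could look for a single class is sketched below. This is not the proposed implementation: `deprecated_alias` is a hypothetical helper written for this example, and `Policy` is a local stand-in rather than the real RLlib class.

```python
import warnings


class Policy:
    """Hypothetical stand-in for the renamed rllib.policy.Policy."""

    def __init__(self, name: str = "default"):
        self.name = name


def deprecated_alias(new_cls, old_name):
    """Return a subclass of new_cls that warns when instantiated under old_name."""

    class _Alias(new_cls):
        def __init__(self, *args, **kwargs):
            warnings.warn(
                f"{old_name} is deprecated; use {new_cls.__name__} instead.",
                DeprecationWarning,
                stacklevel=2,
            )
            super().__init__(*args, **kwargs)

    # Keep the old name visible in reprs and tracebacks.
    _Alias.__name__ = old_name
    _Alias.__qualname__ = old_name
    return _Alias


# Old name keeps working, but emits a DeprecationWarning on construction.
PolicyGraph = deprecated_alias(Policy, "PolicyGraph")
```

The trade-off in this sketch is that the alias is a subclass, so `isinstance(Policy(), PolicyGraph)` is false. A plain `PolicyGraph = Policy` assignment keeps both names fully interchangeable but cannot warn. A moved directory would additionally need something left at the old import path that re-exports the new names, which this example does not cover.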