As mentioned in JuliaReinforcementLearning/ReinforcementLearningZoo.jl#93 (comment), I'd like to write down some thoughts regarding the network handling in this framework. Maybe this is also relevant to https://github.com/JuliaReinforcementLearning/ReinforcementLearningCore.jl.
- I would like to have a small collection of commonly employed network styles in RL, such as a `GaussianNetwork` (used in VPG, PPO, and SAC) or a twin Q network (as in TD3 and SAC). These could then be enhanced with basic structural integrity asserts (e.g. that the output sizes of the mu and sigma layers are identical) or with convenience functions (e.g. returning a test or train action from a Gaussian network); see the first sketch after this list.
- I'm really unhappy with the definition of target networks. At the moment, these networks are commonly defined as `NeuralNetworkApproximator`s including a dedicated optimizer, even though they are never directly trained on. Maybe it would make sense to implement a `TargetNetwork` struct which can be constructed by just passing the original network to it and which offers functions for e.g. Polyak averaging or hard updates (recommended in some MuJoCo environments); see the second sketch after this list. I have never seen an implementation in which target networks differ from their source ones...
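To make the first point more concrete, here is a minimal sketch of what such a `GaussianNetwork` could look like, assuming plain Flux chains. All the names here (`GaussianNetwork`, `test_action`, `train_action`, the `state_dim` argument) are just illustrative, not anything that exists in the package today:

```julia
using Flux

# Proposed GaussianNetwork; a sketch, not an existing API of this package.
struct GaussianNetwork{P,M,S}
    pre::P       # shared trunk
    mu::M        # mean head
    logsigma::S  # log standard deviation head
end

Flux.@functor GaussianNetwork

# Constructor with a basic structural-integrity assert: feeding a dummy
# state through both heads must yield identically sized outputs.
function GaussianNetwork(pre, mu, logsigma, state_dim::Int)
    h = pre(randn(Float32, state_dim, 1))
    @assert size(mu(h)) == size(logsigma(h)) "mu and sigma heads must have identical output sizes"
    GaussianNetwork(pre, mu, logsigma)
end

# Deterministic "test" action: just the mean of the Gaussian.
test_action(n::GaussianNetwork, s) = n.mu(n.pre(s))

# Stochastic "train" action via the reparameterization trick.
function train_action(n::GaussianNetwork, s)
    h = n.pre(s)
    m, sigma = n.mu(h), exp.(n.logsigma(h))
    m .+ sigma .* randn(Float32, size(m))
end

# Usage:
net = GaussianNetwork(
    Chain(Dense(4, 64, relu)),  # trunk
    Dense(64, 2),               # mu head
    Dense(64, 2),               # log sigma head
    4,                          # state dimension, only used for the assert
)
a = train_action(net, rand(Float32, 4, 1))
```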
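And a minimal sketch of the `TargetNetwork` idea, again with assumed names (`TargetNetwork`, `soft_update!`, `hard_update!`): the wrapper holds a copy of the source network, carries no optimizer of its own, and is only ever updated via Polyak averaging or hard copies:

```julia
using Flux

# Proposed TargetNetwork wrapper; a sketch, not an existing API.
struct TargetNetwork{N}
    source::N
    target::N
end

# Construct by passing only the original network; the target starts
# out as an exact copy.
TargetNetwork(source) = TargetNetwork(source, deepcopy(source))

# Polyak averaging ("soft" update): target <- (1 - tau) * target + tau * source.
function soft_update!(tn::TargetNetwork; tau = 0.005f0)
    for (dest, src) in zip(Flux.params(tn.target), Flux.params(tn.source))
        dest .= (1 - tau) .* dest .+ tau .* src
    end
end

# Hard update: copy the source parameters verbatim, e.g. every fixed
# number of steps (recommended in some MuJoCo environments).
function hard_update!(tn::TargetNetwork)
    Flux.loadparams!(tn.target, Flux.params(tn.source))
end

# Usage:
q  = Chain(Dense(4, 64, relu), Dense(64, 2))
tq = TargetNetwork(q)
soft_update!(tq; tau = 0.01f0)  # after each gradient step
hard_update!(tq)                # or periodically, all at once
```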
I'm not sure if it would be reasonable to implement these changes in ReinforcementLearningCore.jl or ReinforcementLearningZoo.jl, as they are very DRL-related.
Any thoughts on this?