PPO
PPO-Continuous
IMPALA
V-MPO
SAC
SAC-Continuous
Discrete Learning environment is configured to CartPole-v1.
Continuous Learning environment is configured to MountainCarContinuous-v0.
You should check machines.json, parameters.json for architecture and training parameters.
python run.py
