dead_agent Command tracker: python train.py --algo dqn --env gridworld Default network architecture in policies.py -> ActorCriticPolicy (PPO) or