Integrating hydra with DQN #201
Conversation
examples/ddpg/ddpg.py (Outdated)

    actor_model_explore = model[0]
    if args.ou_exploration:
    if args.gSDE:
    if cfg.ou_exploration:
Can we delete ou_exploration and gSDE from DQN? They make no sense here. Ideally they shouldn't even be in the config.
…v_dqn

# Conflicts:
#	examples/ddpg/ddpg.py
#	examples/dqn/dqn.py
#	examples/redq/redq.py
#	examples/sac/sac.py
@@ -10,7 +10,6 @@

    import hydra
I'm not sure where those changes in PPO come from in this PR.
torchrl/trainers/helpers/models.py (Outdated)

@@ -1351,3 +1351,5 @@ class DiscreteModelConfig:
    # whether a distributional loss should be used.
    atoms: int = 51
    # number of atoms used for the distributional loss
    gSDE: bool = False
Yeah let's not do that :) No gSDE for discrete actions
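For reference, dropping the flag leaves the discrete model config with only the distributional-loss fields visible in the diff above. A minimal sketch as a plain dataclass (the `distributional` field is an assumption inferred from the "whether a distributional loss should be used" comment, not confirmed from the actual file):

```python
from dataclasses import dataclass

# Sketch of DiscreteModelConfig without the gSDE flag.
# `atoms` and its comment come from the diff; `distributional` is assumed.
@dataclass
class DiscreteModelConfig:
    distributional: bool = False
    # whether a distributional loss should be used.
    atoms: int = 51
    # number of atoms used for the distributional loss
```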
I removed the epsilon greedy wrapper, which removed the error with action_value being None. Is this OK?
Let's see what we can do about gSDE
EGreedyWrapper should still be used
examples/dqn/dqn.py (Outdated)

@@ -103,17 +101,8 @@ def main(cfg: "DictConfig"):
    )

    loss_module, target_net_updater = make_dqn_loss(model, cfg)
    model_explore = EGreedyWrapper(model, annealing_num_steps=cfg.annealing_frames).to(
I think we still want that; epsilon-greedy is the typical exploration technique for DQN.
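For context, the wrapper referenced in the diff performs linearly-annealed epsilon-greedy action selection. A dependency-free sketch of the idea (illustrative only, not TorchRL's actual EGreedyWrapper implementation; function and parameter names are assumptions):

```python
import random

def epsilon_greedy(q_values, step, annealing_num_steps,
                   eps_start=1.0, eps_end=0.05):
    """Return (action, eps): a random action with probability eps,
    otherwise the greedy (argmax-Q) action. Epsilon is linearly
    annealed from eps_start to eps_end over annealing_num_steps."""
    frac = min(step / annealing_num_steps, 1.0)
    eps = eps_start + frac * (eps_end - eps_start)
    if random.random() < eps:
        # explore: uniform random action
        return random.randrange(len(q_values)), eps
    # exploit: index of the largest Q-value
    return max(range(len(q_values)), key=lambda a: q_values[a]), eps
```

Setting `eps_start == eps_end == 0.0` makes the policy purely greedy, which is one way to see why removing the wrapper can silently change behavior rather than just silencing the `action_value is None` error.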
@@ -124,7 +113,7 @@ def main(cfg: "DictConfig"):

    collector = make_collector_offpolicy(
        make_env=create_env_fn,
        actor_model_explore=model_explore,
Let's keep model_explore.
Integrated hydra with DQN using structured configs.
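At a high level, Hydra's structured configs are nested dataclasses handed to the entry point as one config object. A stdlib-only sketch of that composition pattern (the real example uses `@hydra.main` and receives a `DictConfig`; group and field names here are illustrative, not the exact ones in examples/dqn/dqn.py):

```python
from dataclasses import dataclass, field, asdict

# Each concern gets its own config group; names below are illustrative.
@dataclass
class ModelConfig:
    distributional: bool = False
    atoms: int = 51

@dataclass
class CollectorConfig:
    total_frames: int = 1_000_000
    annealing_frames: int = 250_000

@dataclass
class DQNConfig:
    model: ModelConfig = field(default_factory=ModelConfig)
    collector: CollectorConfig = field(default_factory=CollectorConfig)

def main(cfg: DQNConfig) -> dict:
    # With Hydra this function would be decorated with @hydra.main and
    # receive a DictConfig; here we only show the shape of the config.
    return asdict(cfg)
```

The benefit over a flat argparse namespace is that unrelated flags (e.g. continuous-control exploration options) can be kept out of the DQN config group entirely, which is the cleanup requested in the comments above.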