[BugFix] Restore missing keys in data collector output #521

tcbegley · 2022-10-05T16:48:32Z

Description

This PR fixes a bug with data collectors that saw some keys being omitted from the output when e.g. random actions were applied during initialisation.

Data collectors will now attempt to determine keys from the policy spec if it is supplied and complete, otherwise they will perform a short rollout with the policy to determine which keys will be returned.

Motivation and Context

Closes #505

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Bug fix (non-breaking change which fixes an issue)

Checklist

I still need to add a test, currently seeing an error on one particular test I don't understand.

I have read the CONTRIBUTION guide (required)
I have updated the tests accordingly (required for a bug fix or a new feature).

tcbegley

I've submitted as a draft because I'm expecting CI to fail on test/test_collectors:test_update_weights.

I've been having a bit of trouble pinning down the issue because it originates in a sub-process, but it seems to be related to

TypeError: no implementation found for 'torch.nn.linear' on types that implement __torch_function__: [<class 'torchrl.data.tensordict.tensordict.TensorDict'>]

The test defines policy = torch.nn.Linear(3, 4).cuda(1),and it seems the problem arises during env.rollout(3, policy) used to determine the keys for _tensordict_out because torch doesn't know how to handle

tensordict = policy(tensordict)

Does the policy need to be wrapped first or something before doing env.rollout?

torchrl/collectors/collectors.py

tcbegley · 2022-10-06T08:34:35Z

Weirdly test I was expecting to fail is passing after all... Could be a quirk of my dev environment? Happy to look into that more if you like, but since the tests are passing in CI perhaps this can be reviewed as is.

vmoens · 2022-10-06T11:27:33Z

Weirdly test I was expecting to fail is passing after all... Could be a quirk of my dev environment? Happy to look into that more if you like, but since the tests are passing in CI perhaps this can be reviewed as is.

Is it the TypeError error above?

tcbegley · 2022-10-06T11:30:06Z

Is it the TypeError error above?

Yes, that's the one I saw running the tests myself but hasn't failed in CircleCI checks.

torchrl/collectors/collectors.py

test/test_collector.py

torchrl/collectors/collectors.py

vmoens · 2022-10-07T14:20:36Z

torchrl/collectors/collectors.py

-            batch_size=[*self.env.batch_size, self.frames_per_batch],
-            device=self.passing_device,
+
+        # TODO: perhaps check type of policy and raise TypeError if something


we can remove this comment
Long term plan is to support any kind of policy so i'm happy with the state of the checks

torchrl/collectors/collectors.py

vmoens

LGTM thanks for this contribuiton!

codecov · 2022-10-10T08:14:13Z

Codecov Report

Merging #521 (4059df4) into main (bd0120e) will increase coverage by 0.16%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #521      +/-   ##
==========================================
+ Coverage   86.57%   86.73%   +0.16%     
==========================================
  Files         121      121              
  Lines       21632    21680      +48     
==========================================
+ Hits        18727    18804      +77     
+ Misses       2905     2876      -29

Flag	Coverage Δ
linux-cpu	`85.10% <90.19%> (?)`
linux-gpu	`86.55% <100.00%> (+0.16%)`	⬆️
linux-outdeps-gpu	`77.86% <49.01%> (+0.09%)`	⬆️
linux-stable-cpu	`85.08% <90.19%> (+0.02%)`	⬆️
linux-stable-gpu	`86.55% <100.00%> (+0.16%)`	⬆️
macos-cpu	`84.86% <90.19%> (+0.02%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
test/test_collector.py	`96.67% <100.00%> (+6.49%)`	⬆️
torchrl/collectors/collectors.py	`68.96% <100.00%> (+1.69%)`	⬆️
torchrl/data/tensordict/tensordict.py	`82.47% <0.00%> (-0.10%)`	⬇️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

* [BugFix] Transformed ParallelEnv meta data are broken when passing to device (#531) * [Doc] Add coverage banner (#533) * add orb decov to circleci config.yml * Add codecov badge to Readme * Revert "[BugFix] Changing the dm_control import to fail if not installed (#515)" This reverts commit d194735. * codecov coverage w/o orb in circleci * Revert "Revert "[BugFix] Changing the dm_control import to fail if not installed (#515)"" This reverts commit d0dc7de. * [CI] generation of coverage reports (#534) * update test scripts to add coverage * update test scripts to add coverage Co-authored-by: Silvestre Bahi <silvestrebahi@fb.com> * [CI] Add xml coverage reports for codecov (#537) * update test scripts to add coverage * update test scripts to add coverage * generate xml file for coverage * Update run_test.sh lint end of file * Update run_test.sh lint end of file * Update run_test.sh lint end of file Co-authored-by: Silvestre Bahi <silvestrebahi@fb.com> * permissions * permissions Co-authored-by: Silvestre Bahi <silvestrebahi@fb.com> Co-authored-by: silvestrebahi <silvestre.bahi@gmail.com> * [BugFix] Fix colab link of coding_dqn.ipynb (#543) * [BugFix] Fix optional imports (#535) * [BugFix] Restore missing keys in data collector output (#521) * Ensure data collectors return all expected keys * Rerun CI * Add tests * Format code * correct unreachable test * Fix broken test * WIP: fix initialisation with policy + test * Fix initialisation with policy + test * Reset env after rollout initialisation * fix build from spec * Check policy has spec attribute before accessing * Address comments Co-authored-by: vmoens <vincentmoens@gmail.com> * [Lint] reorganize imports (#545) [Lint] reorganize imports * [BugFix] Single-cpu compatibility (#548) * [BugFix] vision install and other deps in optdeps (#552) * init * amend * amend * amend * [Feature] Implemented device argument for modules.models (#524) Co-authored-by: Yu Shiyang <yushiyang@fb.com> * [BugFix] Fix ellipsis indexing of 2d TensorDicts (#559) * [BugFix] Additive gaussian exploration spec fix (#560) * [BugFix] Disabling video step for wandb (#561) * [BugFix] Various device fix (#558) * [Feature] Allow collectors to accept regular modules as policies (#546) * [BugFix] Fix push binary nightly action (#566) Fix Push Binary Nightly Action for linux hosts Co-authored-by: Vincent Moens <vincentmoens@gmail.com> Co-authored-by: Silvestre Bahi <silvestrebahi@fb.com> Co-authored-by: silvestrebahi <silvestre.bahi@gmail.com> Co-authored-by: Bo Liu <benjaminliu.eecs@gmail.com> Co-authored-by: Tom Begley <tomcbegley@gmail.com> Co-authored-by: Alessandro Pietro Bardelli <apbard@users.noreply.github.com> Co-authored-by: Yu Shiyang <yushiyangk@users.noreply.github.com> Co-authored-by: Yu Shiyang <yushiyang@fb.com> Co-authored-by: Pavel Solikov <psolikov15@gmail.com>

Ensure data collectors return all expected keys

5d9d568

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 5, 2022

tcbegley commented Oct 5, 2022

View reviewed changes

torchrl/collectors/collectors.py Show resolved Hide resolved

Rerun CI

c9b0bf1

tcbegley marked this pull request as ready for review October 6, 2022 08:32

Add tests

116d904

Format code

992a5b2

vmoens and others added 2 commits October 6, 2022 12:57

correct unreachable test

e2016f2

Fix broken test

d104c1b

vmoens changed the title ~~Restore missing keys in data collector output~~ [BugFix] Restore missing keys in data collector output Oct 6, 2022

vmoens added the bug Something isn't working label Oct 6, 2022