Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] VecNormV2 #2867

Merged
merged 10 commits into from
Mar 25, 2025
Merged

[Feature] VecNormV2 #2867

merged 10 commits into from
Mar 25, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Mar 23, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Mar 23, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2867

Note: Links to docs will display an error until the docs builds have been completed.

❌ 14 New Failures, 8 Unrelated Failures

As of commit f1f373f with merge base 7df8317 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Mar 23, 2025
ghstack-source-id: 5dd628a583e6b10b71561090c09724ba1ae0aa65
Pull Request resolved: #2867
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 23, 2025
@vmoens vmoens added the enhancement New feature or request label Mar 23, 2025
@vmoens
Copy link
Contributor Author

vmoens commented Mar 23, 2025

There is a noticeable difference with the first version which is that here we first normalize then update, whereas the other version first updates then normalizes.
I think it's better to use an unbiased estimate but i'm open to align to the previous version.

List of TODOs:

  • Better coverage of stateless
  • warning about deprec of v1
  • docstrings
  • add to doc
  • test collision between reward and obs specs, esp if 2 vecnorm are stacked

I guess long-term we should think about collectives for distributed but i'll leave this for enthusiastic young minds

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 81e361a552abcc10be11e5a64a9b39db92032207
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: bccd570e410cd7c09af7e158e7418b3a59c75f48
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 3399da3ac1e6e1ce0f4fc090f371e1852fda9cfe
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 880f0d41808abd73c1c2e711aa8bf03fd95ae200
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: f32a44c3be87ff9ef97d0aac6ea31ba93c14c7af
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 641379d7c9d237492e4ec3262c239510983b1d74
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: d15c79c2109fa7aed02615cb1ba995a2d822f892
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: dc96a8a0666fb40e6f02b9f1ddcbeabb0ce2f016
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 639d07ff54be200d54621c2c4619ebd0d3d7d79e
Pull Request resolved: #2867
@vmoens vmoens merged commit f1f373f into gh/vmoens/125/base Mar 25, 2025
48 of 70 checks passed
vmoens added a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 639d07ff54be200d54621c2c4619ebd0d3d7d79e
Pull Request resolved: #2867
@vmoens vmoens deleted the gh/vmoens/125/head branch March 25, 2025 21:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants