[Feature] VecNormV2 #2867

Merged
merged 10 commits on Mar 25, 2025
Conversation

vmoens
Collaborator

@vmoens vmoens commented Mar 23, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot bot commented Mar 23, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2867

Note: Links to docs will display an error until the docs builds have been completed.

❌ 14 New Failures, 8 Unrelated Failures

As of commit f1f373f with merge base 7df8317 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens pushed a commit that referenced this pull request Mar 23, 2025
ghstack-source-id: 5dd628a
Pull Request resolved: #2867
@facebook-github-bot facebook-github-bot added the CLA Signed label Mar 23, 2025
@vmoens vmoens added the enhancement label Mar 23, 2025
@vmoens
Collaborator Author

vmoens commented Mar 23, 2025

There is one noticeable difference from the first version: here we first normalize and then update the statistics, whereas the other version first updates and then normalizes.
I think it's better to use an unbiased estimate, but I'm open to aligning with the previous version.
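
To make the ordering difference concrete, here is a minimal sketch in plain Python. It is not the TorchRL implementation: `RunningStats`, `normalize_then_update`, and `update_then_normalize` are hypothetical names introduced only for illustration, and the running statistics use a scalar Welford update with an assumed `eps` for numerical stability.

```python
import math


class RunningStats:
    """Scalar running mean/variance (hypothetical helper, not TorchRL's API)."""

    def __init__(self, eps=1e-4):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean
        self.eps = eps  # assumed stabilizer, chosen arbitrarily for this sketch

    def update(self, x):
        # Welford's online update of mean and squared deviations.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    def normalize(self, x):
        if self.count == 0:
            return x  # no statistics yet: pass the sample through unchanged
        var = self.m2 / self.count
        return (x - self.mean) / math.sqrt(var + self.eps)


def normalize_then_update(stats, x):
    # Ordering described for this PR: x is scaled with statistics
    # that do not yet include x itself.
    y = stats.normalize(x)
    stats.update(x)
    return y


def update_then_normalize(stats, x):
    # Ordering described for the first version: x contributes to the
    # statistics that are then used to scale it.
    stats.update(x)
    return stats.normalize(x)
```

With the second ordering, the very first sample always maps to zero because it is its own mean; with the first ordering, a sample never influences its own normalization. Both orderings end up with the same accumulated statistics after seeing the same data.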

List of TODOs:

  • Better coverage of the stateless mode
  • Warning about the deprecation of v1
  • Docstrings
  • Add to the docs
  • Test collisions between reward and observation specs, especially if two VecNorm transforms are stacked

I guess that long-term we should think about collectives for distributed settings, but I'll leave this for enthusiastic young minds.

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 81e361a
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: bccd570
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 3399da3
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: 880f0d4
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 24, 2025
ghstack-source-id: f32a44c
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 641379d
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: d15c79c
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: dc96a8a
Pull Request resolved: #2867
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 639d07f
Pull Request resolved: #2867
@vmoens vmoens merged commit f1f373f into gh/vmoens/125/base Mar 25, 2025
48 of 70 checks passed
vmoens pushed a commit that referenced this pull request Mar 25, 2025
ghstack-source-id: 639d07f
Pull Request resolved: #2867
@vmoens vmoens deleted the gh/vmoens/125/head branch March 25, 2025 21:05

2 participants