
Refactor replay based scripts #173

Merged

vwxyzjn merged 12 commits into master from refactor-replay-based-scripts

May 9, 2022

Owner

vwxyzjn commented Apr 24, 2022

Description

This PR closes #171, closes #172, closes #168, and closes #148.

Types of changes

Bug fix

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vwxyzjn added 2 commits

April 24, 2022 19:29


          Fix the seed issue: see #171

5450c83


          Quick fix

2c7cc44



          log episodic_length

d9cbf70



          Fix #172

cf17e2e



          Fix #148 and #172-style problem for SAC

dcb185c


vwxyzjn mentioned this pull request

Wrong direction in readme.md file #176

Closed


          Add benchmark scripts

4af7335



          add sac script

83060a7



          Removes gradient clipping reference

289874b



          Merge branch 'master' into refactor-replay-based-scripts

2e2dc9c


vwxyzjn added 2 commits

April 29, 2022 21:49


          use the latest reproduction script

64a89f8


          Remove past reproducibility script

02ab41e


Owner Author

vwxyzjn commented May 2, 2022

Here are the benchmark results. DQN in this PR performs better on Atari games and slightly worse on MountainCar-v0. SAC in this PR gets a performance boost.

Given these results, I recommend we merge this PR. This PR obtains overall better performance and removes an unverified code-level optimization: gradient norm clipping for DQN.
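For readers unfamiliar with the optimization being removed, gradient norm clipping rescales all gradients so that their global L2 norm does not exceed a threshold. The following is a minimal plain-Python sketch of that idea, not the code from this PR: the function name `clip_grad_norm`, the `eps` value, and the toy gradients are illustrative assumptions. It roughly mirrors what `nn.utils.clip_grad_norm_` (referenced in #148) computes.

```python
import math


def clip_grad_norm(grads, max_norm, eps=1e-6):
    """Hypothetical helper: scale gradient vectors so their global L2 norm
    is at most max_norm. Returns (clipped_grads, norm_before_clipping)."""
    # Global L2 norm over every element of every gradient vector.
    total_norm = math.sqrt(sum(g * g for vec in grads for g in vec))
    # Scale factor is capped at 1.0, so small gradients are left unchanged.
    scale = min(1.0, max_norm / (total_norm + eps))
    clipped = [[g * scale for g in vec] for vec in grads]
    return clipped, total_norm


# Toy gradients (assumed values) with global norm sqrt(9 + 16 + 144) = 13.
grads = [[3.0, 4.0], [0.0, 12.0]]
clipped, norm_before = clip_grad_norm(grads, max_norm=1.0)
norm_after = math.sqrt(sum(g * g for vec in clipped for g in vec))

# Gradients already below the threshold pass through untouched.
small, small_norm = clip_grad_norm([[0.3, 0.4]], max_norm=1.0)
```

Because clipping changes the effective step size whenever gradients are large, it can affect learning curves, which is why an unbenchmarked use of it counts as a code-level optimization worth verifying.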

@dosssman and @yooceii, do the results from this PR make sense to you? If so, I will update the docs and then remove the old experiments. I think after this we would be ready for the 1.0 release.

CC @araffin who might be interested in this :)

[Benchmark plots: Atari games, Classic control, MuJoCo]

vwxyzjn mentioned this pull request

Investigate nn.utils.clip_grad_norm_ for DQN, DDPG, and TD3 #148

Closed

3 tasks

vwxyzjn requested review from dosssman and yooceii

May 2, 2022 03:39

vwxyzjn marked this pull request as ready for review

May 2, 2022 03:39

yooceii approved these changes


dosssman approved these changes


Collaborator

dosssman left a comment

Looking good on my side.
Thanks for the great work.


          update documentation

0a247b6


Owner Author

vwxyzjn commented May 9, 2022

Merging now.

vwxyzjn merged commit 714e786 into master

vwxyzjn mentioned this pull request

Investigate DQN's regression in MountainCar-v0 #156

Closed
