Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor replay based scripts #173

Merged
merged 12 commits into from
May 9, 2022
Merged

Conversation

vwxyzjn
Copy link
Owner

@vwxyzjn vwxyzjn commented Apr 24, 2022

Description

This PR closes #171, closes #172, closes #168, and closes #148.

Types of changes

  • Bug fix

Checklist:

  • I've read the CONTRIBUTION guide (required).
  • I have ensured pre-commit run --all-files passes (required).
  • I have updated the documentation and previewed the changes via mkdocs serve.
  • I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

  • I have contacted @vwxyzjn to obtain access to the openrlbenchmark W&B team (required).
  • I have tracked applicable experiments in openrlbenchmark/cleanrl with --capture-video flag toggled on (required).
  • I have added additional documentation and previewed the changes via mkdocs serve.
    • I have explained note-worthy implementation details.
    • I have explained the logged metrics.
    • I have added links to the original paper and related papers (if applicable).
    • I have added links to the PR related to the algorithm.
    • I have created a table comparing my results against those from reputable sources (i.e., the original paper or other reference implementation).
    • I have added the learning curves (in PNG format with width=500 and height=300).
    • I have added links to the tracked experiments.
  • I have updated the tests accordingly (if applicable).

@vercel
Copy link

vercel bot commented Apr 24, 2022

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/7NBoCmqbCsrTeVtqKRpZUxAGFZ8N
✅ Preview: https://cleanrl-git-refactor-replay-based-scripts-vwxyzjn.vercel.app

@gitpod-io
Copy link

gitpod-io bot commented Apr 24, 2022

@vercel
Copy link

vercel bot commented Apr 29, 2022

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Updated
cleanrl ✅ Ready (Inspect) Visit Preview May 9, 2022 at 9:16PM (UTC)

@vwxyzjn
Copy link
Owner Author

vwxyzjn commented May 2, 2022

Here are the benchmarked results. Looks like DQN in this PR gets better performance in Atari games, slightly worse results in MountainCar-v0. SAC in this PR gets a performance boost.

Given these results, I recommend we merge this PR. This PR obtains overall better performance and removes an unverified code-level optimization: gradient norm clipping for DQN.

@dosssman and @yooceii, does the result from this PR make sense to you? If it does, I will make updates to the docs and ultimately remove the old experiments. I think after this we would be ready for the 1.0 release.

CC @araffin who might be interested in this :)

Atari games

BreakoutNoFrameskip-v4 PongNoFrameskip-v4 BeamRiderNoFrameskip-v4
BreakoutNoFrameskip-v4a PongNoFrameskip-v4a BeamRiderNoFrameskip-v4a

Classic control

CartPole-v1 Acrobot-v1 MountainCar-v0
CartPole-v1 (1) Acrobot-v1 (1) MountainCar-v0 (1)

MuJoCo

HalfCheetah-v2 Walker2d-v2 Hopper-v2
HalfCheetah-v2 (1) Walker2d-v2 (1) Hopper-v2 (1)
HalfCheetah-v2 (2) Walker2d-v2 (2) Hopper-v2

@vwxyzjn vwxyzjn requested review from dosssman and yooceii May 2, 2022 03:39
@vwxyzjn vwxyzjn marked this pull request as ready for review May 2, 2022 03:39
Copy link
Collaborator

@dosssman dosssman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looking good on my side.
Thanks for the great work.

@vwxyzjn
Copy link
Owner Author

vwxyzjn commented May 9, 2022

Merging now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants