forked from vwxyzjn/cleanrl
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
PPO + JAX + EnvPool + Atari (vwxyzjn#227)
* PPO + jax + envpool + atari * fix bug: only report metric when lifes are used up * pre-commit * quick fix * Quick refactor * push changes * pre-commit and use EnvPool's new API * update envpool * update docs * update ppo benchmark script * update docs * use the latest envpool interface * update envpool to the latest version * update pyproject.toml * update lock files * Quick clarification * Update docs * remove non benchmarked script * update docs * revert poetry changes * docs fix * remove uncessary code, add docs * add a note one envpool * update test cases * explain `get_action_and_value` * fix indent * Fix weird error with `np.mean`. See below: We got this message. See vwxyzjn#227 (comment) ``` NotImplementedError: Got <class 'jaxlib.xla_extension.DeviceArray'>, but numpy array, torch tensor, or caffe2 blob name are expected. ``` * update docs * pre-commit * add note on `charts/avg_episodic_return` * update reproducibility script * add note on value function clipping
- Loading branch information
Showing
22 changed files
with
157,414 additions
and
32 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.