Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor to use tyro #424

Merged
merged 44 commits into from
Nov 28, 2023
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
44 commits
Select commit Hold shift + click to select a range
cd4851e
Refactor to use tyro
vwxyzjn Oct 16, 2023
b97d54f
push
vwxyzjn Oct 16, 2023
b87a015
psuh
vwxyzjn Oct 16, 2023
896f346
refactor
vwxyzjn Oct 16, 2023
6220645
fix pre-commit
vwxyzjn Oct 16, 2023
adbf836
fix pre-commit
vwxyzjn Oct 16, 2023
8af1e13
fix commend
vwxyzjn Oct 16, 2023
0b61550
Merge branch 'master' into refactor-tyro
sdpkjc Oct 16, 2023
96a56b8
refactor
vwxyzjn Oct 16, 2023
a8795a9
Merge branch 'refactor-tyro' of https://github.com/vwxyzjn/cleanrl in…
vwxyzjn Oct 16, 2023
cb6b47a
update poetry
vwxyzjn Oct 16, 2023
cfeedb0
fix test case
vwxyzjn Oct 16, 2023
9c0959c
quick fix
vwxyzjn Oct 16, 2023
5f3f716
fix
vwxyzjn Oct 17, 2023
08f4392
update optuna
vwxyzjn Oct 17, 2023
de6c829
quick change
vwxyzjn Oct 17, 2023
b09e088
fix ppg
vwxyzjn Oct 17, 2023
e92cf57
quick fix
vwxyzjn Oct 17, 2023
57b05fb
fix optuna
vwxyzjn Oct 17, 2023
17f49db
quick change
vwxyzjn Oct 17, 2023
cbbdc8b
fix
vwxyzjn Oct 17, 2023
e69b317
quick change
vwxyzjn Oct 17, 2023
f83a218
quick change
vwxyzjn Oct 17, 2023
86e6275
fix bug in multi-gpu
vwxyzjn Nov 8, 2023
bf5368a
refactor benchmark, support slurm
vwxyzjn Nov 8, 2023
aec360b
remove mujoco_py stuff
vwxyzjn Nov 9, 2023
46efc25
add slurm template
vwxyzjn Nov 9, 2023
072eafb
pre-commit
vwxyzjn Nov 9, 2023
b2542e0
update ddpg docs
vwxyzjn Nov 9, 2023
33a5609
update td3 docs
vwxyzjn Nov 9, 2023
4d8c3da
update sac
vwxyzjn Nov 9, 2023
70702cf
bug fix
vwxyzjn Nov 13, 2023
4c09502
Merge branch 'refactor-tyro' of https://github.com/vwxyzjn/cleanrl in…
vwxyzjn Nov 13, 2023
60b71f7
update docs
vwxyzjn Nov 27, 2023
7a96de2
update ppo docs
vwxyzjn Nov 27, 2023
89846df
bump version
vwxyzjn Nov 27, 2023
4f0dc48
bump version
vwxyzjn Nov 27, 2023
d821748
bump test cases
vwxyzjn Nov 27, 2023
7880155
add benchmark utility docs
vwxyzjn Nov 27, 2023
50ec155
bump test
vwxyzjn Nov 27, 2023
940595a
fix #418
vwxyzjn Nov 27, 2023
b0caf45
update requirements.txt
vwxyzjn Nov 27, 2023
aaf7dd0
test
vwxyzjn Nov 27, 2023
2fb4814
add numpy
vwxyzjn Nov 28, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
quick fix
  • Loading branch information
vwxyzjn committed Oct 17, 2023
commit e92cf572d29f905b64f9faad6a5be89eb45731e1
2 changes: 0 additions & 2 deletions cleanrl/ppg_procgen.py
Original file line number Diff line number Diff line change
Expand Up @@ -218,8 +218,6 @@ def get_pi(self, x):
args.num_iterations = args.total_timesteps // args.batch_size
args.num_phases = int(args.num_iterations // args.n_iteration)
args.aux_batch_rollouts = int(args.num_envs * args.n_iteration)
from rich.pretty import pprint
pprint(args)
assert args.v_value == 1, "Multiple value epoch (v_value != 1) is not supported yet"
run_name = f"{args.env_id}__{args.exp_name}__{args.seed}__{int(time.time())}"
if args.track:
Expand Down
2 changes: 1 addition & 1 deletion cleanrl/ppo_rnd_envpool.py
Original file line number Diff line number Diff line change
@@ -1,9 +1,9 @@
# docs and experiment results can be found at https://docs.cleanrl.dev/rl-algorithms/ppo-rnd/#ppo_rnd_envpoolpy
from dataclasses import dataclass
import os
import random
import time
from collections import deque
from dataclasses import dataclass

import envpool
import gym
Expand Down
4 changes: 2 additions & 2 deletions requirements/requirements-optuna.txt
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ certifi==2023.5.7 ; python_full_version >= "3.7.1" and python_version < "3.11"
charset-normalizer==3.1.0 ; python_full_version >= "3.7.1" and python_version < "3.11"
click==8.1.3 ; python_full_version >= "3.7.1" and python_version < "3.11"
cloudpickle==2.2.1 ; python_full_version >= "3.7.1" and python_version < "3.11"
cmaes==0.9.1 ; python_full_version >= "3.7.1" and python_version < "3.11"
cmaes==0.10.0 ; python_full_version >= "3.7.1" and python_version < "3.11"
colorama==0.4.4 ; python_full_version >= "3.7.1" and python_version < "3.11"
colorlog==6.7.0 ; python_full_version >= "3.7.1" and python_version < "3.11"
commonmark==0.9.1 ; python_full_version >= "3.7.1" and python_version < "3.11"
Expand Down Expand Up @@ -45,7 +45,7 @@ moviepy==1.0.3 ; python_full_version >= "3.7.1" and python_version < "3.11"
numpy==1.21.6 ; python_full_version >= "3.7.1" and python_version < "3.11"
oauthlib==3.2.2 ; python_full_version >= "3.7.1" and python_version < "3.11"
optuna-dashboard==0.7.3 ; python_full_version >= "3.7.1" and python_version < "3.11"
optuna==3.1.1 ; python_full_version >= "3.7.1" and python_version < "3.11"
optuna==3.3.0 ; python_full_version >= "3.7.1" and python_version < "3.11"
packaging==23.1 ; python_full_version >= "3.7.1" and python_version < "3.11"
pandas==1.3.5 ; python_full_version >= "3.7.1" and python_version < "3.11"
pathtools==0.1.2 ; python_full_version >= "3.7.1" and python_version < "3.11"
Expand Down
1 change: 0 additions & 1 deletion tests/test_tuner.py
Original file line number Diff line number Diff line change
Expand Up @@ -31,4 +31,3 @@ def test_tuner():
num_trials=1,
num_seeds=1,
)