Skip to content

[Algorithm] Td3 #684

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 37 commits into from
Jan 17, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
e8d17fa
update td3
BY571 Nov 17, 2022
75e50c1
solve merge conflicsts and fix next observation vector problem in ini…
BY571 Nov 17, 2022
a6c375d
update default config for new tests
BY571 Nov 17, 2022
fc8f3cf
add exploration sigma check
BY571 Nov 17, 2022
29a169d
simplifiy td3 objective
BY571 Nov 17, 2022
e37ed00
set default td3 config 2step return
BY571 Nov 17, 2022
a348460
fix adapt config
BY571 Nov 17, 2022
ecd0229
take off not used imports
BY571 Nov 17, 2022
afbe84e
take off not used imports
BY571 Nov 17, 2022
9e3c3ba
fix typing error
BY571 Nov 17, 2022
ee6ef03
update td3 model description, td3 config and objective actorloss problem
BY571 Nov 18, 2022
31255c8
fix next_ get_stants_random_rollout bug
BY571 Nov 18, 2022
e155dd3
flake8 fixes
BY571 Nov 18, 2022
30a677b
formatting changes
BY571 Nov 18, 2022
22813f5
fixes flake8
BY571 Nov 18, 2022
604e072
update format
BY571 Nov 22, 2022
29ebe01
format fix
BY571 Nov 22, 2022
b70b96b
fix
BY571 Nov 22, 2022
7239e11
formatting fix
BY571 Nov 22, 2022
608dd53
format fixes
BY571 Nov 22, 2022
cdfc04b
formatting fix
BY571 Nov 22, 2022
b08136f
test fmt skip
BY571 Nov 22, 2022
9fb91d6
take off skip ufmt
BY571 Nov 22, 2022
101b2e6
remove logging file
BY571 Nov 24, 2022
dea49c4
formatting fix
BY571 Nov 24, 2022
5690548
update config order, td error, qloss sum
BY571 Dec 2, 2022
6d60d75
merge main into branch
BY571 Dec 28, 2022
a01b64b
take off td3 helper functions
BY571 Jan 2, 2023
4f1c0e0
update td3 objective
BY571 Jan 2, 2023
fc8fdd3
Merge branch 'main' into td3
BY571 Jan 2, 2023
dd72e06
cpu test verified
BY571 Jan 3, 2023
76b1e19
Merge branch 'main' into td3
BY571 Jan 3, 2023
50e3f06
fix flake8
BY571 Jan 3, 2023
7839273
update objectives and test
BY571 Jan 3, 2023
2eec084
update cfg
BY571 Jan 4, 2023
f7c315d
td3_loss_test policy_delay update
BY571 Jan 4, 2023
5d1b995
take off not needed actorcritic wrapper
BY571 Jan 4, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
58 changes: 58 additions & 0 deletions examples/td3/config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,58 @@
# Environment
env_name: HalfCheetah-v4
env_task: ""
exp_name: "debugging"
env_library: gym
record_video: 0
normalize_rewards_online: 0
normalize_rewards_online_scale: 5
normalize_rewards_online_decay: 0.99
total_frames: 1000000
frames_per_batch: 1000
max_frames_per_traj: 1000
frame_skip: 1
from_pixels: 0
seed: 0

# Collection
init_random_frames: 25000
init_env_steps: 10000
record_interval: 10
record_frames: 10000
async_collection: 1
#collector_devices: [cuda:1,cuda:1,cuda:1,cuda:1]
collector_devices: [cpu] # ,cpu,cpu,cpu]
env_per_collector: 1
num_workers: 1

# Replay Buffer
buffer_size: 1000000

# Optimization
utd_ratio: 1.0
gamma: 0.99
loss: double
loss_function: smooth_l1
lr: 3e-4
weight_decay: 0.0
lr_scheduler: ""
optim_steps_per_batch: 128
batch_size: 256
target_update_polyak: 0.995

# Algorithm
prb: 0 # use prioritized experience replay
policy_update_delay: 2
multi_step: 0
n_steps_return: 1
activation: relu
gSDE: 0

# Logging
logger: wandb

# Extra
batch_transform: 1
buffer_prefetch: 64
norm_stats: 1
device: "cpu"
Loading