Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AOS55/issue9 #12

Merged
merged 64 commits into from
Nov 23, 2022
Merged
Changes from 1 commit
Commits
Show all changes
64 commits
Select commit Hold shift + click to select a range
ee7a9c9
Procedure to collect data from teacher
AOS55 Sep 30, 2022
3d9080e
sample over multiple seeds
AOS55 Sep 30, 2022
7686169
push sampling to datasets directory when done and option selected in …
AOS55 Sep 30, 2022
87c4345
changed obs from state input
AOS55 Sep 30, 2022
77a94f3
corrected for state observation
AOS55 Sep 30, 2022
cae00b7
load snapshot correctly
AOS55 Oct 3, 2022
39194a5
check observation is initiated correctly
AOS55 Oct 3, 2022
fcf11ac
add procedure to sample from constraints and goal_states
AOS55 Oct 3, 2022
d759e28
added skill_dim parameter
AOS55 Oct 3, 2022
c8b9db0
added pixel train param
AOS55 Oct 3, 2022
243fb4a
added configs for sampling
AOS55 Oct 3, 2022
8658285
removed extra line break
AOS55 Oct 3, 2022
fd77b64
255 divide to scale obs correctly
AOS55 Oct 3, 2022
3aaaadd
instantiate safe_set loading correctly
AOS55 Oct 3, 2022
3dc1a46
change to procedure used to upload_data
AOS55 Oct 3, 2022
d96f309
Merge branch 'AOS55/issue9' of github.com:AOS55/url-suite into AOS55/…
AOS55 Oct 3, 2022
c90dade
Safe Learning
AOS55 Oct 3, 2022
54d292a
added random_start option to safe environments
AOS55 Oct 7, 2022
eeb7aa6
corrected transform dict length for replay buffer
AOS55 Oct 7, 2022
1f0774d
added random_start
AOS55 Oct 7, 2022
121365f
added seperate model dir if random for now
AOS55 Oct 7, 2022
527ba94
added seperate dir for restart for now
AOS55 Oct 7, 2022
b7e4ab9
changed to 150 data_counts
AOS55 Oct 25, 2022
5d0e845
refactored to reflect new prioritized sampling
AOS55 Oct 25, 2022
c7f000c
printed pretrained_agent name
AOS55 Oct 25, 2022
198d9af
added optimistic forgetting rule
AOS55 Oct 25, 2022
5bf7d0b
sampling_batch updated to use smm and prioritized_sampling
AOS55 Oct 25, 2022
d9d68f1
increased number of samples ot 150
AOS55 Oct 25, 2022
88886c9
increased num_updates to 500
AOS55 Oct 25, 2022
522f899
set random_start to false for pretraining
AOS55 Oct 25, 2022
9caf0f8
corrected based on new prioritized_sampling approach
AOS55 Oct 25, 2022
7aa5182
changed the number of skill dimensions to reflect z
AOS55 Oct 25, 2022
d3fca9e
added method to view goal indicator
AOS55 Oct 25, 2022
7950f4f
changed to view loss plotter
AOS55 Oct 25, 2022
7cc92c4
assert and store transitions
AOS55 Oct 25, 2022
324d99d
convert prior to correct shape in replay_buffer storage
AOS55 Oct 25, 2022
b2d1518
fixed diagram orientation
AOS55 Oct 25, 2022
a19aab8
added svb to environment types
AOS55 Oct 25, 2022
a29bf56
using simple_velocity_bot
AOS55 Oct 26, 2022
c2bfb72
correct for state representation
AOS55 Oct 26, 2022
0341ca5
add goal_behaviour
AOS55 Oct 28, 2022
573a262
added protocol for saving if ep length not 100
AOS55 Oct 31, 2022
ef8983f
working on ant environment
AOS55 Oct 31, 2022
c07c727
sampling prioritized replay
AOS55 Oct 31, 2022
eba2b4e
skill_dim
AOS55 Oct 31, 2022
199f422
remove print
AOS55 Oct 31, 2022
66b3eeb
change to sh script
AOS55 Oct 31, 2022
913b863
added log to gitignore
AOS55 Oct 31, 2022
31edba6
pass custom skill dim through smm.yaml
AOS55 Oct 31, 2022
81a7021
added .out to gitignore
AOS55 Oct 31, 2022
f2b9d46
Merge pull request #11 from AOS55/issue9-detach
AOS55 Oct 31, 2022
1160b3a
removed line that shouldnt be there
AOS55 Nov 1, 2022
f28e793
added reward function
AOS55 Nov 1, 2022
4ec240a
pulled out smm coefficients
AOS55 Nov 1, 2022
fea7dae
change p_reward when available
AOS55 Nov 6, 2022
78312e1
logging losses
AOS55 Nov 6, 2022
8e2849d
adding longer training
AOS55 Nov 6, 2022
9d2d788
added plot param and more training
AOS55 Nov 8, 2022
44b1ed9
fixed range of log_p_star
AOS55 Nov 8, 2022
74e7c6f
added smm params for pass through and log params
AOS55 Nov 8, 2022
36d8577
add pretrain ent_coef sweep
AOS55 Nov 8, 2022
44d77aa
reduced reward required, prioritized_sampling
AOS55 Nov 23, 2022
5df60b0
sampling batch over snapshot and skill dim
AOS55 Nov 23, 2022
50e6ab0
added procedure to rename files
AOS55 Nov 23, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
added svb to environment types
  • Loading branch information
AOS55 committed Oct 25, 2022
commit a19aab879e513f101bb3555c652cc0f5b5f974ec
3 changes: 2 additions & 1 deletion utils/env_constructor.py
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,8 @@
'BipedalWalker-v3': 'gym',
'CarRacing-v2': 'gym',
'LunarLander-v2': 'gym',
'SimplePointBot': 'safe'
'SimplePointBot': 'safe',
'SimpleVelocityBot': 'safe'
}

class ExtendedTimeStep(NamedTuple):
Expand Down