2 files changed: +14 -14 lines changed
@@ -20,6 +20,7 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.
 ### Bug Fixes
 - Fixed an issue which caused self-play training sessions to consume a lot of memory. (#3451)
 - Fixed an IndexError when using GAIL or behavioral cloning with demonstrations recorded with 0.14.0 or later (#3464)
+- Updated the `gail_config.yaml` to work with per-Agent steps (#3475)
 
 
 ## [0.14.0-preview] - 2020-02-13
@@ -14,27 +14,27 @@ default:
     num_layers: 2
     time_horizon: 64
     sequence_length: 64
-    summary_freq: 1000
+    summary_freq: 10000
     use_recurrent: false
     reward_signals:
         extrinsic:
             strength: 1.0
             gamma: 0.99
 
 Pyramids:
-    summary_freq: 2000
+    summary_freq: 30000
     time_horizon: 128
     batch_size: 128
     buffer_size: 2048
     hidden_units: 512
     num_layers: 2
     beta: 1.0e-2
-    max_steps: 5.0e5
+    max_steps: 1.0e7
     num_epoch: 3
     behavioral_cloning:
         demo_path: Project/Assets/ML-Agents/Examples/Pyramids/Demos/ExpertPyramid.demo
         strength: 0.5
-        steps: 10000
+        steps: 150000
     reward_signals:
         extrinsic:
             strength: 1.0
@@ -55,14 +55,14 @@ CrawlerStatic:
     time_horizon: 1000
     batch_size: 2024
     buffer_size: 20240
-    max_steps: 1e6
-    summary_freq: 3000
+    max_steps: 1e7
+    summary_freq: 30000
     num_layers: 3
     hidden_units: 512
     behavioral_cloning:
         demo_path: Project/Assets/ML-Agents/Examples/Crawler/Demos/ExpertCrawlerSta.demo
         strength: 0.5
-        steps: 5000
+        steps: 50000
     reward_signals:
         gail:
             strength: 1.0
@@ -71,20 +71,20 @@ CrawlerStatic:
             demo_path: Project/Assets/ML-Agents/Examples/Crawler/Demos/ExpertCrawlerSta.demo
 
 PushBlock:
-    max_steps: 5.0e4
+    max_steps: 1.5e7
     batch_size: 128
     buffer_size: 2048
     beta: 1.0e-2
     hidden_units: 256
-    summary_freq: 2000
+    summary_freq: 60000
     time_horizon: 64
     num_layers: 2
     reward_signals:
         gail:
             strength: 1.0
             gamma: 0.99
             encoding_size: 128
-            demo_path: Project/Assets/ML-Agents/Examples/PushBlock/Demos/ExpertPush.demo
+            demo_path: Project/Assets/Demonstrations/PushblockDemo.demo
 
 Hallway:
     use_recurrent: true
@@ -96,8 +96,8 @@ Hallway:
     num_epoch: 3
     buffer_size: 1024
     batch_size: 128
-    max_steps: 5.0e5
-    summary_freq: 1000
+    max_steps: 1.0e7
+    summary_freq: 10000
     time_horizon: 64
     reward_signals:
         extrinsic:
@@ -111,8 +111,7 @@ Hallway:
 
 FoodCollector:
     batch_size: 64
-    summary_freq: 1000
-    max_steps: 5.0e4
+    max_steps: 2.0e6
     use_recurrent: false
     hidden_units: 128
     learning_rate: 3.0e-4
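
The changelog entry says the config was updated to work with per-Agent steps, but the diff does not state a single conversion factor. As a rough sanity check, the following Python sketch tabulates how much each changed value grew. The old and new numbers are transcribed from the hunks above; the names `CHANGES` and `scale_factors` are illustrative only and are not part of ML-Agents.

```python
# Old and new values transcribed from the diff hunks above.
# Keys are (config section, setting); values are (old, new).
CHANGES = {
    ("default", "summary_freq"): (1000, 10000),
    ("Pyramids", "summary_freq"): (2000, 30000),
    ("Pyramids", "max_steps"): (5.0e5, 1.0e7),
    ("Pyramids", "behavioral_cloning.steps"): (10000, 150000),
    ("CrawlerStatic", "max_steps"): (1e6, 1e7),
    ("CrawlerStatic", "summary_freq"): (3000, 30000),
    ("CrawlerStatic", "behavioral_cloning.steps"): (5000, 50000),
    ("PushBlock", "max_steps"): (5.0e4, 1.5e7),
    ("PushBlock", "summary_freq"): (2000, 60000),
    ("Hallway", "max_steps"): (5.0e5, 1.0e7),
    ("Hallway", "summary_freq"): (1000, 10000),
    ("FoodCollector", "max_steps"): (5.0e4, 2.0e6),
}


def scale_factors(changes):
    """Return new/old for every changed setting (hypothetical helper)."""
    return {key: new / old for key, (old, new) in changes.items()}


if __name__ == "__main__":
    for (section, setting), factor in scale_factors(CHANGES).items():
        print(f"{section}.{setting}: x{factor:g}")
```

The factors are not uniform across sections, which suggests the values were retuned per environment rather than multiplied by one constant. `FoodCollector` drops its own `summary_freq` instead of scaling it, so it presumably falls back to the `default` value.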