Skip to content

Commit aa7a7f1

Browse files
Deric Pangxiaomaogy
Deric Pang
authored andcommitted
Fix doc which refers to nonexistent agents. (#923)
1 parent f3f7205 commit aa7a7f1

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/Training-Curriculum-Learning.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,8 @@ obstructs the path to the goal._
1919

2020
To see this in action, observe the two learning curves below. Each displays the reward
2121
over time for an agent trained using PPO with the same set of training hyperparameters.
22-
The difference is that the agent on the left was trained using the full-height wall
23-
version of the task, and the right agent was trained using the curriculum version of
22+
The difference is that one agent was trained using the full-height wall
23+
version of the task, and the other agent was trained using the curriculum version of
2424
the task. As you can see, without using curriculum learning the agent has a lot of
2525
difficulty. We think that by using well-crafted curricula, agents trained using
2626
reinforcement learning will be able to accomplish tasks otherwise much more difficult.

0 commit comments

Comments
 (0)