Commit 1d94723

adding slides for cross entropy RL method
1 parent adb4193 commit 1d94723

2 files changed: +7 -4 lines changed

08a_Basics_of_Reinforcement_Learning.ipynb

Lines changed: 7 additions & 4 deletions
@@ -71,7 +71,10 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On/blob/master/Chapter04/01_cartpole.py"
+"We will start with an example that we manipualted from the following book by M. Lapan: \n",
+"https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On/blob/master/Chapter04/01_cartpole.py\n",
+"\n",
+"This is a nice introduction to RL with many pytorch examples included, similar to the one below. Here I have manipulated the example to run in jupyter, changed a few default behaviors, and added many more comments for understanding. "
 ]
 },
 {
@@ -194,8 +197,8 @@
 " if example.reward >= reward_bound:\n",
 " # extend data arrays with obs and desired actions\n",
 " # extend adds elements to list from another list\n",
-" # Syntax: map(function, what to iterate over)\n",
-" # just get the actions and observations for this episode\n",
+" # Syntax: map(function, what to iterate over)\n",
+" # just get the actions and observations for this episode\n",
 " train_obs.extend(map(lambda step: step.observation, example.steps))\n",
 " train_act.extend(map(lambda step: step.action, example.steps))\n",
 "\n",
@@ -340,7 +343,7 @@
 "metadata": {},
 "outputs": [],
 "source": [
-"env.close()# calling this will end the current environment"
+"env.close()# calling this will close and delete the current environment"
 ]
 },
 {
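
For readers unfamiliar with the `extend(map(...))` idiom in the second hunk, here is a minimal standalone sketch of that filtering step. The `Episode` and `EpisodeStep` containers and the `filter_batch` wrapper are hypothetical stand-ins chosen to match the attribute names visible in the diff (`example.reward`, `example.steps`, `step.observation`, `step.action`). How `reward_bound` is computed is not shown in the diff, so it is simply passed in here.

```python
from collections import namedtuple

# Hypothetical containers; only the attribute names come from the diff.
Episode = namedtuple("Episode", ["reward", "steps"])
EpisodeStep = namedtuple("EpisodeStep", ["observation", "action"])


def filter_batch(batch, reward_bound):
    """Collect observations and actions from episodes at or above reward_bound."""
    train_obs, train_act = [], []
    for example in batch:
        if example.reward >= reward_bound:
            # map(function, iterable) yields one value per step;
            # list.extend appends each of those values to the list.
            train_obs.extend(map(lambda step: step.observation, example.steps))
            train_act.extend(map(lambda step: step.action, example.steps))
    return train_obs, train_act


# Two toy episodes: only the first one meets the reward bound of 5.0.
batch = [
    Episode(reward=10.0, steps=[EpisodeStep([0.1, 0.2], 0), EpisodeStep([0.3, 0.4], 1)]),
    Episode(reward=2.0, steps=[EpisodeStep([0.5, 0.6], 1)]),
]
obs, act = filter_batch(batch, reward_bound=5.0)
print(obs)  # [[0.1, 0.2], [0.3, 0.4]]
print(act)  # [0, 1]
```

Only the steps of the high-reward episode end up in the training lists, which is the behavior the comments in the hunk describe.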

PDF_slides/DL_6b_ce_method.pdf

828 KB
Binary file not shown.

0 commit comments

Comments
 (0)