File tree Expand file tree Collapse file tree 3 files changed +4
-4
lines changed Expand file tree Collapse file tree 3 files changed +4
-4
lines changed Original file line number Diff line number Diff line change @@ -302,4 +302,4 @@ <h3>Double <span ><span class="katex"><span aria-hidden="true" class="katex-html
302
302
handleImages ( )
303
303
</ script >
304
304
</ body >
305
- </ html >
305
+ </ html >
Original file line number Diff line number Diff line change 554
554
555
555
<url >
556
556
<loc >https://nn.labml.ai/diffusion/stable_diffusion/latent_diffusion.html</loc >
557
- <lastmod >2022-09-15T16 :30:00+00:00</lastmod >
557
+ <lastmod >2022-12-21T16 :30:00+00:00</lastmod >
558
558
<priority >1.00</priority >
559
559
</url >
560
560
1177
1177
1178
1178
<url >
1179
1179
<loc >https://nn.labml.ai/transformers/mha.html</loc >
1180
- <lastmod >2022-09-07T16 :30:00+00:00</lastmod >
1180
+ <lastmod >2022-12-24T16 :30:00+00:00</lastmod >
1181
1181
<priority >1.00</priority >
1182
1182
</url >
1183
1183
Original file line number Diff line number Diff line change @@ -51,7 +51,7 @@ class QFuncLoss(Module):
51
51
### Target network 🎯
52
52
In order to improve stability we use experience replay that randomly sample
53
53
from previous experience $U(D)$. We also use a Q network
54
- with a separate set of paramters $\t extcolor{orange}{\t heta_i^{-}}$ to calculate the target.
54
+ with a separate set of parameters $\t extcolor{orange}{\t heta_i^{-}}$ to calculate the target.
55
55
$\t extcolor{orange}{\t heta_i^{-}}$ is updated periodically.
56
56
This is according to paper
57
57
[Human Level Control Through Deep Reinforcement Learning](https://deepmind.com/research/dqn/).
You can’t perform that action at this time.
0 commit comments