Replies: 1 comment
-
Using |
Hello 👋
I'm using BindsNET to train an SNN to solve the CartPole problem. Since I want finer control over the encoding and rewards, I've opted out of using the pipeline.
Here's the structure of my network:
I noted that whenever I do
the `reward` is applied indiscriminately to all the weights of `hidden_out` (all increase/decrease), not taking into account which output neuron caused that `reward`.

How can I specify which connection weights to update? Do I have to break the output layer into two separate ones? Bonus: How can I propagate this causal update through a deep network?
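To make the question concrete, here is a minimal sketch of the behaviour I'm after, written in plain Python rather than with BindsNET's API. It assumes each synapse keeps an eligibility trace and that output neuron `j` encodes action `j`; the function name `masked_reward_update` and all the numbers are made up for illustration.

```python
def masked_reward_update(weights, eligibility, reward, action, lr=0.01):
    """Apply a reward-modulated update only to synapses feeding the acting neuron.

    weights, eligibility: nested lists of shape (n_hidden, n_out).
    action: index of the output neuron whose spike selected the action.
    Returns a new weight matrix; the input lists are left untouched.
    """
    return [
        [
            # Only the column of the acting output neuron is changed.
            w + lr * reward * e if j == action else w
            for j, (w, e) in enumerate(zip(w_row, e_row))
        ]
        for w_row, e_row in zip(weights, eligibility)
    ]


# Hypothetical 3-hidden x 2-output example (CartPole has two actions).
weights = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]
eligibility = [[1.0, 2.0], [0.0, 1.0], [3.0, 0.5]]

# Output neuron 1 chose the action and the environment returned reward -1.
updated = masked_reward_update(weights, eligibility, reward=-1.0, action=1, lr=0.1)
for row in updated:
    print(row)
```

Here only the second column (the synapses into output neuron 1) changes, while the synapses into output neuron 0 keep their values. What I don't know is whether something equivalent can be expressed with the existing learning rules, or whether it requires a separate connection per output neuron.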