New LSTM unit example:
Do you think it is enough?
@antinucleon Cool, you fixed it fast. I just pulled these changes. I will check them soon and let you know...
@antinucleon I am trying it out today, and possibly tomorrow too. How do I pass in the mask variables? Do I feed them from my DataIter?
Yes
@antinucleon I am not sure how to feed it from the DataIter. Is it through provide_data or provide_label? The errors I am getting are not very informative...
This is how I use your lstm in my lstm_unroll
I am writing a data iter sample now, please wait a moment...
Here is a shitty working example that runs without raising an error. It needs more work to polish it and verify correctness. https://gist.github.com/antinucleon/c4ff26032f3a97f6aaf89680dfabe291
@antinucleon Thanks, I will take a look tomorrow...
Basically, the mask is a vector with the same length and shape as data, but it uses 0 for padding and 1 for non-padding inputs, and it is passed through the DataIter using provide_data. I am currently modifying my DataIter for this, hang on...
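To make the mask layout described above concrete, here is a minimal pure-Python sketch. The helper name `pad_and_mask` and the padding id `0` are assumptions for illustration, not code from this PR or the linked gist. In an actual iterator, both arrays would be exposed as inputs through provide_data, as mentioned above.

```python
PAD_ID = 0  # hypothetical id of the PAD symbol; use whatever your vocabulary defines


def pad_and_mask(sequences, max_len, pad_id=PAD_ID):
    """Pad each sequence to max_len and build a mask of the same shape.

    The mask holds 1 for real tokens and 0 for padding positions.
    Sequences longer than max_len are truncated.
    """
    data, mask = [], []
    for seq in sequences:
        seq = list(seq)[:max_len]
        n_pad = max_len - len(seq)
        data.append(seq + [pad_id] * n_pad)
        mask.append([1] * len(seq) + [0] * n_pad)
    return data, mask


batch = [[5, 8, 2], [7], [3, 9]]
data, mask = pad_and_mask(batch, max_len=4)
```

Each row of `mask` lines up element by element with the matching row of `data`, so the two can be fed to the network as a pair of same-shaped inputs.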
Great! This is fast!
@freddycct I found the bug in the shitty prototype iterator. Please let me know your experiment results.
@antinucleon OK, the masking layer works fine for me after calling model.fit. I also noticed that the Embedding vector representing the PAD symbol is not changing. That means the gradient is not backpropagated to the embedding layer for padded positions, and that is great. As a next step, I need to test the RNN inference part, so hang on, I am continuing to test... But I hope this gives you some confidence that your code is working.
@antinucleon It works. My sequence to sequence is working, thanks!
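The observation about the PAD embedding can be illustrated with a toy calculation. This is a simplified sketch, not the masking code from this PR: it assumes the mask multiplies each time step's contribution to the loss, so the gradient reaching the embedding row at a masked position is scaled by 0. The function `embedding_grad` and all values are hypothetical.

```python
def embedding_grad(token_ids, mask, upstream, vocab_size):
    """Accumulate the gradient for each embedding row.

    Toy model: loss = sum_t mask[t] * dot(upstream[t], E[token_ids[t]]),
    so d(loss)/d(E[k]) = sum over t with token_ids[t] == k of mask[t] * upstream[t].
    """
    dim = len(upstream[0])
    grad = [[0.0] * dim for _ in range(vocab_size)]
    for tok, m, g in zip(token_ids, mask, upstream):
        for j in range(dim):
            grad[tok][j] += m * g[j]
    return grad


token_ids = [3, 1, 0, 0]  # id 0 is the assumed PAD symbol
mask = [1, 1, 0, 0]       # 1 for real tokens, 0 for padding
upstream = [[0.5, -1.0], [2.0, 0.25], [9.0, 9.0], [9.0, 9.0]]
grad = embedding_grad(token_ids, mask, upstream, vocab_size=4)
pad_row_grad = grad[0]
```

Under this assumption the PAD row receives a zero gradient no matter how large the upstream values at padded steps are, which matches an embedding vector that stays fixed during training. It only holds when the PAD id never appears at an unmasked position.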