-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extra Encoders (Image, Video, sound, ...) #259
Comments
Maybe also a Grid Cell encoder, which converts a Cartesian coordinate into a grid-cell like encoding. |
The original nupic had a "time & date" encoder. It's not critical, but might be nice to have. I think the motivation for this encoder is that many anomalies correlate with a time of day or day of the week. HTM school video for date/time encoder: |
I wrote a vision encoder which may be added to this repo. It needs some cleanup & modification before it will be ready, and I don't know when I will get around to it. Currently it is a research prototype. It is written in python, and uses opencv. The Opencv library has functions which do log-polar transforms, and Parvo/Magno-cellular transforms. Open-CV works well and it looks pretty well researched. I wrote an encoder which converts the processed images into sparse-distributed-representations. My encoders are less well researched. Here are some images showing the log-polar & parvo/magno-cellular transforms. Here are some statistics about the SDR encoded output of eye, after viewing the dataset which contains the above image.
|
This is amazing!!
I'll need to refresh my knowledge, this is a good starter: I cite:
= image classification type of tasks
= video processing in vision tasks. motion detection/tracking.
So you needed to produce several SDRs from the image.
I am wondering how this kind of biologically plausible encoding would fare on "stupid" classification datasets, like MNIST, CIFAR10 etc. I'll be reading more chapters on vision, please share your retina code when you have time, even if it's not ready yet. Thank you |
That looks like a good source. My model is wrong with regards to many details, including the relative densities of magnocellular and parvocellular neurons.
No, this is TODO. Currently I move the eye by a small random amount between each compute cycle. Controlling where the eye looks is an open issue, and it involves action selection and motor control.
I'm keeping my latest work on the eye encoder here: https://github.com/ctrl-z-9000-times/sdr_algorithms/blob/master/eye.py I don't know when I will have time to work on it further. |
yes, according to the literature, these modifications could apply:
would the saccadic moves wrongly trigger movement/magno cells? Or would that help to encode "I moved the eye/focus, so the 'move' is caused by the move of sensor, not move of the objects in the scene"?
cool! Would you please make just an initial PR with the eye.py (or other necessarities) when you have time? I'd like to play with it in the next week and I'll try to adapt it to the current state of htm.core. I just ask for this so you author the file, so you'd get the (c) for the lines :) I'll then continue to make modifications to it. Btw, reviewing that repo of yours, we're pretty much synced, aren't we? ae, CP, SDR...are more or less here 👍 . |
For some experiments, I'd like to setup
encoders/extra/{vision,audio,...}/
with specialized encoders for multiple modalities.
There existed special repos as
But basically these were
I think it'd help our community if we provided all-around baseline.
What do you think?
EDIT:
Domains:
The text was updated successfully, but these errors were encountered: