gchochla / Deep-Representations-of-Visual-Descriptions Public

Notifications You must be signed in to change notification settings
Fork 2
Star 18

Pytorch implementation of CVPR'16 paper "Learning Deep Representations of Fine-Grained Visual Descriptions", by Reed et al.

MIT license

18 stars 2 forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
crnns4captions		crnns4captions
docs		docs
etc		etc
scripts		scripts
.gitignore		.gitignore
README.md		README.md
pylintr		pylintr
setup.py		setup.py

Repository files navigation

Learning Deep Representations of Fine-Grained Visual Descriptions

Implementation of Convolutional Recurrent Neural Nets for zero-shot retrieval of images based on corresponding captions.

CRNNs consist of 1D Convolutional blocks followed by a RNN. Convolutions decrease the sequence length of the captions, allowing the RNN to learn efficiently.