Image-Captioning-using-Attention-Mechanism

CourseWork Project, Mathematics for Machine Learning

Teamates

Adya Bhat
Aditya G H

It uses a encoder-decoder architecture to caption Images. I used a pretrained ResNet model as the encoder to extract image features, combined with an LSTM to generate captions using the extracted features.

Model Architecture

Dataset

Flickr 8K - It consists of a folder Images with images (.jpg format), and a a text file captions.txt. Each image has five captions each. containing the captions corresponding to the image file names.

The format of the text file is:

<image-filename>,<caption>\n

Results

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
image-captioning.ipynb		image-captioning.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image-Captioning-using-Attention-Mechanism

About

Releases

Packages

Languages

License

bcsamrudh/Image-Captioning-using-Attention-Mechanism

Folders and files

Latest commit

History

Repository files navigation

Image-Captioning-using-Attention-Mechanism

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages