Image Captioning

A simple Streamlit app that generates image captions using ResNet-18 features and a Transformer decoder. Upload your own image or try a sample to see what the model describes!

Project Notebook

Preprocessing, Training & Full Pipeline Notebook

Live Demo

Try it on Streamlit Cloud

Model Info

Feature Extractor: ResNet-18 (pretrained)
Decoder: Transformer (3 layers, 8 heads, embedding size 512, feed-forward size 2048, dropout 0.2)
Vocabulary: 7,234 words
Metric: BLEU-4 = 0.18
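
For readers new to the metric, here is a minimal sketch of sentence-level BLEU-4 in plain Python. It assumes uniform n-gram weights and no smoothing; this README does not say how the 0.18 above was computed (corpus-level or sentence-level, smoothed or not), so treat the function names `ngrams` and `bleu4` as illustrative rather than the project's own evaluation code.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, references):
    """Sentence-level BLEU-4 with uniform weights and no smoothing (illustrative)."""
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, 5):
        cand_counts = ngrams(candidate, n)
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        if clipped == 0 or total == 0:
            return 0.0  # without smoothing, one empty n-gram order zeroes the score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    brevity = 1.0 if c > r else math.exp(1 - r / c)
    return brevity * math.exp(sum(log_precisions) / 4)
```

Call it with tokenized captions, for example `bleu4("a dog runs on grass".split(), ["a dog runs on the grass".split()])`. Scores range from 0 to 1, and an exact match with a reference of four or more tokens scores 1.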

Model Download

The app auto-downloads the model files when you run it, so you only need to download them manually if you want to.
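
If you are curious what a download-if-missing step can look like, here is a minimal sketch using only the standard library. It is not taken from this repo's code: the helper name `ensure_file`, the `.part` temporary suffix, and any path or URL you pass in are assumptions for illustration.

```python
import os
import urllib.request


def ensure_file(path, url, fetch=urllib.request.urlretrieve):
    """Download `url` to `path` unless the file already exists.

    Returns True if a download happened, False if the file was already there.
    `fetch` is injectable so the logic can be exercised without network access.
    """
    if os.path.exists(path):
        return False
    folder = os.path.dirname(path)
    if folder:
        os.makedirs(folder, exist_ok=True)
    tmp_path = path + ".part"
    fetch(url, tmp_path)        # download under a temporary name first
    os.replace(tmp_path, path)  # then move into place, so a partial file is never picked up
    return True


# Hypothetical usage (placeholder path and URL, not the project's real ones):
# ensure_file("models/example_weights.bin", "https://example.com/example_weights.bin")
```

Writing to a temporary name and renaming afterwards means an interrupted download does not leave a half-written file that later runs would mistake for a complete one.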

How to Run Locally

Clone this repo:

```
git clone https://github.com/paudelsamir/Image-Captioning-Transformer.git
cd Image-Captioning-Transformer
```

Install dependencies:
```
pip install -r requirements.txt
```

Run the app:

```
streamlit run app.py
```

Or run the demo app (no requirements needed):

```
streamlit run demo_app.py
```

Windows Users

Run the setup script:

```
setup.bat
```

Linux/Mac Users

Make the setup script executable and run it:

```
chmod +x setup.sh
./setup.sh
```

Author

@samir

This is a fun project for learning and demo purposes. For details, see the notebook above.
