The evolution of generative models has made it easy to produce variations of an image. The Image Difference Captioning (IDC) task, which aims to describe the differences between two images in natural language, remains challenging on real-world images due to data scarcity and the subtlety of the differences to capture. We introduce BLIP2IDC, a model that adapts BLIP2, an image captioning model, to IDC, together with a synthetic augmentation strategy that enriches IDC datasets, demonstrating superior performance on real-world datasets.
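For intuition, the core idea can be pictured as captioning a single composite of the two input images with BLIP2. The sketch below is illustrative only: the checkpoint name, the prompt, and the side-by-side concatenation are assumptions made for this example; the actual BLIP2IDC input encoding and fine-tuned weights are those provided by this repository.

```python
# Illustrative sketch only: checkpoint, prompt, and the side-by-side concatenation
# are assumptions; the actual BLIP2IDC encoding and weights come from this repository.
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

device = "cuda" if torch.cuda.is_available() else "cpu"
processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b").to(device)

def describe_difference(path_before: str, path_after: str) -> str:
    """Caption the difference between two images by feeding BLIP2 a single composite image."""
    before = Image.open(path_before).convert("RGB")
    after = Image.open(path_after).convert("RGB")
    # Place the two images side by side so the captioner sees both at once.
    canvas = Image.new("RGB", (before.width + after.width, max(before.height, after.height)))
    canvas.paste(before, (0, 0))
    canvas.paste(after, (before.width, 0))
    prompt = "Question: what changed between the left and the right image? Answer:"
    inputs = processor(images=canvas, text=prompt, return_tensors="pt").to(device)
    out = model.generate(**inputs, max_new_tokens=40)
    return processor.batch_decode(out, skip_special_tokens=True)[0].strip()
```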
Ensure you have access to the necessary datasets for training and evaluation. For synthetic data augmentation and training with BLIP2IDC, the following are required:
- Python 3.8 or higher
- PyTorch 1.7 or higher
- Access to a GPU for efficient training
- Clone the BLIP2IDC repository:
  `git clone https://github.com/Gautier29/BLIP2IDC.git`
- Install the required Python packages:
  `cd BLIP2IDC`
  `pip install -r requirements.txt`
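After installation, a quick sanity check can confirm that the interpreter, PyTorch version, and GPU visibility match the requirements above. This snippet is just a convenience and is not part of the repository's scripts.

```python
# Environment sanity check (illustrative; not part of the repository's scripts).
import sys
import torch

print("Python:", sys.version.split()[0])        # expected: 3.8 or higher
print("PyTorch:", torch.__version__)            # expected: 1.7 or higher
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```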
To start training BLIP2IDC with synthetic data augmentation, follow these steps:
- Ensure your datasets are properly set up in a directory accessible by the script.
- Run the training script with the appropriate parameters (a conceptual sketch of the underlying fine-tuning is given after this list):
  `./train_BLIP2IDC.sh`
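For readers who want an idea of what happens under the hood: a common way to adapt a large captioning model such as BLIP2 at moderate cost is parameter-efficient fine-tuning on (image pair, difference caption) examples. The sketch below assumes LoRA via the peft library, a generic Hugging Face BLIP2 checkpoint, and a pre-built composite image of the pair; these are illustrative assumptions, not the repository's actual training code, which is what train_BLIP2IDC.sh runs.

```python
# Conceptual sketch of adapting BLIP2 to IDC with parameter-efficient fine-tuning.
# The checkpoint, LoRA hyperparameters, target modules, and single-step loop below
# are illustrative assumptions; the real training logic is in this repository.
import torch
from transformers import Blip2Processor, Blip2ForConditionalGeneration
from peft import LoraConfig, get_peft_model

processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b")

# Train small low-rank adapters instead of the full model.
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections of the language model
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a small fraction of weights is trainable

optimizer = torch.optim.AdamW((p for p in model.parameters() if p.requires_grad), lr=1e-4)

def training_step(pair_image, difference_caption: str) -> float:
    """One optimization step on a (composite image pair, difference caption) example."""
    inputs = processor(images=pair_image, text=difference_caption, return_tensors="pt")
    outputs = model(**inputs, labels=inputs["input_ids"])  # caption tokens are the targets
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return outputs.loss.item()
```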
The train_BLIP2IDC.sh script automatically applies the synthetic augmentation techniques described in our paper to enhance the performance of the BLIP2IDC model (a conceptual sketch of this kind of augmentation follows the evaluation command below). Then, for evaluation, run the evaluation script:
`./test_BLIP2IDC.sh`
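To give an intuition for the augmentation itself (described in detail in the paper): one way to create synthetic IDC data is to run an instruction-driven image editing model over real images, so that the edit instruction doubles as the difference caption. The sketch below uses the public InstructPix2Pix pipeline from diffusers purely as an illustration; the generator, prompts, file names, and parameters actually used for our augmentation are those described in the paper.

```python
# Illustrative sketch of instruction-driven synthetic augmentation for IDC.
# Pipeline, checkpoint, file names, and parameters here are assumptions; the
# augmentation actually used by BLIP2IDC is the one described in the paper.
import torch
from PIL import Image
from diffusers import StableDiffusionInstructPix2PixPipeline

pipe = StableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix", torch_dtype=torch.float16
).to("cuda")

source = Image.open("example.jpg").convert("RGB")
instruction = "make the sky look like a sunset"

# The edited image plus the instruction form one synthetic IDC triplet:
# (image before, image after, caption describing the difference).
edited = pipe(instruction, image=source,
              num_inference_steps=20, image_guidance_scale=1.5).images[0]
edited.save("example_edited.jpg")
```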
Our Syned dataset, designed specifically for IDC, will be released upon publication. Samples can be found in the supplementary material of the paper.