A natural language guided, interactive neural network model for infrared and visible image fusion

To begin with, you need to create your own virtual environment. We provide our environment configuration in environment.yml. You just need to activate your environment and run

conda env create -f environment.yml

and then activate the new virtual environment:

conda activate reconet

For testing

We provide our test datasets in ../data, which includes four test sets from four public infrared and visible image fusion datasets.

Pretrained parameters are provided in Google drive

If you just want to test the performance of our model, you just need to run:

# set project path for python
export PYTHONPATH="${PYTHONPATH}:$ROOT"
python scripts/test.py --data $your_data_path (e.g.  data/TNO_test) --ckpt checkpoint/train_llvip_622.ckpt --dst $your_save_path

$ROOT: your project root

$your_data_path: path of your testing data

$your_save_path: path of your fusion result

Then you can obtain the fusion result which we already put in ../result.

For training

If you want to train the model yourself:

# set project path for python
export PYTHONPATH="${PYTHONPATH}:$ROOT"
python scripts/train.py --data $DATA --ckpt $CHECKPOINT_PATH --lr 1e-3
python scripts/pred.py --data $DATA --ckpt $CHECKPOINT_PATH --dst $result_path

$DATA: path of your training or testing dataset

$CHECKPOINT_PATH: path of your checkpoint files

$result_path: path of your fusion result

Experiment details

When we train on LLVIP dataset, the text for task is "This is an infrared and visible image fusion task.", the text for visible images is "low light degradation" and the text for infrared images is "low contrast and blurred"; when we predict on LLVIP and MSRS dataset, the text for task is "This is an infrared and visible image fusion task.", the text for visible images is "maybe low light degradation and overexposure degradation in visible images." and the text for infrared images is "low contrast issues" ratio 0.6 0.2 0.2

when we predict on RoadScene dataset, the text for task is "This is an infrared and visible image fusion task. Should preserve more visible image information", the text for visible images is "The brightness is too high, and some scene lights are too bright; it has less noise" and the text for infrared images is "low contrast issues" ratio 0.6 0.3 0.1

Metric

The python code for calculating various metrics in our paper can refer to : CSDN blog

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
__pycache__		__pycache__
archive		archive
checkpoint		checkpoint
data		data
lightning		lightning
modules		modules
result		result
scripts		scripts
utils		utils
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A natural language guided, interactive neural network model for infrared and visible image fusion

For testing

For training

Experiment details

Metric

About

Releases

Packages

Languages

License

jijingdeyi/NLG-fusion

Folders and files

Latest commit

History

Repository files navigation

A natural language guided, interactive neural network model for infrared and visible image fusion

For testing

For training

Experiment details

Metric

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages