[Homework] Fanpeng Meng's Homework for Visual Cognition Engineering 2022-Spring, School of AIA, HUST.
In this work, I implied three algorithm on saliency detection, including non-learning-based algorithm ITTI
, MDC
and learning-based model MLNet
on SALICON Dataset, and gave a visualize result of three models.
ITTI [paper] MDC [paper] MLNet [paper] [code]
[DATASET] SALICON [web]
Using commands to install the environment:
pip -r requirements.txt
Prepare your dataset as below:
dataset
└──── SALICON
├──── images
│ ├──── test
│ │ ├──── COCO_test2014_000000000001.jpg
│ │ └──── ...
│ ├──── train
│ │ ├──── COCO_train2014_000000000001.jpg
│ │ └──── ...
│ └──── val
│ ├──── COCO_val2014_000000000001.jpg
│ └──── ...
└──── images
├──── train
│ ├──── COCO_train2014_000000000001.jpg
│ └──── ...
└──── val
├──── COCO_val2014_000000000001.jpg
└──── ...
Testing on ITTI and MDC :
python main.py
We randomly choose 10 pictures for testing, see result in output/itti
and output/mdc
.
Training on MLNet, you can also configure the training in the shell:
bash train.sh
- I trained the model on a single NVIDIA A30 (24 GB). 200 epoch with batch size of 64 training takes about 2.5 hours.
Testing on MLNet
python test.py
We also randomly choose 10 pictures for testing, see result in output/mlnet
.
- The checkpoint will be upload on Google Drive, place it at
output/mlnet/checkpoint
if you haven't trained the model, or you can change the directory intest.py
.
The result is shown below :
The result is shown by the sequence of: the real image, the ground true image, the result.
The result is shown by the sequence of: the real image, the ground true image, the result, the result with image enhancement.
The result is shown by the sequence of: the real image, the ground true image, the result.
The following works are appreciated for great credit to my work:
@article{itti1998model,
title={A model of saliency-based visual attention for rapid scene analysis},
author={Itti, Laurent and Koch, Christof and Niebur, Ernst},
journal={IEEE Transactions on pattern analysis and machine intelligence},
volume={20},
number={11},
pages={1254--1259},
year={1998},
publisher={Ieee}
}
@inproceedings{cornia2016deep,
title={A deep multi-level network for saliency prediction},
author={Cornia, Marcella and Baraldi, Lorenzo and Serra, Giuseppe and Cucchiara, Rita},
booktitle={2016 23rd International Conference on Pattern Recognition (ICPR)},
pages={3488--3493},
year={2016},
organization={IEEE}
}
@article{huang2017300,
title={300-FPS salient object detection via minimum directional contrast},
author={Huang, Xiaoming and Zhang, Yu-Jin},
journal={IEEE Transactions on Image Processing},
volume={26},
number={9},
pages={4243--4254},
year={2017},
publisher={IEEE}
}