Ivan Sosnovik*, Artem Moskalev*, and Arnold Smeulders Scale Equivariance Improves Siamese Tracking, WACV 2021.
Siamese trackers turn tracking into similarity estimation between a template and the candidate regions in the frame. Mathematically, one of the key ingredients of success of the similarity function is translation equivariance. Non-translation-equivariant architectures induce a positional bias during training, which hinders accurate localization. In real-life scenarios, however, the target undergos more transformations than just translation, e.g. rotation and scaling. In this work we focus on the later and demonstrate that extending Siamese tracking with built-in scale-equivariance improves tracking quality.
Models | OTB-2013 | OTB-2015 | VOT2016 | VOT2017 |
---|---|---|---|---|
SiamFC+ | 0.67 | 0.64 | 0.30 | 0.23 |
SE-SiamFC | 0.68 | 0.66 | 0.36 | 0.27 |
Raw results and models are available here
Environment: The code is developed with Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz GPU: NVIDIA Titan RTX,
To train models and to run evaluations, you need the following dependencies
yacs
scipy
shapely
opencv-python
numpy
pytorch
torchvision
pyyaml
You will need to install python API for MATLAB engine to run the VOT benchmarks.
- Download the weights for model initialization from here. We used
SESiamFCResNet22_pretrained.pth
. You can also run the scripttransfer_weights.py
to generate your own weights. - Download the preprocessed training and testing datasets. Use the instructions provided here
- Adjust the training config in
configs/train.yaml
according to your tasks. - Run the training script
CUDA_VISIBLE_DEVICES=0 python train_siamfc.py --cfg configs/train.yaml
- To run the tracker from a snapshot, simply do the following
CUDA_VISIBLE_DEVICES=0 python test_siamfc.py \
--checkpoint snapshot/SESiamFCResNet22/checkpoint_otb.pth \
--dataset OTB2015 \
--dataset_root ~/datasets/ \
--cfg configs/test.yaml
- When the output is generated, you are able to evaluate the performance of the tracker. Use
lib/core/eval_otb.py
orlib/core/eval_vot.py
. For example,
python lib/core/eval_otb.py \
--dataset OTB2015 \
--dataset_root ~/datasets \
--result_path results \
--tracker_reg "SE*" \
The Robert Bosch GmbH is acknowledged for financial support.
If you found this work useful in your research, please consider citing
@InProceedings{Sosnovik_2021_WACV,
author = {Sosnovik, Ivan and Moskalev, Artem and Smeulders, Arnold W.M.},
title = {Scale Equivariance Improves Siamese Tracking},
booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
month = {January},
year = {2021},
pages = {2765-2774}
}
Licensed under an MIT license.