[WACV2024] HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information
This repository contains the code used for HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information 🔗 by Heitor Rapela Medeiros, Fidel A. Guerrero Pena, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli (WACV 2024).
Recently, this work was also accepted as an extended abstract in the LatinX in CV (LXCV) @CVPR2024 🔗
conda create -n hallucidet python=3.8.10
conda activate hallucidet
conda install pytorch torchvision torchaudio cpuonly -c pytorch
pip install matplotlib==3.7.0
pip install pytorch_lightning==1.5.10
pip install opencv-python==4.6.0.66
pip install imageio==2.22.4
pip install scikit-image==0.19.3
pip install scikit-learn=1.1.3
pip install pandas=1.5.3
pip install pycocotools==
pip install torchmetrics==1.5.3
pip install wandb==1.5.3
## For training the initial rgb model that is the baseline
## This is the seed123 result for the fasterrcnn on llvip dataset
CUDA_VISIBLE_DEVICES=0 python train_detector.py --pretrained --wandb-project wacv2024 --wandb-name detector_fasterrcnn_rgb_llvip_200ep_seed123 --detector fasterrcnn --modality rgb --dataset llvip --epochs 200 --batch 16 --seed 123
## Train HalluciDet
CUDA_VISIBLE_DEVICES=0 python train_hallucidet.py --pretrained --modality ir --detector-path ./lightning_logs/wacv2024/detector_fasterrcnn_rgb_llvip_200ep_seed123/llvip_rgb_fasterrcnn/best.ckpt --wandb-project wacv2024 --wandb-name detector_fasterrcnn_hallucidet_det01reg01_llvip_200ep_seed123 --detector fasterrcnn --dataset llvip --epochs 200 --batch 8 --seed 123
Download the pre-trained weights: https://huggingface.co/heitorrapela/hallucidet
## Eval for Faster R-CNN HalluciDet
CUDA_VISIBLE_DEVICES=0 python eval_hallucidet.py --pretrained --modality ir --detector-path ./checkpoints/llvip/seed123/fasterrcnn_rgb_llvip_seed123.ckpt --hallucidet-path ./checkpoints/llvip/seed123/hallucidet_llvip_seed123.ckpt --wandb-project wacv2024 --wandb-name detector_fasterrcnn_hallucidet_det01reg01_llvip_200ep_seed123 --detector fasterrcnn --dataset llvip --epochs 1 --batch 8 --seed 123
WACV2024 Recorded Video
Talk at LIVIA
@inproceedings{medeiros2024hallucidet,
title={HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information},
author={Medeiros, Heitor Rapela and Pena, Fidel A Guerrero and Aminbeidokhti, Masih and Dubail, Thomas and Granger, Eric and Pedersoli, Marco},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
pages={1444--1453},
year={2024}
}
Really thanks for the great open source community that provided good libraries.