This is the official repository of the accepted at WACV 2022: LMFD-PAD: Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection. The paper can be found in here.
Since the data in all used PAD datasets in our work are videos, we sample 10 frames in the average time interval of each video. In addition, the ratio of bona fide and attack is balanced by simple duplication. Finally, CSV files are generated for further training and evaluation. The format of the dataset CSV file is:
image_path,label
/image_dir/image_file_1.png, bonafide
/image_dir/image_file_2.png, bonafide
/image_dir/image_file_3.png, attack
/image_dir/image_file_4.png, attack
The training code for intra-dataset and cross-dataset experiments is same, the difference code between intra_db_main.py and cross_db_main.py is evaluation metrics.
- Example of intra-dataset training and testing:
python intra_db_main.py \ --protocol_dir 'dir_containing_csv_files' \ --backbone resnet50 \ --pretrain True \ --lr 0.001 \ --batch_size 64 \ --prefix 'custom_note' \
- Example of cross-dataset training and testing is similar:
python cross_db_main.py \ --protocol_dir 'dir_containing_csv_files' \ --backbone resnet50 \ --pretrain True \ --lr 0.001 \ --batch_size 64 \ --prefix 'custom_note' \
The results of cross-dataset evaluation under different experimental settings on four face PAD datasets. More details can be found in paper.
Four models pre-trained based on four cross-dataset experimental settings can be download via google driver. Please using the following threshold for testing those pre-trained weights. The thresholds of icm_o, ocm_i, omi_c, and oci_m models are 0.7309441, 0.6971898, 0.613508, and 0.53312653, respectively. More information and small test can be found in test.py. Please make sure give the correct model path.
if you use LMFD-HAM architecture in this repository, please cite the following paper:
@inproceedings{DBLP:conf/wacv/FangDKK22,
author = {Meiling Fang and
Naser Damer and
Florian Kirchbuchner and
Arjan Kuijper},
title = {Learnable Multi-level Frequency Decomposition and Hierarchical Attention
Mechanism for Generalized Face Presentation Attack Detection},
booktitle = {{WACV}},
pages = {1131--1140},
publisher = {{IEEE}},
year = {2022}
}
This project is licensed under the terms of the Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license. Copyright (c) 2020 Fraunhofer Institute for Computer Graphics Research IGD Darmstadt.