Texture, Shape, Order, and Relation Matter: A New Transformer Design for Sequential DeepFake Detection

Yunfei Li, Yuezun Li*, Baoyuan Wu, Junyu Dong, Guopu Zhu, Siwei Lyu
VAS-Group, Ocean University of China 

📢 Updates

  • [07/2025] The code for the extended paper has been updated.
  • [07/2025] The extended paper has been released.
  • [12/2024] Pretrained models are uploaded.
  • [12/2024] Code is released.
  • [10/2024] Accepted to WACV 2025.
  • [04/2024] Paper released.

🪧 Introduction

This is the official PyTorch implementation of Texture, Shape and Order Matter: A New Transformer Design for Sequential DeepFake Detection (WACV 2025, Oral). In this paper, we describe a new Transformer design, called TSOM, that explores three perspectives of manipulations: Texture, Shape, and Order. Extensive experimental results demonstrate that our method outperforms previous methods by a large margin.

The framework of the proposed method:

🛠️ Installation

Environment

We recommend using Anaconda to manage the Python environment:

conda create -n tsom python=3.9
conda activate tsom
conda install -c pytorch pytorch=2.1.1 torchvision=0.16.1 cudatoolkit==11.8
conda install pandas
conda install tqdm
conda install pillow
pip install tensorboard==2.4.1
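
As an optional sanity check, you can confirm that the pinned packages are importable and that CUDA is visible to PyTorch:

python -c "import torch, torchvision; print(torch.__version__, torchvision.__version__, 'CUDA available:', torch.cuda.is_available())"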

📦 Dataset Preparation

Prepare data

You can download the Seq-DeepFake dataset through this link: [Dataset]

After unzipping all the sub-files, the structure of the dataset should be as follows:

./
├── facial_attributes
│   ├── annotations
│   │   ├── train.csv
│   │   ├── test.csv
│   │   └── val.csv
│   └── images
│       ├── train
│       │   ├── Bangs-Eyeglasses-Smiling-Young
│       │   │   ├── xxxxxx.jpg
│       │   │   ...
│       │   │   └── xxxxxx.jpg
│       │   ...
│       │   ├── Young-Smiling-Eyeglasses
│       │   │   ├── xxxxxx.jpg
│       │   │   ...
│       │   │   └── xxxxxx.jpg
│       │   └── original
│       │       ├── xxxxxx.jpg
│       │       ...
│       │       └── xxxxxx.jpg
│       ├── test
│       │   % the same structure as in train
│       └── val
│           % the same structure as in train
└── facial_components
    ...
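
As an optional sanity check, you can verify that the annotation files and image splits are in place after unzipping. The dataset root below is a placeholder to replace with your own path (the same one you will later pass as --DATA_DIR):

# Placeholder root; adjust to where you unzipped the Seq-DeepFake dataset.
DATA_ROOT=/path/to/seq_deepfake
for task in facial_attributes facial_components; do
  for split in train test val; do
    [ -f "$DATA_ROOT/$task/annotations/$split.csv" ] || echo "missing: $task/annotations/$split.csv"
    [ -d "$DATA_ROOT/$task/images/$split" ] || echo "missing: $task/images/$split"
  done
done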

🚀 Training

Modify train.sh and run:

sh train.sh

Please refer to the following descriptions of the main arguments:

--LOGNAME: Name of your project.
--CONFIG: Path to the network and optimization configuration file.
--DATA_DIR: Directory of the downloaded dataset.
--DATASET_NAME: Name of the selected manipulation type. Choose from 'facial_components' and 'facial_attributes'.
--RESULTS_DIR: Directory to save logs and checkpoints.

You can change the network and optimization configurations by adding new configuration files under the directory ./configs/.
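
As an illustration, a minimal train.sh might look like the sketch below. The entry-point name train.py and the concrete values are assumptions to adapt to your setup; the argument names are the ones listed above:

# Hypothetical sketch of train.sh; adapt the values to your setup.
python train.py \
    --LOGNAME my_experiment \
    --CONFIG ./configs/r34.json \
    --DATA_DIR /path/to/seq_deepfake \
    --DATASET_NAME facial_components \
    --RESULTS_DIR ./results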

We also provide a Slurm script that supports multi-GPU training:

sh train_slurm.sh

where PARTITION and NODE should be modified according to your own environment. The number of GPUs to be used can be set through the NUM_GPU argument.
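
For reference, the Slurm launch inside train_slurm.sh could be wired roughly as in the sketch below; the srun flags are standard Slurm options, while the script name and example values are assumptions:

# Hypothetical sketch of train_slurm.sh; replace the placeholders with your cluster settings.
PARTITION=your_partition
NODE=your_node
NUM_GPU=4

srun -p $PARTITION -w $NODE --gres=gpu:$NUM_GPU \
    python train.py \
    --LOGNAME my_experiment \
    --CONFIG ./configs/r34.json \
    --DATA_DIR /path/to/seq_deepfake \
    --DATASET_NAME facial_components \
    --RESULTS_DIR ./results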

🏃 Testing

Modify test.sh and run:

sh test.sh

For the arguments in test.sh, please refer to the training instructions above, plus the following ones (a filled-in example follows the list):

TEST_TYPE: The evaluation metric to use. Choose from 'fixed' and 'adaptive'.
LOG_NAME: Should be set to the log name of the trained checkpoint you want to test.
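
For example, a test.sh that evaluates a trained facial_components model under the fixed metric could look like this sketch; test.py, the flag syntax, and the concrete values are assumptions:

# Hypothetical sketch of test.sh; adapt the values to your trained checkpoint.
python test.py \
    --CONFIG ./configs/r34.json \
    --DATA_DIR /path/to/seq_deepfake \
    --DATASET_NAME facial_components \
    --RESULTS_DIR ./results \
    --TEST_TYPE fixed \
    --LOG_NAME my_experiment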

We also provide a Slurm script for testing:

sh test_slurm.sh

🏋️ Pretrained Models

We also provide the pretrained models.

Model          | Description
pretrained-r34 | Trained on facial_components and facial_attributes with a resnet34 backbone.
pretrained-r50 | Trained on facial_components and facial_attributes with a resnet50 backbone.

To try the pretrained checkpoints, please:

  1. Download the checkpoints from the links in the table, unzip them, and put them under the ./results folder with the following structure:

    results
    ├── resnet34
    │    ├── facial_attributes
    │    │   └── test
    │    │       └── snapshots
    │    │           ├── best_model_adaptive.pt
    │    │           └── best_model_fixed.pt
    │    └── facial_components
    │        └── test
    │            └── snapshots
    │                ├── best_model_adaptive.pt
    │                └── best_model_fixed.pt
    └── resnet50
        ...
    
  2. In test.sh, modify DATA_DIR to the root of your Seq-DeepFake dataset, and set LOGNAME, CONFIG, and DATASET_NAME to 'test', ./configs/r34.json or ./configs/r50.json, and facial_components or facial_attributes, respectively (see the example after this list).

  3. Run test.sh.
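
For instance, to evaluate the pretrained resnet34 checkpoint on facial_components, the settings described in step 2 would translate to something like the following; the variable-style syntax is an assumption, and only the values are prescribed by step 2:

# Hypothetical settings in test.sh for the pretrained resnet34 model.
LOGNAME=test
CONFIG=./configs/r34.json
DATASET_NAME=facial_components
DATA_DIR=/path/to/seq_deepfake   # root of your Seq-DeepFake dataset
RESULTS_DIR=./results            # where the unzipped checkpoints were placed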

🎓 Citation

If you find this work useful for your research, please kindly cite our paper:

@inproceedings{li2025texture,
  title={Texture, Shape and Order Matter: A New Transformer Design for Sequential DeepFake Detection},
  author={Li, Yunfei and Li, Yuezun and Wang, Xin and Wu, Baoyuan and Zhou, Jiaran and Dong, Junyu},
  booktitle={2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
  pages={202--211},
  year={2025},
  organization={IEEE}
}
@article{li2025tsom,
  title={Texture, Shape, Order, and Relation Matter: A New Transformer Design for Sequential DeepFake Detection},
  author={Li, Yunfei and Li, Yuezun and Wu, Baoyuan and Dong, Junyu and Zhu, Guopu and Lyu, Siwei},
  journal={arXiv preprint arXiv:2404.13873},
  year={2025}
}
