Temporal Visual Saliency Transformer (TempVST)

Installing the Conda Environment

To use this code, you need to install the required dependencies using Conda. Here's how to create a new Conda environment and install the dependencies:

Clone this repository to your local machine.
Open a terminal or command prompt and navigate to the root directory of the cloned repository.
Create a new Conda environment using the following command:

conda env create -f tempvst_env.yml

Activate the new Conda environment using the following command:

conda activate tempvst_env

Training, Testing, and Evaluation

Run python train_test_eval.py --Training True --Testing True --Evaluation True for training, testing, and evaluation. The predictions will be in preds/ folder and the evaluation results will be in result.txt file.

Testing on Our Pretrained TempVST Model

Run python train_test_eval.py --Testing True --Evaluation True for testing and evaluation. The predictions will be in preds/ folder and the evaluation results will be in result.txt file.

Script Arguments Explanation

This repository contains a script with various command-line arguments that control the behavior of the script. Below is an explanation of each argument and its purpose:

Training and Testing Flags

--Training: (default: False) Set this flag to True if you want to perform training.
--Testing: (default: True) Set this flag to True if you want to perform testing.

Learning Rate and Training Parameters

--lr_decay_gamma: (default: 0.1) Learning rate decay factor.
--lr: (default: 1e-4) Initial learning rate.
--epochs: (default: 200) Number of training epochs.
--batch_size: (default: 4) Batch size for training.
--num_gpu: (default: 1) Number of GPUs to use.
--stepvalue1: (default: 30000) First step value for adjusting the learning rate.
--stepvalue2: (default: 45000) Second step value for adjusting the learning rate.
--trainset: (default: 'DHF1K') Training dataset name.
--data_root: Path to the data directory.
--img_size: (default: 224) Size of network input images.
--alternate: (default: 2) Subsampling factor.
--len_snippet: (default: 6) Length of video snippet.
--pretrained_model: (default: "80.7_T2T_ViT_t_14.pth.tar") Path to the pretrained model.

Loss Function Coefficients

You can adjust the coefficients of various loss functions using the following arguments:

--kldiv_coeff: (default: 1.0) Coefficient for KL Divergence loss.
--cc_coeff: (default: -1.0) Coefficient for CC loss.
--sim_coeff: (default: -1.0) Coefficient for Similarity loss.
--nss_coeff: (default: 1.0) Coefficient for NSS loss.
--nss_emlnet_coeff: (default: 1.0) Coefficient for NSS EMLNet loss.
--nss_norm_coeff: (default: 1.0) Coefficient for NSS Normalization loss.
--l1_coeff: (default: 1.0) Coefficient for L1 loss.

Additional Flags

Various additional flags can be set to control the inclusion of specific components:

--kldiv: (default: True) Set this flag to calculate KL Divergence.
--cc: (default: False) Set this flag to include CC loss.
--sim: (default: False) Set this flag to include Similarity loss.
--nss: (default: False) Set this flag to include NSS loss.
--nss_emlnet: (default: False) Set this flag to include NSS EMLNet loss.
--nss_norm: (default: False) Set this flag to include NSS Normalization loss.
--l1: (default: False) Set this flag to include L1 loss.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
80.7_T2T_ViT_t_14.pth.tar		80.7_T2T_ViT_t_14.pth.tar
Decoder.py		Decoder.py
README.md		README.md
TempVST.py		TempVST.py
TempVST_arch.jpg		TempVST_arch.jpg
Testing.py		Testing.py
Training.py		Training.py
Transformer.py		Transformer.py
augmentation_try.py		augmentation_try.py
dataset.py		dataset.py
dataset_2.py		dataset_2.py
dataset_3.py		dataset_3.py
dataset_4.py		dataset_4.py
loss.py		loss.py
rename_results.py		rename_results.py
t2t_vit.py		t2t_vit.py
tempvst_env.yml		tempvst_env.yml
token_performer.py		token_performer.py
token_transformer.py		token_transformer.py
train_test_eval.py		train_test_eval.py
transformer_block.py		transformer_block.py
transforms.py		transforms.py
utils.py		utils.py
video_transforms.py		video_transforms.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Temporal Visual Saliency Transformer (TempVST)

Installing the Conda Environment

Training, Testing, and Evaluation

Testing on Our Pretrained TempVST Model

Script Arguments Explanation

Training and Testing Flags

Learning Rate and Training Parameters

Loss Function Coefficients

Additional Flags

About

Uh oh!

Releases

Packages

Languages

nlazaridi/TempVST

Folders and files

Latest commit

History

Repository files navigation

Temporal Visual Saliency Transformer (TempVST)

Installing the Conda Environment

Training, Testing, and Evaluation

Testing on Our Pretrained TempVST Model

Script Arguments Explanation

Training and Testing Flags

Learning Rate and Training Parameters

Loss Function Coefficients

Additional Flags

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages