This is the codebase for "AlignNet: A Unifying Approach to Audio-Visual Alignment", WACV 2020
Project page: https://jianrenw.github.io/AlignNet/
Paper: https://arxiv.org/abs/2002.05070
The dataset can be downloaded from this link.
python3 train.py --mode train --local_distortion --global_stretch --global_shift --experiment_name YOUR_EXPERIMENT_NAME --log_dir YOUR_LOG_DIRECTORY
Remove the flags --local_distortion
, --local_distortion
, --local_distortion
if you want to keep only some of the three manipulations.