AV-RIR: Audio-Visual Room Impulse Response Estimation (CVPR 2024)

The inference code of our AV-RIR without CRIP. We provide two versions of our network optimized for RIR estimation and Speech enhancement. We also provide test samples to run the inference code.

Requirements

Python 3.8+
Cuda 11.0+
PyTorch 1.10+
numpy
pygsound
wavefile
tqdm
gdown
scipy
soundfile
librosa

Trained Model and Test Data

To download the trained model and test data to the appropriate folder structure, run the following command.

source download.sh

RIR Estimation

To run RIR estimation inference code. Go to RIR_Estimation and run the following command. The output folder will be created with the outputs.

cd RIR_Estimation/
bash submit_autoencoder.sh --start 2
cd output/autoencoder/symAD_vctk_48000_hop300/test/rir/

Speech Enhancement

To run Speech Enhancement inference code. Go to Enhancement and run the following command. The output folder will be created with the outputs.

cd Enhancement/
bash submit_autoencoder.sh --start 2
cd output/autoencoder/symAD_vctk_48000_hop300/test/clean/

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Enhancement		Enhancement
Material_Code		Material_Code
RIR_Estimation		RIR_Estimation
CVPR_2024_Anton-3.pdf		CVPR_2024_Anton-3.pdf
README.md		README.md
download.sh		download.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AV-RIR: Audio-Visual Room Impulse Response Estimation (CVPR 2024)

Requirements

Trained Model and Test Data

RIR Estimation

Speech Enhancement

About

Releases

Packages

Languages

anton-jeran/AV-RIR

Folders and files

Latest commit

History

Repository files navigation

AV-RIR: Audio-Visual Room Impulse Response Estimation (CVPR 2024)

Requirements

Trained Model and Test Data

RIR Estimation

Speech Enhancement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages