Add simulate_rir_ism method for simulating RIR with Image Source Method #2644

nateanl · 2022-08-23T18:57:42Z

No description provided.

nateanl · 2022-08-29T17:37:15Z

Doc page: https://output.circle-artifacts.com/output/job/a26503b3-98a5-4272-8add-cacb33932fef/artifacts/0/docs/prototype.functional.html#simulate-rir-ism

nateanl · 2022-08-29T20:22:34Z

cc @fakufaku

torchaudio/prototype/functional/rir.py

torchaudio/csrc/build_rir.cpp

test/torchaudio_unittest/prototype/functional/functional_test_impl.py

mthrok · 2022-08-30T02:23:58Z

test/torchaudio_unittest/prototype/functional/functional_test_impl.py

+    @skipIfNoModule("pyroomacoustics")
+    @parameterized.expand([(2, 1), (3, 4)])
+    def test_simulate_rir_ism_single_band(self, D, channel):
+        """Test simulate_rir_ism when absorption coefficients are identical for all walls."""


I do not see the test logic checks the identity described in the docstring.
It is checking against pyroomacoustic, but the test logic does not check identity for walls.

Can you elaborate?

Yes. if e_absorption is just a floating-point value, then it means it is identical to all walls. That's why the room in pyroonacoustics is constructed with materials=pra.Material(e_absorption),.
The difference of this test and the one below is if e_absorption is a 2D Tensor, the multi-band logic in https://github.com/pytorch/audio/pull/2644/files#diff-bfbe08000c5dc05901af3bad9503693a99068426351748925a322990daed0cb3R217-R222 is applied. We need a separate test for that.

mthrok · 2022-08-30T02:25:12Z

torchaudio/csrc/build_rir.cpp

+  const int64_t ir_length = irs.size(3);
+  torch::Tensor rirs =
+      torch::zeros({num_band, num_mic, rir_length}, irs.dtype());
+  rirs.requires_grad_(true);


Does this work with torch.inference_mode()?

actually, what does it accomplish?
build_rir_impl manipulates the data of rirs by pointer, so autograd cannot track it.
In this case, the modification of requires_grad=True should be done by client code.

I tried to remove this line and run autograd_test, the test fails but was working with this line.

What happens if you move require_grad to after the call to build_rir_impl ?

It also passes the autograd test.

That suggests that the call to rirs.requires_grad_(true) can be outside of this function. Perhaps can you set it in test code?

mthrok · 2022-08-30T02:28:37Z

torchaudio/csrc/build_rir.cpp

+    torch::Tensor& filters) {
+  int64_t n = centers.size(0);
+  torch::Tensor new_bands = torch::zeros({n, 2}, centers.dtype());
+  new_bands.requires_grad_(true);


Similar to above, I double the effectiveness of requires_grad_ here.

mthrok · 2022-08-30T02:29:25Z

torchaudio/prototype/functional/rir.py

+    return img_loc, att
+
+
+def _hann(x: torch.Tensor, T: int):


Can you add comment on why we cannot use torch.hann_window?

sue. The reason is the original formula of image source method truncate the function values, any points outside [-T//2. T//2] is 0.

Actually, torch.hann_window can only sample the window function at integer points, which is suitable to create the window functions for STFT and such. In the image source model, we need to sample the continuous window function at non-integer points.

@nateanl This is not addressed. We need an explanation of why this function is required.

mthrok · 2022-10-04T18:04:19Z

torchaudio/prototype/functional/rir.py

+    return torch.special.sinc(n - delay) * _hann(n - delay, 2 * pad)
+
+
+def simulate_rir_ism(


Can you decorate this function so that if RIR is not compiled, it will fail immediately with easy-to-understand message?

mthrok · 2022-10-04T18:05:35Z

torchaudio/csrc/build_rir.cpp

+      "torchaudio::build_rir(Tensor irs, Tensor delay_i, int rir_length) -> Tensor",
+      &torchaudio::rir::build_rir);
+  m.def(
+      "torchaudio::make_filter(Tensor centers, float sample_rate, int n_fft) -> Tensor",


The function name is very generic to be in torchaudio namespace. Can you make it more descriptive / specific?

mthrok · 2022-10-04T18:06:17Z

torchaudio/csrc/build_rir.cpp

+using namespace torch::indexing;
+
+namespace torchaudio {
+namespace rir {


Since this file does not have corresponding header file, can you add anonymous namespace so that it is not referable from other C++ source?

mthrok · 2022-10-04T18:11:13Z

test/torchaudio_unittest/prototype/functional/functional_test_impl.py

@@ -1,9 +1,14 @@
 import numpy as np
+
+try:
+    import pyroomacoustics as pra


Please follow

audio/test/torchaudio_unittest/backend/soundfile/info_test.py

Lines 20 to 21 in fda00bf

if _mod_utils.is_module_available("soundfile"):

import soundfile

nateanl · 2022-11-22T15:11:05Z

After discussing with @fakufaku and @NicolasHug, the gradients of converting a float tensor to int tensor and using it as indices for another tensor may require an analytic formula. Before getting the formula with proof, I will disable the autograd test and focus on the numerical correctness in the forward method.

nateanl · 2022-11-30T23:05:50Z

will address the issue in a new PR #2880

Summary: replicate of #2644 Pull Request resolved: #2880 Reviewed By: mthrok Differential Revision: D41633911 Pulled By: nateanl fbshipit-source-id: 73cf145d75c389e996aafe96571ab86dc21f86e5

facebook-github-bot added the CLA Signed label Aug 23, 2022

nateanl force-pushed the rir branch from 7f25221 to 0ec7746 Compare August 29, 2022 16:32

nateanl marked this pull request as ready for review August 29, 2022 17:09

nateanl requested a review from a team August 29, 2022 17:09