FFT system module and new audiovisual demo #38

cklosters · 2024-08-13T11:37:49Z

@lshoek: Moving this PR from private to public so @stijnvanbeek can review these changes as well.

From the original PR:

Module that wraps Kiss FFT for NAP 0.7.
Implements real-optimized FFTs (positive half-spectrum: nfft/2+1 complex frequency bins).
New audiovisual demo that demonstrates various aspects of new FFT possibilities (and compute & naprenderadvanced)
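
For readers unfamiliar with the "positive half-spectrum" layout mentioned above, here is a minimal sketch. It is not the module's code and does not call Kiss FFT; `realDft` is a hypothetical, naive O(n^2) helper that only illustrates why a real-valued input of length nfft yields nfft/2+1 non-redundant complex bins (the remaining bins are complex conjugates of these).

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

// Naive real-input DFT, for illustration only (hypothetical helper, not part of napfft).
// Returns only the non-redundant bins 0..nfft/2 of the spectrum.
std::vector<std::complex<float>> realDft(const std::vector<float>& samples)
{
	assert(!samples.empty());
	const std::size_t nfft = samples.size();
	const std::size_t bins = nfft / 2 + 1;
	const double two_pi = 6.283185307179586;

	std::vector<std::complex<float>> spectrum(bins);
	for (std::size_t k = 0; k < bins; ++k)
	{
		// Correlate the signal with a complex sinusoid at bin frequency k
		std::complex<double> sum(0.0, 0.0);
		for (std::size_t n = 0; n < nfft; ++n)
		{
			const double phase = -two_pi * double(k) * double(n) / double(nfft);
			sum += double(samples[n]) * std::complex<double>(std::cos(phase), std::sin(phase));
		}
		spectrum[k] = std::complex<float>(float(sum.real()), float(sum.imag()));
	}
	return spectrum;
}
```

For an 8-sample input this returns 5 bins; a cosine with exactly one period over those 8 samples puts its energy in bin 1.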

cklosters

I was implementing some of my review changes when I noticed the following in Debug: there is a noticeable delay between the two output channels in the audiovisual demo. This is less apparent in Release mode but still there. If I change the number of outputs to 1 using the audio service configuration, I don't hear any delay. Is this an output thread / sync bug?

Note that I made sure it's not related to recent audio / threading changes: the issue persists in both the old and the new version of NAP. When I don't connect the FFTNode inside the FFTAudioNodeComponentInstance the lag is gone:

//mFFTNode->mInput.connect(*mInput->getOutputForChannel(mResource->mChannel));

This (from what I can tell) shows that the processing step in the audio thread introduces a noticeable lag on the channel selected for FFT analysis. When I introduce a second FFTAudioNode for channel 1 the delay is gone, because the latency introduced on that channel by the FFT node is the same. I profiled the latency, which is not related to the mutex - the copy of the buffer, in debug, is enough to hear the delay.

	void FFTBuffer::supply(const std::vector<float>& samples)
	{
		// Copy samples
		{
			std::lock_guard<std::mutex> lock(mSampleBufferMutex);
			mSampleBufferA = samples;
		}

		// TODO: Use notification to transform data on different thread!
		mDirty = true;
	}

@stijnvanbeek maybe you can pitch in on what to do here?

@lshoek: I implemented many of my suggested (review) changes and optimized the threaded copy of the buffer by introducing an intermediate buffer that the original samples are moved into before being processed: b14bd70. I recommend going over my changes for validation.

cklosters · 2024-08-13T11:41:49Z

demos/audiovisual/src/audiovisualapp.cpp

+						if (resource == nullptr)
+							continue;
+
+						if (resource->mID == "RenderStars")


I thought we had something in place for rendering cube-maps on initialization. Why this verbose render step on frame 1 here? Also: getObjects() should not be used. It is considered bad practice because it could return recently replaced (not active) targets, which in this case is extra bad because we're only doing it on frame one.

cklosters · 2024-08-13T11:42:36Z

demos/audiovisual/module/src/renderaudioroadcomponent.cpp

+
+namespace nap
+{
+	bool RenderAudioRoadComponentInstance::init(utility::ErrorState& errorState)


Isn't it better to simply create a new render audio road component, instead of deriving from renderable mesh component? Since you override the draw method and require a specific mesh type.

stijnvanbeek · 2024-08-27T10:49:16Z

Very nice project and module! :)

I would consider (optionally) performing the FFT within the audio thread (in the FFTNode::process method) and also introducing a reverse FFT to allow for frequency domain processing of the audio. It would be cool to be able to insert an FFTProcess polymorphically that performs any frequency domain processing. Maybe we can cooperate on this. :)

About the FFTBuffer::supply() method and the perceived delay:
I suspect that this method causes occasional CPU spikes on the audio thread (especially in debug mode) which cause the audio callback to be "late". This should normally just cause little clicks and pops and not a delay between different channels though, because the audio callback processes all the channels at the same time. So I am a bit puzzled about that.

It is good to keep in mind though that both the mutex lock and the deep copy of the buffer in the supply() method are bad practice on audio threads:

the deep copy might result in deallocating the old contents of the destination buffer and reallocating memory to hold the new copy! Pre-allocating the vector in the ctor and copying the contents using an ordinary for loop might help.
as Coen says, the mutex lock in itself does not cause a delay as long as the transform() method is not running at the same time on the other thread, which could be the pitfall here. The processing in the transform() method causes the audio callback to stall during the mutex lock.
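
The pre-allocation suggestion in the first point could look roughly like the sketch below. `PreallocatedSampleBuffer` is a hypothetical stand-in, not the actual FFTBuffer: it allocates once in the constructor and afterwards only copies into existing storage, so supply() never resizes the vector. It deliberately leaves out the synchronization discussed in the second point.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the sample storage inside FFTBuffer.
class PreallocatedSampleBuffer
{
public:
	// Allocate once, up front, outside the audio callback
	explicit PreallocatedSampleBuffer(std::size_t maxSamples) : mSamples(maxSamples, 0.0f) { }

	// Copies at most maxSamples samples into the existing storage.
	// The vector is never resized here, so this call does not allocate.
	// Returns the number of samples actually copied.
	std::size_t supply(const float* samples, std::size_t count)
	{
		assert(samples != nullptr || count == 0);
		mCount = std::min(count, mSamples.size());
		std::copy(samples, samples + mCount, mSamples.begin());
		return mCount;
	}

	std::size_t size() const { return mCount; }
	const float* data() const { return mSamples.data(); }

private:
	std::vector<float> mSamples;	// fixed-size storage
	std::size_t mCount = 0;			// number of valid samples
};
```

std::copy is used instead of a hand-written loop; for trivially copyable types such as float, standard library implementations typically lower it to a bulk memory move.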

The solution could be to get rid of the mutex and the buffer copying and use the moodycamel single-producer, single-consumer queue to queue floats from the audio thread and dequeue them in the FFTBuffer::transform() function.
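
To make the queue idea concrete, here is a minimal sketch of a bounded single-producer, single-consumer ring buffer. It is a stand-in written with std::atomic, not the moodycamel library and not its API; `SpscFloatQueue`, `tryPush` and `tryPop` are hypothetical names. The point it illustrates is that neither side ever blocks or allocates after construction.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical bounded SPSC queue: exactly one producer thread and one consumer thread.
class SpscFloatQueue
{
public:
	// One slot stays empty to distinguish "full" from "empty"
	explicit SpscFloatQueue(std::size_t capacity) : mBuffer(capacity + 1)
	{
		assert(capacity > 0);
	}

	// Producer side (audio thread): returns false instead of blocking when full
	bool tryPush(float value)
	{
		const std::size_t head = mHead.load(std::memory_order_relaxed);
		const std::size_t next = (head + 1) % mBuffer.size();
		if (next == mTail.load(std::memory_order_acquire))
			return false;
		mBuffer[head] = value;
		mHead.store(next, std::memory_order_release);
		return true;
	}

	// Consumer side (e.g. the thread that calls transform()): returns false when empty
	bool tryPop(float& value)
	{
		const std::size_t tail = mTail.load(std::memory_order_relaxed);
		if (tail == mHead.load(std::memory_order_acquire))
			return false;
		value = mBuffer[tail];
		mTail.store((tail + 1) % mBuffer.size(), std::memory_order_release);
		return true;
	}

private:
	std::vector<float> mBuffer;
	std::atomic<std::size_t> mHead{0};	// written by the producer only
	std::atomic<std::size_t> mTail{0};	// written by the consumer only
};
```

When the queue is full the producer drops the sample rather than waiting, which is usually acceptable for analysis that only feeds visuals.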

This talk on realtime audio processing in C++ is nice to watch:
https://www.youtube.com/watch?v=boPEO2auJj4

cklosters · 2024-09-16T08:43:55Z

> This should normally just cause little clicks and pops and not a delay between different channels though, because the audio callback processes all the channels at the same time. So I am a bit puzzled about that.

There are no clicks and pops, only a (very) noticeable delay.

> the deep copy might result in deallocating the old contents of the destination buffer and reallocating memory to hold the new copy! Pre-allocating the vector in the ctor and copying the contents using an ordinary for loop might help.

This is rather far-fetched. The vector (on the stack) re-allocates memory only when the new size is greater than the old capacity. Considering the stream is relatively stable, this occurs only at the beginning. Without profiling, I am dismissing this statement and going with what the standard tells us. A memcpy (provided the buffer is large enough) is faster than copying items one by one. I will profile this to confirm.
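
The capacity argument can be checked with a small sketch. `assignWithinCapacity` and `AssignReport` are hypothetical helpers, not part of the PR. They assign one vector to another and report whether the destination kept its storage. Reuse of storage when the source fits is what the common standard library implementations do; treat it as observed behaviour rather than a guarantee, and note that it says nothing about the measured latency.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical report type: what happened to the destination's storage
struct AssignReport
{
	bool sameStorage;
	std::size_t capacityBefore;
	std::size_t capacityAfter;
};

// Copy-assigns source to destination and records whether the storage was reused.
// Precondition: the source fits in the destination's current capacity.
AssignReport assignWithinCapacity(std::vector<float>& destination, const std::vector<float>& source)
{
	assert(source.size() <= destination.capacity());
	const float* before = destination.data();
	const std::size_t capacity_before = destination.capacity();
	destination = source;
	return { destination.data() == before, capacity_before, destination.capacity() };
}
```

Calling it with a 512-sample destination and a 256-sample source should report the same storage and an unchanged capacity on those implementations.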

> as Coen says, the mutex lock in itself does not cause a delay as long as the transform() method is not running at the same time on the other thread, which could be the pitfall here. The processing in the transform() method causes the audio callback to stall during the mutex lock.

Removing the lock has no impact on the problem, which makes it irrelevant to the latency issue described above. Also: you could (should) use std::mutex::try_lock instead of a regular lock, to avoid stalling the audio thread. This is preferred because the FFT analysis can run at a lower frequency: it's generally used for visual elements and feedback, and is therefore better suited to run on the main thread (or optionally a different thread), where rendering takes more time than processing (update).
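
A minimal sketch of the try_lock suggestion, under the assumption that dropping an occasional block of samples is acceptable for analysis. `TryLockSampleBuffer` and `consume` are hypothetical names, not the module's FFTBuffer API. The audio thread never waits: if the consumer holds the mutex, the block is skipped.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical buffer: the producer uses try_lock, the consumer may block
class TryLockSampleBuffer
{
public:
	explicit TryLockSampleBuffer(std::size_t maxSamples)
	{
		assert(maxSamples > 0);
		mSamples.reserve(maxSamples);
	}

	// Audio thread: returns false (block skipped) when the lock is not acquired
	bool supply(const std::vector<float>& samples)
	{
		std::unique_lock<std::mutex> lock(mMutex, std::try_to_lock);
		if (!lock.owns_lock())
			return false;
		mSamples = samples;
		mDirty = true;
		return true;
	}

	// Consumer thread: a regular lock is fine here; returns false when there is no new data
	bool consume(std::vector<float>& out)
	{
		std::lock_guard<std::mutex> lock(mMutex);
		if (!mDirty)
			return false;
		out = mSamples;
		mDirty = false;
		return true;
	}

private:
	std::mutex mMutex;
	std::vector<float> mSamples;
	bool mDirty = false;	// guarded by mMutex
};
```

The consumer should keep its critical section short (copy out, then transform outside the lock), otherwise the audio thread skips more blocks than necessary.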

> The solution could be to get rid of the mutex and the buffer copying and use the moodycamel single-producer, single-consumer queue to queue floats from the audio thread and dequeue them in the FFTBuffer::transform() function.

Well, considering the above I'm not sure this will solve the issue - my suspicion is that something else is not quite right. I find it hard to believe that a single memcpy introduces such an audible lag.

lshoek · 2024-09-16T21:24:18Z

Just letting you know I've had another look at this just now, revamping a dependent project now with a Motu M2 audio interface. Addressing issues as I go and will proceed this week.

lshoek added 30 commits November 29, 2023 18:54

add napfft module and basic audiovisual demo

05bf7bf

fft audio node fixes - demo improvement shows distinct frequency bins

8201333

compile kissfft from source

38e8668

add mid frequency measurement to audiovisual demo

e82bebd

safekeeping fft on polar coordinates shader

84a6563

audiovisual demonstrate mesh normals compute shader update

25e4155

separate mesh compute from rendering - add sky - make cool road thing

0649d0a

Merge branch 'naprenderadvanced' into napfft

46c646c

three rotational dof - fix compute shader invocations - parameterization

25c7bc6

Merge branch 'naprenderadvanced' into napfft

789a6f5

Merge branch 'naprenderadvanced' into napfft

6072bce

update audiovisual demo to use renderadvanced light system

14088b6

Merge branch 'naprenderadvanced' into napfft

475f629

Merge branch 'naprenderadvanced' into napfft

55f28ae

prerender stars cube map and render reflection in road

16dc899

Merge branch 'naprenderadvanced' into napfft

e82b738

remove gridfillpolicy

ae01b11

fix plane artifacts - add dof and chroma vfx - rename fft to audioroad

ea55ad8

Merge branch 'naprenderadvanced' of github.com:naivisoftware/nap into napfft

bba0c40

update to napportaudio

9622667

buffer size check on init - improve dof

7bf301d

rerender cube map when reinitialized after reload

2230a07

audiovisual demo cleanup - renderdofcomponent improvement

aa94bd0

fix depth of field component - document demo

1e0d37a

Merge branch '0.7' of github.com:naivisoftware/nap into napfft

bfff21d

audiovisual demo documentation

cba12e6

Merge branch 'naprenderadvanced' of github.com:naivisoftware/nap into napfft

2303a98

Merge branch 'naprenderadvanced' into napfft

9145a9d

fix compilation after render advanced update

3e4b85e

restore reflection effect in audiovisual demo

be204f4

cklosters and others added 11 commits July 22, 2024 16:13

fix compilation GCC

d5c1bc4

Merge remote-tracking branch 'origin/napfft' into napfft

3c8d28d

fix compilation GCC

8a7a6fb

remove not used (not applied) GPL license

973404a

Merge branch 'main' into napfft

13c9c9f

change default render index to 1

254009a

change defaults

d466d7a

First batch of changes based on review

60b5d19

Second batch of review changes

e0d083c

don't perform dirty check and make dirty flag atomic

70c265f

simplify & optimize threaded sample data transfer

b14bd70

cklosters requested review from lshoek and stijnvanbeek August 13, 2024 11:37

Third batch of review changes

3b2da12

cklosters added the audio, rendering and enhancement labels Aug 13, 2024

cklosters and others added 4 commits August 13, 2024 14:24

add MPL license

27dbd02

fix gcc

647f527

Merge branch 'main' into napfft

ee3df1d

Merge branch 'main' into napfft

59b9ca0

Merge branch 'main' into napfft

9673a60

fix formatted buffer zero padding using zero fill instead of clear

a50d330

lshoek added 2 commits September 23, 2024 23:12

fix error message leak in portaudioservice

23702a5

Merge branch 'main' into napfft

b1deff0

lshoek mentioned this pull request Sep 24, 2024

Enable real time scheduling (ALSA) #54

Open
