Envelope triggered NaN #3408

zonkmachine · 2017-03-06T11:27:10Z

Edit: Brief intro

We produce NaN.
NaN is triggered in instrument envelopes. Specifically by changing release time while playing.
This may only be during the first loop.
NaN introduced here. 9e1cdd0
An uninitialized variable?

To reproduce.

Load test project:envelopeNaN.mmp.zip
Play until loop. Sound disappears? If yes, proof. If no, reload project and try again.

Verbose intro

The sound went out, NaN style, when I tweaked the decay on an instrument while the song was playing. I set out to see if I could pinpoint the source and I eventually managed to find a way to reproduce it reliably enought to be able to bisect this and the bug seem to have been introduced in 9e1cdd0

commit 9e1cdd0441bca0697f582be3b85d8e2d4c152c72
Author: Vesa <contact.diizy@nbl.fi>
Date:   Sun Aug 3 14:49:45 2014 +0300

    Start work on replacing/removing global locks

The test project is a simple melody with a bitinvader synt having the volume envelopes release knob controlled by an automation track. There is a reverb on the end for detection purposes as having a reverb seem to enormously aggravate the issue. The NaN is however introduced before the effect chain.
If you play this track the sound may go silent somewhere during the first loop but if it makes it to the second you're safe and need to reopen the project and repeat 'play' until it eventually crashes. For me it will go silent at least one time out of four. Since the issue seem to clear itself up after the first run, I suspect an uninitialised variable somewhere but so far I haven't managed to fix this. We divide with zero in the envelope which I set out to 'fix' in issue #3381 but fixing that doesn't affect this issue at all. I have only seen this with the release button and I think only on the first loop.
If you introduce an assert( ! isnanf( some buffer ) ); in the input of the FX chain you will see that the NaN is detected on either:

The first loop cycle on a project launched from the command line (or automatically from last session).
Immediately on pressing play on a project launched from the gui.

This is related to #1048 and #3313 . I think this one should be fixed first as it's introduced first in the sound chain.

Test project: envelopeNaN.mmp.zip

The text was updated successfully, but these errors were encountered:

michaelgregorius · 2017-03-07T18:00:51Z

@zonkmachine, I am not able to reproduce the problem. This might be the case because I don't have the zita reverb.

However, you could try to trap the critical floating point exceptions to find out the location where the NaN is introduced. Here's some example code that I put together from various source on the net:

#include <fenv.h> // For feenableexcept
#include <execinfo.h> // For backtrace and backtrace_symbols_fd
#include <unistd.h> // For STDERR_FILENO
#include <csignal> // To register the signal handler

#include <iostream>


void signalHandler( int signum ) {
   
    std::cout << "Interrupt signal (" << signum << ") received.\n";
    
    // Get a back trace
    void *array[10];
    size_t size;

    // get void*'s for all entries on the stack
    size = backtrace(array, 10);
    
    backtrace_symbols_fd(array, size, STDERR_FILENO);

    // cleanup and close up stuff here  
    // terminate program  

    exit(signum);
}

void triggerFloatingPointException()
{
    float a = 1., b = 0.;
    float c = a/b;
}

int main(void) {
    // Enable exceptions for certain floating point results
    feenableexcept(FE_INVALID   | 
                   FE_DIVBYZERO | 
                   FE_OVERFLOW  | 
                   FE_UNDERFLOW);
    
    // Install the trap handler
    // register signal SIGINT and signal handler  
    signal(SIGFPE, signalHandler);
    
    // Trigger an exception
    triggerFloatingPointException();
    
    return 0;
}

I think it should be clear from the comments what happens where. If I understand correctly the settings for the exceptions are inherited by threads so you should enable them very early in LMMS' main function.

Also for the example above the backtrace only contained addresses on my machine but I guess it might suffice to simply set a breakpoint in the signal handler (haven't tried if this really works though).

Good luck! :)

zonkmachine · 2017-03-07T18:58:14Z

Thanks for the debugging code!

@zonkmachine, I am not able to reproduce the problem. This might be the case because I don't have the zita reverb.

Oh, zita. A mistake that one. I used it as the other ones changed when bisecting. I'll change that to one we ship. Calf reverb and ReverbSC works fine for this! ;)

zonkmachine · 2017-03-07T19:04:45Z

Oh, zita. A mistake that one ... I'll change that to one we ship.

Fixed!

michaelgregorius · 2017-03-07T22:31:08Z

@zonkmachine, thanks for the updated file! I am now also able to reproduce the problem. I have applied the code that I have posted above to the main function of LMMS and got some results. As a first step I needed to fix some other places in the code that triggered floating point exceptions in a rather hackish way. However, this was only done to get to the heart of the problem.

The main problem is caused at the end of InstrumentSoundShaping::processAudioBuffer in the code of the following if clause:

if( m_envLfoParameters[Volume]->isUsed() )
{
	float volBuffer [frames];
	m_envLfoParameters[Volume]->fillLevel( volBuffer, envTotalFrames, envReleaseBegin, frames );

	for( fpp_t frame = 0; frame < frames; ++frame )
	{
		float vol_level = volBuffer[frame];
		vol_level = vol_level * vol_level;
		buffer[frame][0] = vol_level * buffer[frame][0];
		buffer[frame][1] = vol_level * buffer[frame][1];
	}
}

After the volume buffer volBuffer is allocated it is filled by the call to fillLevel in the next line. However, that method fills the complete buffer with the value -2.71678e+37 which in turn leads to problems when both these values are squared in the following line:

vol_level = vol_level * vol_level;

Multiplying these two large negative numbers will likely result in a large positive number that exceeds the maximum for float thus leading to NaNs.

At first I thought it was a problem that volBuffer in not initialized after allocation but the problem still persists if it is filled with zeroes right after the allocation. So the culprit should be m_envLfoParameters.

zonkmachine · 2017-03-07T23:10:00Z

@zonkmachine, thanks for the updated file! I am now also able to reproduce the problem. I have applied the code that I have posted above to the main function of LMMS and got some results.

Me to, it's brutal! :D
Leads me directly to #3381 and the divide by 0 issue for a starter.

zonkmachine · 2017-03-08T17:02:20Z

@michaelgregorius If you have time and energy to take this on I suggest you assign this issue to yourself and then I should probably close #3381. I put #3381 on the RC3 roadmap https://github.com/LMMS/lmms/projects/2 but it looks like it's going to take far more work than just assign some new minimum values to the knobs.

michaelgregorius · 2017-03-08T18:45:34Z

@zonkmachine, I think this is a potentially never ending story because I assume that all of the plugins contained with LMMS' might also cause floating point exceptions. So while everything might be nice with one project and set of plugins another project with another set might cause problems.

What do you think about adding a CMake option that conditionally adds and compiles the code above so that LMMS developers just have to flip a switch if they want to hunt down potential problems? It could perhaps also be used to hunt down the elusive problem described in #1048.

zonkmachine · 2017-03-08T18:57:20Z

@zonkmachine, I think this is a potentially never ending story because I assume that all of the plugins contained with LMMS' might also cause floating point exceptions. So while everything might be nice with one project and set of plugins another project with another set might cause problems.

Yes, but I'm focused on the envelope/soundshaping part here.

What do you think about adding a CMake option that conditionally adds and compiles the code above so that LMMS developers just have to flip a switch if they want to hunt down potential problems? It could perhaps also be used to hunt down the elusive problem described in #1048.

Yes to this. I had a similar plan with 28b30d6

PaulBatchelor · 2017-03-16T03:15:07Z

@zonkmachine hows this one coming? Is there anything I can do here to help out?

zonkmachine · 2017-04-11T19:46:26Z

hows this one coming?

This is a complex one and we've fixed some related issues:
#3428
#3425

I think this can be bumped to 1.3 but I will probably not have time to work on this.

michaelgregorius · 2017-07-07T20:39:23Z

I have added the pull request #3687 to provide an option to debug floating point exception. Please refer to the pull request for more details.

PhysSong · 2017-08-10T05:52:34Z

The problem is: void EnvelopeAndLfoParameters::updateSampleVars() have been thread unsafe since 9e1cdd0. It can be fixed by introducing a mutex to guarantee the thread safety.
Bring it back to 1.2.0 because the producing of NaN is a serious bug.

PhysSong · 2017-08-10T06:50:30Z

Should be fixed via #3761.

zonkmachine added bug core labels Mar 6, 2017

zonkmachine added this to the 1.2.0 milestone Mar 6, 2017

zonkmachine mentioned this issue Mar 8, 2017

dBV is actually mislabeled dBFS #3095

Merged

michaelgregorius mentioned this issue Mar 8, 2017

KILLER BUG: Inifinitely Loud Silence (ILS or rather NaN) #1048

Closed

zonkmachine self-assigned this Mar 15, 2017

zonkmachine removed their assignment Apr 11, 2017

zonkmachine modified the milestones: 1.3.0, 1.2.0 Apr 12, 2017

zonkmachine mentioned this issue Jul 7, 2017

NaN introduced with sample-exact fix in 1.0.91 #3685

Closed

zonkmachine mentioned this issue Jul 18, 2017

Allways remove infs/nans #3706

Merged

zonkmachine self-assigned this Jul 19, 2017

PhysSong modified the milestones: 1.2.0, 1.3.0 Aug 10, 2017

PhysSong self-assigned this Aug 10, 2017

PhysSong mentioned this issue Aug 10, 2017

Fix producing of NaN from Env/LFO parameter change while playing. #3761

Merged

PhysSong closed this as completed in #3761 Aug 12, 2017

zonkmachine mentioned this issue Oct 17, 2017

Fix destructor call in NotePlayHandleManager #3884

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Envelope triggered NaN #3408

Envelope triggered NaN #3408

zonkmachine commented Mar 6, 2017 •

edited

Loading

michaelgregorius commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

michaelgregorius commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

zonkmachine commented Mar 8, 2017

michaelgregorius commented Mar 8, 2017

zonkmachine commented Mar 8, 2017

PaulBatchelor commented Mar 16, 2017

zonkmachine commented Apr 11, 2017

michaelgregorius commented Jul 7, 2017

PhysSong commented Aug 10, 2017

PhysSong commented Aug 10, 2017

Envelope triggered NaN #3408

Envelope triggered NaN #3408

Comments

zonkmachine commented Mar 6, 2017 • edited Loading

michaelgregorius commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

michaelgregorius commented Mar 7, 2017

zonkmachine commented Mar 7, 2017

zonkmachine commented Mar 8, 2017

michaelgregorius commented Mar 8, 2017

zonkmachine commented Mar 8, 2017

PaulBatchelor commented Mar 16, 2017

zonkmachine commented Apr 11, 2017

michaelgregorius commented Jul 7, 2017

PhysSong commented Aug 10, 2017

PhysSong commented Aug 10, 2017

zonkmachine commented Mar 6, 2017 •

edited

Loading