
CUDA Bug: volatile isParticle Flag necessary #538

Open
PrometheusPi opened this issue Aug 26, 2014 · 12 comments
Assignees
Labels
- affects latest release: a bug that affects the latest stable release
- backend: cuda: CUDA backend
- bug: a bug in the project's code
- component: core: in PIConGPU (core application)
- component: third party: third party libraries that are shipped and/or linked
Milestone

Comments

@PrometheusPi (Member)

While testing the Bunch example in 2D, I discovered the following bug, which also exists in 3D.

Not all particles get the given momentum and thus stay at their initial position.

This is clearly visible in 2D.
(figure: initnomomentum)
Some particles at the outer edge of the Gaussian blob do not move and stay behind as a halo.
In 3D this is not directly visible.

However, in BinEnergyElectrons.dat there are particles in the first bin (zero energy) at the first time step, in both 3D and 2D.

With the help of @psychocoderHPC, we saw that if we set isParticle in Particles.kernel to true, all particles are initialized correctly. Setting blockingKernel to on did not help, nor did changing typedef uint16_t lcellId_t; to typedef uint32_t lcellId_t; in frame_types.hpp.
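The failure mode can be sketched as follows; this is a minimal, hypothetical reconstruction (the kernel, the array names, and the multiMask flag are illustrative and not the actual PIConGPU source; only the volatile-on-the-flag idea comes from the issue):

```cuda
// Hypothetical kernel: initialize the momentum of every particle
// slot in a frame whose occupation flag is set.
__global__ void kernelInitMomentum(float3* momentum,
                                   const int* multiMask,
                                   float3 initMomentum,
                                   int numSlots)
{
    int slot = blockIdx.x * blockDim.x + threadIdx.x;
    if (slot >= numSlots)
        return;

    // Without `volatile`, nvcc (observed here with CUDA 5.5+) appears
    // to mis-optimize this flag read, so some slots are skipped and
    // those particles keep zero momentum. Marking the thread-local
    // copy volatile forces the compiler to materialize the load.
    volatile bool isParticle = (multiMask[slot] != 0);

    if (isParticle)
        momentum[slot] = initMomentum;
}
```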

The error does not occur for the KelvinHelmholtz example.

@PrometheusPi PrometheusPi added this to the Open Beta milestone Aug 26, 2014
@PrometheusPi (Member, Author)

I noticed that we have 128 cells in the y-direction overall and 32 cells in the y-direction per GPU. The black slices appear to span 1/4 of the GPU length in y; therefore, the error occurs for 8 cells at once.

@psychocoderHPC (Member)

I added a workaround that solves the problem.
I will check if I can create a minimal example to submit a bug report to the nvcc developers.

Please do not close this issue.

PrometheusPi added a commit that referenced this issue Sep 1, 2014
@PrometheusPi (Member, Author)

#539 is a workaround. The issue will stay open until a general solution for this problem is found.

Do you agree @psychocoderHPC and @ax3l ?

@ax3l ax3l changed the title Error in particle momentum initialization CUDA Bug: volatile isParticle Flag necessary Sep 30, 2014
@ax3l (Member)

ax3l commented Sep 30, 2014

The issue stays open for now; #539 implemented a workaround.

We have to write a minimal example to report the bug so it gets fixed in future versions of CUDA.

@PrometheusPi (Member, Author)

Is this solved by any means other than the workaround #539?

@ax3l (Member)

ax3l commented Nov 25, 2014

No, that was an auto-close due to the merge to master, and the issue should stay open (we changed its scope after we applied the fix).

@ax3l ax3l reopened this Nov 25, 2014
@ax3l ax3l added affects latest release a bug that affects the latest stable release component: core in PIConGPU (core application) labels Jan 5, 2015
@ax3l (Member)

ax3l commented Jan 5, 2015

The workaround was applied with #539 and, as far as we know, does not currently cause problems.

New scope of this issue:

Since the volatile flag should not be necessary, it looks like either a race condition in our code (that is circumvented by the flag) or a CUDA 5.5+ bug that we should write an example for.

@PrometheusPi (Member, Author)

What is the status of this issue?

@psychocoderHPC (Member)

It was solved with #539, but it is not clear whether the workaround can be removed. We should keep the issue open.

@ax3l ax3l added component: third party third party libraries that are shipped and/or linked backend: cuda CUDA backend labels Nov 14, 2018
@ax3l (Member)

ax3l commented Nov 14, 2018

We could test whether the error still occurs with CUDA 8+ when the workaround is removed; if it does not, it was fixed upstream in nvcc.

@sbastrakov (Member)

This and the other volatile usages might be explained by the libcu++ slides from SC19 shared by @ax3l. We need to check; there are not that many occurrences.
cc @psychocoderHPC.

@psychocoderHPC (Member)

@sbastrakov we mark a thread-local variable as volatile to break compiler optimizations. The example in the slides (please link when available) shows a different case: data guarded by an atomic variable, where you must call __threadfence to be sure that you are reading the latest version of the value.
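The guarded-data pattern described here can be sketched as follows (variable and function names are illustrative, not from the slides or PIConGPU; only __threadfence and the CUDA atomic intrinsics are real API):

```cuda
// One thread publishes a value, others wait on an atomic flag.
// The fences order the data write before the flag write (producer)
// and the flag read before the data read (consumer).
__device__ float payload;
__device__ int ready = 0;

__device__ void producer(float value)
{
    payload = value;
    __threadfence();        // make payload visible before the flag
    atomicExch(&ready, 1);  // publish
}

__device__ float consumer()
{
    while (atomicAdd(&ready, 0) == 0)  // atomic read of the flag
        ;                              // spin until published
    __threadfence();  // order the flag read before the data read
    return payload;
}
```

This differs from our volatile usage: here the fence guarantees cross-thread visibility, whereas the workaround only forces the compiler to emit a load for a thread-local flag.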
