optimize the clear head method from SetData to a CS kernel #11

SnowWindSaveYou · 2022-09-15T04:57:25Z

it can improve the performance in most scenarios，
but still an limitation,
it can't work properly on a very large screen (e.g. 4K HUD 3840*2160) due to the buffer counts over the max thread group.
but it can be fixed by split thread size into many dispatches with an index offset.

happy-turtle · 2022-09-15T11:14:45Z

Thank you! Wow, that really is a huge performance boost.
Can we maybe prevent the maximum screen size by using a two (or later even three-dimensional) number of threads? And then offset by the second dimension. I will try to explain in code comments.

SnowWindSaveYou · 2022-09-15T11:33:32Z

Thank you! Wow, that really is a huge performance boost. Can we maybe prevent the maximum screen size by using a two (or later even three-dimensional) number of threads? And then offset by the second dimension. I will try to explain in code comments.

i fixed it by using a for loop with buffersize in compute shader.

SnowWindSaveYou · 2022-09-15T11:38:34Z

Assets/OrderIndependentTransparency/Shaders/LinkedListCreation.cginc

@@ -16,7 +16,7 @@ RWByteAddressBuffer StartOffsetBuffer : register(u2);

 void createFragmentEntry(float4 col, float3 pos, uint uCoverage) {
    //Retrieve current Pixel count and increase counter
-    uint uPixelCount = FLBuffer.IncrementCounter();
+    uint uPixelCount = FLBuffer.IncrementCounter()+1;


i leave first one empty for safer debug, otherwise it's easy to make a infinite loop and make my computer crash

happy-turtle · 2022-09-15T15:12:16Z

Ah nice, I would still like to try a two-dimensional number of threads though. It would remove those big numbers. I can look at it at the next opportunity

…educe thread group count

happy-turtle · 2022-09-18T12:09:08Z

Thanks a lot for this! The whole process is a lot more performant now. I added two-dimensional thread groups and did some cleanup to align more with the other shaders.

opt clear head method from SetData to a CS kernel

9bb1443

fix the problem on large screen

ba10cbf

SnowWindSaveYou commented Sep 15, 2022

View reviewed changes

use two-dimensional thread groups to support larger resolutions and r…

f921549

…educe thread group count

happy-turtle merged commit a590df9 into happy-turtle:main Sep 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

optimize the clear head method from SetData to a CS kernel #11

optimize the clear head method from SetData to a CS kernel #11

Uh oh!

SnowWindSaveYou commented Sep 15, 2022

Uh oh!

happy-turtle commented Sep 15, 2022

Uh oh!

SnowWindSaveYou commented Sep 15, 2022

Uh oh!

SnowWindSaveYou Sep 15, 2022

Uh oh!

happy-turtle commented Sep 15, 2022 •

edited

Loading

Uh oh!

happy-turtle commented Sep 18, 2022

Uh oh!

Uh oh!

optimize the clear head method from SetData to a CS kernel #11

optimize the clear head method from SetData to a CS kernel #11

Uh oh!

Conversation

SnowWindSaveYou commented Sep 15, 2022

Uh oh!

happy-turtle commented Sep 15, 2022

Uh oh!

SnowWindSaveYou commented Sep 15, 2022

Uh oh!

SnowWindSaveYou Sep 15, 2022

Choose a reason for hiding this comment

Uh oh!

happy-turtle commented Sep 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

happy-turtle commented Sep 18, 2022

Uh oh!

Uh oh!

happy-turtle commented Sep 15, 2022 •

edited

Loading