Auto-tuning DotNetty batching + removing scheduler from batching system #4685

Aaronontheweb · 2020-12-22T23:21:08Z

Ported the FlushConsolidationHandler from Netty and used it to auto-tune batching inside Akka.Remote without having to set explicit thresholds and without having to rely on the IScheduler.

This accomplishes:

Lower idle CPU consumption than even #4678 ("Move DotNetty batching scheduling off of DotNetty STEE and onto HashedWheelTimer")
Significantly lower latencies on systems that don't write heavily
No need for users to manually tune their threshold settings in akka.remote.dot-netty.tcp.batching - this is now handled automatically by the FlushConsolidationHandler.

close #4636
close #4563

We still have some more work to do optimizing the DedicatedThreadPool to scale down automatically, but this eliminates most of the CPU noise coming from DotNetty.

Aaronontheweb · 2020-12-22T23:35:31Z

How It Works

The FlushConsolidationHandler works by batching flushes together rather than writes. In this patch we've changed the TcpAssociationHandle to always call IChannel.WriteAndFlushAsync, which creates a 1:1 correlation between writes and flushes.

The algorithm is designed to capture flushes that occur in rapid succession and group them together in order to lower the total number of system calls to the socket, which improves average throughput and decreases CPU utilization.

The flushes are batched together when:

The socket is currently performing a read or
The total number of pending flushes is less than DefaultExplicitFlushAfterFlushes, or the configured value if one is set (defaults to 30 in Akka.NET).

The flushes are allowed to pass and write out to the socket when:

The socket is writing but not reading (immediate write);
The total number of flushes is equal to DefaultExplicitFlushAfterFlushes; or
A flush has been scheduled onto the EventLoop without being cancelled.

In the third case, we don't use a time-based delay to flush the socket - instead the "flush" call is simply added to the same event queue where all of the read and write events are. It works identically to an actor's mailbox. All of the writes queued up prior to that event get flushed together.
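
The queue-based flush described above can be modeled in a few lines. This is a minimal sketch in Python, not the C# handler from this PR: ToyEventLoop, ToyFlushConsolidator and every member name are hypothetical, the "socket" is just a list, and read handling is left out so that only the flush counter and the queued flush task are shown.

```python
from collections import deque


class ToyEventLoop:
    """Single-threaded FIFO task queue standing in for a channel's event loop."""

    def __init__(self):
        self._tasks = deque()

    def execute(self, task):
        self._tasks.append(task)

    def run_until_idle(self):
        while self._tasks:
            self._tasks.popleft()()


class ToyFlushConsolidator:
    """Hypothetical model: flush at the limit, or when the queued flush task runs."""

    def __init__(self, loop, explicit_flush_after=30):
        self.loop = loop
        self.explicit_flush_after = explicit_flush_after
        self.pending_writes = []
        self.flush_pending_count = 0
        self.flush_scheduled = False
        self.socket_flushes = []  # one entry per simulated socket flush

    def write_and_flush(self, message):
        # Every write carries a flush request (the 1:1 correlation).
        self.pending_writes.append(message)
        self.flush_pending_count += 1
        if self.flush_pending_count == self.explicit_flush_after:
            self._flush_now()
        elif not self.flush_scheduled:
            # No timer: the flush is just another task at the back of the queue.
            self.flush_scheduled = True
            self.loop.execute(self._scheduled_flush)

    def _scheduled_flush(self):
        # Simplification: instead of cancelling the task after an early flush,
        # this model lets it run and do nothing when no flushes are pending.
        self.flush_scheduled = False
        if self.flush_pending_count > 0:
            self._flush_now()

    def _flush_now(self):
        self.socket_flushes.append(self.pending_writes)
        self.pending_writes = []
        self.flush_pending_count = 0


loop = ToyEventLoop()
handler = ToyFlushConsolidator(loop)
for i in range(3):
    loop.execute(lambda i=i: handler.write_and_flush(f"msg-{i}"))
loop.run_until_idle()
print(handler.socket_flushes)
```

In this model the first write enqueues the flush task behind the two writes that were already waiting, so all three messages leave in a single simulated flush.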

IgorFedchenko

LGTM - just left a minor note and a question about flushing while a read is in progress

IgorFedchenko · 2020-12-24T17:19:08Z

src/common.props

@@ -2,7 +2,7 @@
  <PropertyGroup>
    <Copyright>Copyright © 2013-2020 Akka.NET Team</Copyright>
    <Authors>Akka.NET Team</Authors>
-    <VersionPrefix>1.4.13</VersionPrefix>
+    <VersionPrefix>1.4.14</VersionPrefix>


I guess this should not be changed in this PR - it's just that build.cmd updated this file when it was run locally?

That's correct

IgorFedchenko · 2020-12-24T17:26:49Z

src/core/Akka.Remote/Transport/DotNetty/BatchWriter.cs

+                // we only need to flush if we reach the explicitFlushAfterFlushes limit.
+                if (++_flushPendingCount == ExplicitFlushAfterFlushes)
+                {
+                    FlushNow(context);


So when reading is complete we will flush in ChannelReadComplete anyway. But here we try to flush right away if too many flushes are pending? Are we able to flush while a read is in progress? Is this safe?

If a read is sitting in the channel pipeline right now, we assume that some data is being read by the current application and that it might, presumably, be used to produce a response right away. We're trying to batch those writes together into as few flushes as possible. When the read "completes" (we've finished reading all of the data currently inside the buffer), it's safe to flush any writes that are currently pending. This helps reduce latency in lower-traffic systems and increases throughput in higher-traffic systems.
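
To make the read-in-progress path concrete, here is a small model of the behaviour discussed in this thread. It is a sketch in Python under stated assumptions, not the BatchWriter.cs code: ReadAwareFlushConsolidator and its method names are hypothetical, the "socket" is a list, and writes made outside of a read are flushed immediately.

```python
class ReadAwareFlushConsolidator:
    """Hypothetical model of holding flushes back while a read is in progress."""

    def __init__(self, explicit_flush_after=30):
        self.explicit_flush_after = explicit_flush_after
        self.read_in_progress = False
        self.flush_pending_count = 0
        self.pending_writes = []
        self.socket_flushes = []  # one entry per simulated socket flush

    def channel_read(self, inbound, on_message):
        # A read has started; the application callback may write replies.
        self.read_in_progress = True
        on_message(inbound)

    def channel_read_complete(self):
        # Everything currently in the buffer has been read: flush what piled up.
        self.read_in_progress = False
        if self.flush_pending_count > 0:
            self._flush_now()

    def write_and_flush(self, message):
        self.pending_writes.append(message)
        if not self.read_in_progress:
            self._flush_now()  # writing but not reading: immediate write
            return
        # A read-complete flush is coming anyway, so only flush early at the limit.
        self.flush_pending_count += 1
        if self.flush_pending_count == self.explicit_flush_after:
            self._flush_now()

    def _flush_now(self):
        self.socket_flushes.append(self.pending_writes)
        self.pending_writes = []
        self.flush_pending_count = 0


handler = ReadAwareFlushConsolidator(explicit_flush_after=3)
for inbound in ["a", "b", "c", "d"]:
    handler.channel_read(inbound, lambda m: handler.write_and_flush(f"reply-to-{m}"))
handler.channel_read_complete()
print(handler.socket_flushes)
```

With the limit lowered to 3 for the demonstration, the first three replies are flushed early when the counter reaches the limit, and the fourth goes out when the read completes.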


Aaronontheweb added 6 commits December 22, 2020 17:13

implement FlushConsolidationHandler

128c10b

first complete flush consilidator implementation

796ea5f

styling

9cb131e

fixed batch writer specs

fc984ce

removed previous BatchWriter settings + handle

0acef24

removed unused batching settings from Akka.Remote

05b18ac

Aaronontheweb added akka-remote dotnetty labels Dec 22, 2020

Aaronontheweb mentioned this pull request Dec 22, 2020

Akka.Remote - exhaustion of TCP buffer after updating from 1.3.8 to 1.4.6 #4563

Closed

IgorFedchenko approved these changes Dec 24, 2020


Aaronontheweb merged commit b8e74e0 into akkadotnet:dev Dec 28, 2020

Aaronontheweb deleted the feature/FlushConsolidator branch December 28, 2020 18:33

Aaronontheweb mentioned this pull request Dec 28, 2020

Need to update Akka.Remote performance-tuning document as of Akka.NET v1.4.14 #4697

Closed

Aaronontheweb added a commit to Aaronontheweb/akka.net that referenced this pull request Dec 30, 2020

stubbing out performance documentation per akkadotnet#4685

6facb8f

Aaronontheweb added a commit to Aaronontheweb/akka.net that referenced this pull request Dec 30, 2020

close akkadotnet#4685

a5b2d87

Aaronontheweb added a commit that referenced this pull request Dec 30, 2020

Update Akka.Remote performance guidance for Akka.NET v1.4.14 (#4703)

5e744ff

* stubbing out performance documentation per #4685 * close #4685
