
Fix async image load #2006


Merged
24 commits merged on Mar 15, 2022

Conversation

kroymann (Contributor)

Prerequisites

  • I have written a descriptive pull-request title
  • I have verified that there are no overlapping pull-requests open
  • I have verified that I am following the existing coding patterns and practice as demonstrated in the repository. These follow strict Stylecop rules 👮.
  • I have provided test coverage for my change (where applicable)

Description

This addresses the issue outlined in #1997 where Image.LoadAsync(stream) would incorrectly use synchronous IO if the stream is seekable. The fix is simply to remove the optimization that avoids prebuffering the stream into an in-memory buffer if the stream is seekable, and instead always prebuffer the stream. This ensures that async IO is always used to read the stream.
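For illustration, a minimal sketch of the prebuffering idea (hypothetical names only; this is not the actual ImageSharp implementation): the source stream is always copied into an in-memory buffer with async IO before decoding, so the decoder never issues synchronous reads against the caller's stream.

using System.IO;
using System.Threading;
using System.Threading.Tasks;

public static class PrebufferSketch
{
    // Hypothetical helper: copy the caller's stream into memory asynchronously,
    // then let the decoder read from the in-memory copy.
    public static async Task<MemoryStream> PrebufferAsync(Stream source, CancellationToken cancellationToken = default)
    {
        var buffer = new MemoryStream();

        // CopyToAsync uses async IO regardless of whether 'source' is seekable.
        await source.CopyToAsync(buffer, 81920, cancellationToken).ConfigureAwait(false);

        buffer.Position = 0;
        return buffer;
    }
}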

kroymann and others added 5 commits February 14, 2022 16:01
@JimBobSquarePants (Member)

Everything looks good so far from an implementation perspective. We'll need to figure out a strategy now to compare the new async performance with main.

@JimBobSquarePants (Member)

@antonfirsov I just pushed a change to ChunkedMemoryStream that allows the size of the pooled buffer chunks to scale with the total allocation.

Allocation block sizes follow this scale:

LargeChunks = Min(4MB, MemoryAllocator.BufferCapacityInBytes())
LargeChunkThreshold = Min(1MB, LargeChunks / 4)
SmallChunks = Min(128KB, LargeChunks / 32)

Once the total allocation within the stream hits our threshold we immediately switch to allocating larger chunk sizes.

I chose these values for the following reasons:

  • Small @ 128KB because that's the default chunk size RecyclableMemoryStream uses.
  • Threshold @ 1MB because that's the threshold of the ArrayPool. I was tempted to go for 512KB here.
  • Large @ 4MB because that matches the best-performing pool size for our current allocator.

I'm not precious about any of these values, so if you feel they should change then please suggest alternatives.
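For illustration, a minimal C# sketch of those scaling rules (GetChunkSizes and the bufferCapacityInBytes parameter are illustrative names, not the actual ChunkedMemoryStream members; the parameter stands in for MemoryAllocator.BufferCapacityInBytes()):

using System;

public static class ChunkScaleSketch
{
    public static (int SmallChunks, int LargeChunks, int LargeChunkThreshold) GetChunkSizes(int bufferCapacityInBytes)
    {
        // LargeChunks = Min(4MB, MemoryAllocator.BufferCapacityInBytes())
        int largeChunks = Math.Min(4 * 1024 * 1024, bufferCapacityInBytes);

        // LargeChunkThreshold = Min(1MB, LargeChunks / 4)
        int largeChunkThreshold = Math.Min(1024 * 1024, largeChunks / 4);

        // SmallChunks = Min(128KB, LargeChunks / 32)
        int smallChunks = Math.Min(128 * 1024, largeChunks / 32);

        return (smallChunks, largeChunks, largeChunkThreshold);
    }
}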

@JimBobSquarePants (Member) left a comment


I'm approving this but want a second opinion before merging.

@antonfirsov (Member)

antonfirsov commented Feb 26, 2022

I want to take a look next week. Quite distracted from everything programming-related thanks to the ongoing Fourth Reich experiment on my continent, right in the neighborhood.

@JimBobSquarePants (Member)

Of course mate. Absolute horror show over there!

@antonfirsov (Member)

antonfirsov commented Mar 13, 2022

@kroymann can you please give me permission to push to your branch? (I thought this was always allowed for PR branches from contributors.)
I want to push my load test changes:
antonfirsov@63e897e

As an alternative, feel free to cherry pick it.

@antonfirsov (Member)

antonfirsov commented Mar 13, 2022

I did before/after runs of asynchronous load testing:

  • It looks like I can repro the thread pool starvation on main: the synchronous parallel run finishes in about 6.3 seconds on my 10-core machine, while the async version needs ~9.2 seconds.
  • The PR seems to fix this nicely, bringing the async execution time down close to the synchronous time.
  • Unfortunately, there is a significant ~25% memory regression.

We should try to address it. I'll take a quick look at the code changes now.

Before: [benchmark screenshot]

After: [benchmark screenshot]
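For reference, a hedged sketch of the kind of parallel load test being compared here (this is not the actual test in antonfirsov@63e897e; the type and method names are illustrative):

using System;
using System.Collections.Generic;
using System.Diagnostics;
using System.IO;
using System.Threading.Tasks;
using SixLabors.ImageSharp;

public static class LoadTestSketch
{
    // Decode every file 'iterations' times in parallel with Image.LoadAsync
    // and return the total elapsed time.
    public static async Task<TimeSpan> MeasureAsyncLoadAsync(string[] files, int iterations)
    {
        var stopwatch = Stopwatch.StartNew();
        var tasks = new List<Task>();

        for (int i = 0; i < iterations; i++)
        {
            foreach (string file in files)
            {
                tasks.Add(Task.Run(async () =>
                {
                    using FileStream stream = File.OpenRead(file);
                    using Image image = await Image.LoadAsync(stream);
                }));
            }
        }

        await Task.WhenAll(tasks);
        return stopwatch.Elapsed;
    }
}

A synchronous counterpart doing the same work with Image.Load inside Parallel.ForEach would give the baseline these numbers are compared against.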

@antonfirsov (Member) left a comment


OK, I have no immediate ideas for improving this.

If you guys both agree that 25% extra memory footprint is an acceptable price for correct async, I'm fine merging this. We can open an issue to track the sync/async memory gap.

Just please remember to merge my load test addition!

@antonfirsov (Member)

Btw @kroymann thanks for discovering the problem, and pushing through the fix, really great job!

@JimBobSquarePants (Member)

JimBobSquarePants commented Mar 14, 2022

@antonfirsov Your permissions issues were likely due to Git LFS issues with GitHub.

There's a workaround that should allow you to push to the fork.

rm .git/hooks/pre-push

I'm assuming the memory overheads come from my buffer allocation? Would a drop to 2MB segments be useful?

@kroymann (Contributor, Author)

@antonfirsov I think @JimBobSquarePants is right about the GitHub permissions, but I went ahead and cherry-picked your commit just to expedite things. Thank you for adding those tests!

@antonfirsov (Member)

antonfirsov commented Mar 14, 2022

> I'm assuming the memory overheads come from my buffer allocation? Would a drop to 2MB segments be useful?

2MB logical allocations would still lead to 4MB actual allocations, since the pool's buffers are uniform in size.

Actually, there might be a way to create a smarter buffer growing strategy.

This is how subsequent chunk sizes are allocated currently, if I understand it right:

128K -> 128K -> 128K -> 128K -> 128K -> 128K -> 128K -> 128K ->
4MB -> 4MB -> 4MB  -> ...

This stresses the 128K ArrayPool bucket unnecessarily and often leads to a wasteful allocation at the end.

Instead, I would try to do something like:

128K -> 128K -> 128K -> 128K ->
256K -> 256K -> 256K -> 256K ->
512K -> 512K -> 512K -> 512K ->
1MB -> 1MB -> 1MB -> 1MB ->
[skip 2MB]
4MB -> 4MB -> 4MB -> 4MB -> 4MB -> 4MB -> 4MB -> 4MB -> ...

This would keep most of the buffers rented from ArrayPool.Shared without stressing individual buckets too much and avoid wasting space. The unmanaged pool wouldn't be touched for images smaller than 7 MB.

@JimBobSquarePants (Member)

Yeah, that's how it's allocated currently. Shouldn't be too hard to update it to follow your pattern.

@antonfirsov (Member)

antonfirsov commented Mar 14, 2022

Here is a closed formula to get the i-th chunk size according to my logic, assuming you increment i each time you allocate a chunk:

private static int GetChunkSize(int i)
{
    const int _128K = 1 << 17; // 128 KiB
    const int _4M = 1 << 22;   // 4 MiB

    // Double the chunk size after every 4 chunks (128K, 256K, 512K, 1MB),
    // then jump straight to 4MB, skipping 2MB.
    return i < 16 ? _128K * (1 << (i / 4)) : _4M;
}
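As a quick sanity check (illustrative only, assuming it is run from within the same class), enumerating the first values of this formula reproduces the proposed schedule:

for (int i = 0; i < 20; i++)
{
    // Prints 128K x4, 256K x4, 512K x4, 1M x4, then 4M from i = 16 onward.
    Console.WriteLine($"chunk {i}: {GetChunkSize(i) / 1024} KiB");
}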

Tomorrow is the last day I have access to my benchmark machine to redo the benchmarks; I will be away for two weeks afterwards.
PS: I just realized that tomorrow has already started in Australia 😆

@JimBobSquarePants (Member)

It's always tomorrow in Aus!

That's the changes pushed. I added a check to ensure we always use the minimum of the allocator capacity and the calculated size, in case we change things in the future or someone implements their own allocator.

@antonfirsov (Member)

antonfirsov commented Mar 14, 2022

Looks like there is a slight improvement. The total memory gap seems to be around 20% now.

[benchmark screenshot]

@antonfirsov (Member) left a comment


LGTM, thanks for the patience!

Co-authored-by: Anton Firszov <antonfir@gmail.com>
Labels
breaking Signifies a binary breaking change.