Improve Download Memory Usage #61

waahm7 · 2024-10-04T22:17:46Z

Description of changes:
Tokio.spawn doesn't respect the spawn order, which can result in us downloading the first num_concurrency parts in random order. For a workload of 5GB * 100 files, this can lead to very high memory usage, as seen in the diagram below. This PR refactors the exact part to be determined only once the task has been scheduled.

Uploads can also have a similar issue where we read too many parts into memory. To fix that, we will need to refactor our scheduler to be smarter so that we only read the part when we have the permit. (Created: #60)

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

aajtodd

What's this do to throughput for download? In particular the warmup/first few runs?

aajtodd · 2024-10-07T17:15:17Z

aws-s3-transfer-manager/src/operation/download.rs

@@ -129,18 +129,30 @@ fn handle_discovery_chunk(

 /// Download operation specific state
 #[derive(Debug)]
-pub(crate) struct DownloadState {}
+pub(crate) struct DownloadState {
+    current_seq: u64,


Can you not get away with an atomic here?

Thanks, updated to AtomicU64.

aajtodd · 2024-10-07T17:32:43Z

aws-s3-transfer-manager/src/operation/download.rs

    if let Some(stream) = initial_chunk {
+        let seq = handle.ctx.next_seq();


start_seq and seq aren't connected anywhere here, should probably set start_seq to seq instead of hard coded to 1

Thanks, we would have to set start_seq = seq+1. I have added a current_seq() function which will return 0 or 1 depending upon we called next_seq or not.

waahm7 · 2024-10-07T18:58:55Z

What's this do to throughput for download? In particular the warmup/first few runs?

It didn’t help much since we were only doing 125 out of 3840 parts out of order.

waahm7 added 10 commits October 1, 2024 15:11

print req spawn order

1eeb33f

initial hack

bd19162

print fix

bc29ab6

Merge branch 'main' into waqar/download-perf

7509c69

cleanup

4dec608

refactoring

a845c6e

fix

cac44a4

seq instead of part_number

ba93083

renames

2d96404

rename

9f246ce

waahm7 requested a review from a team as a code owner October 4, 2024 22:17

fmt

9f16242

aajtodd reviewed Oct 7, 2024

View reviewed changes

waahm7 added 3 commits October 7, 2024 11:54

PR Feedback

639019c

comment update

5282233

lint

cde6845

aajtodd approved these changes Oct 7, 2024

View reviewed changes

ysaito1001 approved these changes Oct 7, 2024

View reviewed changes

use div_ceil

8590466

waahm7 merged commit 3dd20c8 into main Oct 8, 2024
12 of 13 checks passed

waahm7 deleted the waqar/download-perf branch October 8, 2024 21:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Download Memory Usage #61

Improve Download Memory Usage #61

waahm7 commented Oct 4, 2024

aajtodd left a comment

aajtodd Oct 7, 2024

waahm7 Oct 7, 2024

aajtodd Oct 7, 2024

waahm7 Oct 7, 2024

waahm7 commented Oct 7, 2024 •

edited

Loading

		if let Some(stream) = initial_chunk {
		let seq = handle.ctx.next_seq();

Improve Download Memory Usage #61

Improve Download Memory Usage #61

Conversation

waahm7 commented Oct 4, 2024

aajtodd left a comment

Choose a reason for hiding this comment

aajtodd Oct 7, 2024

Choose a reason for hiding this comment

waahm7 Oct 7, 2024

Choose a reason for hiding this comment

aajtodd Oct 7, 2024

Choose a reason for hiding this comment

waahm7 Oct 7, 2024

Choose a reason for hiding this comment

waahm7 commented Oct 7, 2024 • edited Loading

waahm7 commented Oct 7, 2024 •

edited

Loading