Optimize part size for checksummed read #315
Conversation
The prefetcher stores data received from each input stream as a part in the part queue structure. The part size is usually quite large (8 MB or more), and checksum validation always has to be done against an entire part, even if we only read a small portion of that part. This makes checksummed reads much slower than non-checksummed reads. We could make this more efficient by making the parts smaller, or ideally by aligning the part size to the read size, so that we don't have to compute checksums over unnecessary bytes.

Signed-off-by: Monthon Klongklaew <monthonk@amazon.com>
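As a rough sketch of why per-part validation dominates small reads (the `Part` type and `crc32_of` stand-in here are hypothetical; the real part queue and CRC32C integration live in the prefetch module), assume each part carries a single checksum over all of its bytes:

```rust
/// Hypothetical model of a checksummed part in the part queue.
struct Part {
    bytes: Vec<u8>,
    crc: u32, // one checksum covering the whole part
}

/// Stand-in for the real CRC32C implementation.
fn crc32_of(bytes: &[u8]) -> u32 {
    bytes
        .iter()
        .fold(!0u32, |acc, &b| acc.rotate_left(5) ^ u32::from(b))
}

impl Part {
    /// Reading any slice requires validating the entire part first, so with
    /// 8 MB parts even a 4 KiB read ends up checksumming 8 MB of data.
    fn read(&self, offset: usize, len: usize) -> Option<&[u8]> {
        if crc32_of(&self.bytes) != self.crc {
            return None; // checksum mismatch: fail the read
        }
        self.bytes.get(offset..offset + len)
    }
}
```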
mountpoint-s3/src/prefetch.rs
Outdated
let min_part_size = 128 * 1024;
self.preferred_part_size = length.max(min_part_size);
I think if you initialize preferred_part_size to 128KiB, this can just be:

- let min_part_size = 128 * 1024;
- self.preferred_part_size = length.max(min_part_size);
+ self.preferred_part_size = self.preferred_part_size.max(length);
Also worth a comment on why we chose 128KiB.
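For reference, the shape that suggestion implies might look like this (a minimal sketch with assumed names, not the exact Mountpoint code):

```rust
/// Minimal sketch (names assumed, not the exact Mountpoint code).
struct PrefetchGetObject {
    /// Initialized to 128 KiB: the Linux readahead size, and a reasonable
    /// floor so small reads don't produce tiny parts.
    preferred_part_size: usize,
}

impl PrefetchGetObject {
    fn new() -> Self {
        Self { preferred_part_size: 128 * 1024 }
    }

    /// Per read: ratchet the preferred part size up toward the read size.
    fn update_preferred_part_size(&mut self, length: usize) {
        self.preferred_part_size = self.preferred_part_size.max(length);
    }
}
```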
Also, should we set a maximum here, like 1MiB?
For choosing 128KiB: it's the Linux readahead size and seems to be a reasonable minimum value. I will update the comment, and I'm happy to change it if you have a better suggestion.

I'm not sure we should put a maximum value though. If the read size is really big, then we will have to combine data from multiple parts, and the extend operation is quite expensive.
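To illustrate the cost being described, here is a hypothetical sketch of serving one read that spans several small parts; the copy in `extend_from_slice` (plus per-part validation) is what grows with the number of parts:

```rust
/// Hypothetical sketch: a read spanning several parts has to copy each
/// part's bytes into one contiguous buffer before returning.
fn assemble_read(parts: &[Vec<u8>], len: usize) -> Vec<u8> {
    let mut out = Vec::with_capacity(len);
    for part in parts {
        if out.len() >= len {
            break;
        }
        let take = (len - out.len()).min(part.len());
        // This copy (plus validating each part's checksum) is the
        // "extend" cost that grows with the number of parts combined.
        out.extend_from_slice(&part[..take]);
    }
    out
}
```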
But if we change the logic to use the max of the last preferred_part_size and the current length, I think there should be a maximum; otherwise it will keep growing.
The effective maximum is the client's part size anyway, so we should probably enforce it here just to be explicit.
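Enforcing both bounds could look something like this (a sketch; `client_part_size` is assumed to come from the client configuration):

```rust
/// Sketch: clamp between the 128 KiB floor and the client's part size.
/// `client_part_size` is assumed to be at least 128 KiB (the client part
/// size is usually 8 MB or more), which `clamp` requires.
fn next_preferred_part_size(current: usize, length: usize, client_part_size: usize) -> usize {
    let min_part_size = 128 * 1024;
    current.max(length).clamp(min_part_size, client_part_size)
}
```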
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and I agree to the terms of the Developer Certificate of Origin (DCO).