-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Description
Now that we are using the crc32c module in AWS, we expected to see extremely rapid checksumming ( on the order of 1 GB/s). We're not seeing that. We're still seeing < 100MB/s in Lambda, and much slower than that in Docker. The question is, why?
Using the py-cpuinfo
package, it was confirmed that the machines AWS is giving us do have SSE 4.2 CPU extensions.
Hypotheses:
- CRC32C is no longer the bottleneck, but one of the other algorithms is now bottlenecking us
- checksums are always computed while streaming the file from S3. Perhaps that I/O is the bottleneck?
Experiments to perform:
- See how fast a file can be streamed from S3 -> Lambda.
- See how fast a file can be streamed from S3 -> Batch.
- Locally, checksum files using
dcplib.ChecksummingSink
, experimenting with inclusion/exclusion of the different checksumming algorithms to benchmark the performance of each algorithm, and their use in combination. - Craft a Lambda that checksums a local (non-streamed) file. Test the performance of the various algos in that environment.
- Craft a Docker image that checksums a local (non-streamed) file. Run it in AWS Batch. Test the performance of the various algos in that environment.
Metadata
Metadata
Assignees
Labels
No labels