Truncate SpooledTemporaryFile after flushing the buffer #169
Conversation
Codecov Report

@@            Coverage Diff             @@
##           master     #169       +/-   ##
==========================================
- Coverage   59.32%    23.6%    -35.73%
==========================================
  Files          17       18        +1
  Lines        1694     1716       +22
==========================================
- Hits         1005      405      -600
- Misses        689     1311      +622

Continue to review full report at Codecov.
This should also fix the problem mentioned in issue #160
Thanks to you and @iceseyes for bringing this issue up. I'm going to write a failing test with a 5MB file (see the sketch below) and get this one fixed ASAP, probably with your fix. I intentionally removed the S3.py file because it was deprecated in the original repo and I wanted to reduce my own maintenance burden, especially for already deprecated pieces of the library. New fork yadda yadda.
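For reference, a failing test along those lines might look like the following sketch. The storage class, file name, and chunk sizes here are illustrative assumptions, not the actual test that was added:

```python
from storages.backends.s3boto import S3BotoStorage

def test_chunked_write_is_not_duplicated():
    # Write ~10MB in 2MB chunks so the write buffer is flushed
    # (and a multipart part uploaded) more than once.
    storage = S3BotoStorage()
    chunk = b"x" * (2 * 1024 * 1024)
    f = storage.open("big-file.bin", "w")
    for _ in range(5):
        f.write(chunk)
    f.close()
    # Before the fix, earlier chunks were re-uploaded on every flush,
    # so the stored object ended up far larger than what was written.
    assert storage.size("big-file.bin") == 5 * len(chunk)
```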
Hi there, this is affecting us badly as well (from a 45MB file we end up with >15GB on S3 before Django gives up). Thanks!

Possibly related:

Hi, ping on this? We have just encountered an issue where uploading ~10MB of data resulted in ~20GB of data being written to S3; it looks like this PR is the fix we need. Any possibility of a merge?
… by truncating the buffer after uploading it. Follows the approach of jschneier#169.
When uploading a file in chunks with S3BotoStorageFile, I noticed that the underlying SpooledTemporaryFile is never truncated; instead, successive write calls just keep appending to the file.
On each flush, however, the file is seeked back to the beginning, so the contents of all previous flushes are uploaded again on every write, which results in malformed (and ever-growing) data on S3.
Also added the missing S3.py. License seems to permit it?
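For illustration, here is a minimal sketch of the pattern this PR changes. The upload_part() callable is a hypothetical stand-in for the real S3 multipart-upload call; only the seek/truncate handling of the SpooledTemporaryFile reflects the fix described above:

```python
import tempfile

# Hypothetical stand-in for the real S3 multipart upload call.
def upload_part(fileobj):
    data = fileobj.read()
    print("uploading %d bytes" % len(data))

# The write buffer backing the storage file.
buffer = tempfile.SpooledTemporaryFile(max_size=5 * 1024 * 1024)

def flush_write_buffer():
    buffer.seek(0)        # rewind so the whole buffered chunk is read
    upload_part(buffer)   # uploads everything from position 0 onwards
    # The fix: reset the buffer so the next part starts empty.
    # Without these two lines, every later flush re-uploads all
    # previously flushed data, inflating the object on S3.
    buffer.seek(0)
    buffer.truncate()
```

With the truncate in place, each flushed part contains only the bytes written since the previous flush, so a 45MB upload stays 45MB instead of ballooning.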