ostree-ext/filter_tar_async: flush decompressor stream #1228


Merged
merged 1 commit into from
Mar 25, 2025

Conversation

jeckersb
Collaborator

We need to read anything past the tar stream; otherwise we'll deadlock
via skopeo in FinishPipe.

Also remove the bit where we hold the pipe open longer; we shouldn't
need to do that anymore because we've ensured we've read everything
out of it by this point.

Resolves: #1204

Signed-off-by: John Eckersberg jeckersb@redhat.com
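The core of the fix is to drain whatever bytes remain in the input stream after the tar payload has been consumed, so the writer on the other end of the pipe never blocks. A minimal stdlib-only sketch of that idea — `drain_remainder` is a hypothetical helper name, the real change lives in `filter_tar_async`:

```rust
use std::io::Read;

/// Hypothetical helper: read and discard everything left in `src`,
/// returning how many bytes were drained.
fn drain_remainder(src: &mut impl Read) -> std::io::Result<u64> {
    std::io::copy(src, &mut std::io::sink())
}

fn main() -> std::io::Result<()> {
    // Simulate trailing data left behind after the tar payload was read.
    let mut leftover: &[u8] = &[0u8; 1024];
    let n = drain_remainder(&mut leftover)?;
    if n != 0 {
        eprintln!("Read extra {n} bytes at end of decompressor stream");
    }
    assert_eq!(n, 1024);
    Ok(())
}
```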

@jeckersb
Collaborator Author

Hm should have probably run tests locally first 😆

@cgwalters
Collaborator

Ahh, that definitely explains why it happens reliably with zstd:chunked.

But... hmm... we have still seen it occasionally for non-zstd:chunked layers too. Could it be that even gzip has a similar thing, just less likely?

@jeckersb
Collaborator Author

Yeah I'm not familiar with the specifics on gzip but that could be the case.

Looking at the failing tests, though, it looks like the filtering may cause us to not always read to the end of even the basic tar stream? I guess that could also cause the same behavior. Should be easy enough to test by writing a large enough file at the end of the stream that ends up getting filtered out.

@cgwalters
Collaborator

If the last item is filtered out, then that will definitely happen.

But even there, in the tar format there's trailing zeros to indicate end-of-archive, and I wonder if tar-rs is always consuming those bytes? It probably should be, but not 100% sure.
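For context on those trailing zeros: a tar archive ends with at least two 512-byte blocks of zeros, so a parser that stops after the last entry's padded data still leaves bytes in the underlying reader. A stdlib-only sketch (placeholder bytes stand in for real tar headers, which are irrelevant to the point):

```rust
use std::io::{Cursor, Read};

const BLOCK: usize = 512;

fn main() -> std::io::Result<()> {
    // Fake archive: one "entry" occupying 3 blocks, then the
    // end-of-archive marker (two zero blocks).
    let entry_blocks = 3;
    let mut archive = vec![1u8; entry_blocks * BLOCK];
    archive.extend_from_slice(&[0u8; 2 * BLOCK]);

    let mut rdr = Cursor::new(archive);

    // A parser that reads only the entry's blocks...
    let mut entry = vec![0u8; entry_blocks * BLOCK];
    rdr.read_exact(&mut entry)?;

    // ...leaves the trailer behind; it must be drained explicitly
    // before the pipe is closed.
    let trailing = std::io::copy(&mut rdr, &mut std::io::sink())?;
    assert_eq!(trailing as usize, 2 * BLOCK);
    println!("drained {trailing} trailing bytes");
    Ok(())
}
```

Whether tar-rs consumes that trailer itself is exactly the open question here; the sketch only shows why someone has to.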

@jeckersb
Collaborator Author

Well, I suppose the "cheater" fix here is to just trace! and ignore when we see extra data at the end of the stream, since technically it's not fatal. Part of me wants to know why tar-rs doesn't seem to read all the way through, but the practical part of me just wants to move on.

We need to read anything past the tar stream; otherwise we'll deadlock
via skopeo in FinishPipe.

Also remove the bit where we hold the pipe open longer; we shouldn't
need to do that anymore because we've ensured we've read everything
out of it by this point.

Resolves: bootc-dev#1204

Signed-off-by: John Eckersberg <jeckersb@redhat.com>
@jeckersb
Collaborator Author

Another thing re: "why does gzip work better": note that for the gzip case we explicitly create a BufReader since one is required by bufread::GzDecoder, whereas in the zstd case it allocates its own BufReader::with_capacity. I haven't run down what the result of zstd_safe::DCtx::in_size() ends up being, but I suspect in the zstd case we end up buffering a lot less than in the gzip case, which means it would be more likely that we "miss" the end of the stream in our buffer.
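The buffering asymmetry can be illustrated with the standard library alone. The gzip path hands the decoder a default-sized BufReader, while the zstd path's internal buffer (sized by `zstd_safe::DCtx::in_size()`) is modeled here with an explicit smaller capacity — the `128` is an assumption for illustration, not the real value:

```rust
use std::io::{BufReader, Cursor};

fn main() {
    let data = vec![0u8; 1024];

    // bufread::GzDecoder requires the caller to supply a BufRead, so
    // the gzip path wraps the stream in a default BufReader (8 KiB as
    // of current std, though the default is not guaranteed).
    let gz_style = BufReader::new(Cursor::new(data.clone()));

    // The zstd decoder allocates its own internal buffer; a smaller
    // capacity here stands in for whatever DCtx::in_size() returns.
    let zstd_style = BufReader::with_capacity(128, Cursor::new(data));

    assert!(gz_style.capacity() > zstd_style.capacity());
    println!(
        "gzip-path buffer: {} bytes, zstd-path buffer: {} bytes",
        gz_style.capacity(),
        zstd_style.capacity()
    );
}
```

A smaller read buffer means fewer bytes pulled per read, so the end-of-stream trailer is more likely to still be sitting unread in the pipe when the consumer stops.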

if n != 0 {
tracing::debug!("Read extra {n} bytes at end of decompressor stream");
}

// Pass ownership of the input stream back to the caller - see below.
@jeckersb
Collaborator Author

This comment needs to be removed (if we actually remove this logic).

@cgwalters cgwalters merged commit 709a3f8 into bootc-dev:main Mar 25, 2025
22 of 23 checks passed
jeckersb added a commit to jeckersb/bootc that referenced this pull request Mar 25, 2025
This merged before I pushed an updated version, so follow up:

- Be explicit about also flushing the tar stream, since that can also
  have trailing data.

- Remove unneeded comment about passing the input stream

Signed-off-by: John Eckersberg <jeckersb@redhat.com>
cgwalters added a commit that referenced this pull request Mar 25, 2025
@cgwalters
Collaborator

@jeckersb So I went to re-test #963 after this landed but I still get the same symptom. I think we should add an integration test that verifies we can fetch from a zstd:chunked image as part of fixes here.

@jeckersb
Collaborator Author

jeckersb commented Apr 1, 2025

@jeckersb So I went to re-test #963 after this landed but I still get the same symptom. I think we should add an integration test that verifies we can fetch from a zstd:chunked image as part of fixes here.

We do already have tests in ostree-ext for zstd and zstd:chunked:

/// Test for zstd
#[tokio::test]
async fn test_container_zstd() -> Result<()> {
test_non_gzip("zstd").await
}
/// Test for zstd:chunked
#[tokio::test]
async fn test_container_zstd_chunked() -> Result<()> {
test_non_gzip("zstd:chunked").await
}

but obviously those were passing before this change. I was only able to get it to break with a sufficiently large container image (empty worked, 50MB worked, 100MB intermittently failed, 200MB always failed). So I'll add a test that generates a large (1GB?) zstd/zstd:chunked image and imports that.
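A deterministic payload generator for such a test might look like the sketch below (`write_large_blob` is a hypothetical name; the real test would feed this through the image build). One caveat worth noting: constant filler compresses extremely well, so random data would stress the decompressor's buffering more realistically:

```rust
use std::io::Write;

/// Write `size` bytes of filler in fixed-size chunks, so a test can
/// build an arbitrarily large layer without holding it all in memory.
fn write_large_blob(w: &mut impl Write, size: usize) -> std::io::Result<()> {
    let chunk = [0xABu8; 8192];
    let mut remaining = size;
    while remaining > 0 {
        let n = remaining.min(chunk.len());
        w.write_all(&chunk[..n])?;
        remaining -= n;
    }
    Ok(())
}

fn main() -> std::io::Result<()> {
    // 1 MiB here to keep the sketch fast; the real test would use
    // something past the ~200MB threshold observed above.
    let mut buf = Vec::new();
    write_large_blob(&mut buf, 1 << 20)?;
    assert_eq!(buf.len(), 1 << 20);
    Ok(())
}
```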

@jeckersb
Collaborator Author

jeckersb commented Apr 1, 2025

@jeckersb So I went to re-test #963 after this landed but I still get the same symptom. I think we should add an integration test that verifies we can fetch from a zstd:chunked image as part of fixes here.

I am able to reproduce this as well with bootc switch. Worth noting that it's always failing in unencapsulate_base, whereas before I was seeing it in subsequent layers. So maybe not exactly the same issue, but certainly closely related. Should be easy (famous last words...) to figure out what the difference is between the base and non-base case with such a simple and reliable reproducer.

@jeckersb
Collaborator Author

jeckersb commented Apr 1, 2025

Your image also fails using the same command I was using before (just container image pull from ostree-ext):

# ostree --repo=repo refs --delete ostree/container; ./bootc internals ostree-container image pull --quiet --insecure-skip-tls-verification repo ostree-unverified-registry:quay.io/cgwalters/ostest:zstdchunked
layers already present: 0; layers needed: 65 (378.5 MB)
ERROR Importing: Unencapsulating base: failed to invoke method FinishPipe: failed to invoke method FinishPipe: write |1: broken pipe

jeckersb added a commit to jeckersb/bootc that referenced this pull request Apr 1, 2025
Followup to bootc-dev#1228

We use the decompressor a couple of times in unencapsulate_base in
addition to the previous fix in filter_tar_async.  Instead of
duplicating the decompressor flush logic in each place, wrap the
decompressor in a struct and implement Drop for it so we always
automatically flush the input stream when we're finished with it.

Signed-off-by: John Eckersberg <jeckersb@redhat.com>
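The Drop-based approach described in that commit can be sketched as follows — `FlushingReader` and the atomic counter are illustrative names, not the actual types in the follow-up PR:

```rust
use std::io::{Cursor, Read};
use std::sync::atomic::{AtomicU64, Ordering};

// Counter used only so this sketch can observe the drain happening.
static DRAINED: AtomicU64 = AtomicU64::new(0);

/// Wrap a reader so its remaining input is always drained on Drop,
/// instead of duplicating the flush logic at every call site.
struct FlushingReader<R: Read> {
    inner: R,
}

impl<R: Read> Read for FlushingReader<R> {
    fn read(&mut self, buf: &mut [u8]) -> std::io::Result<usize> {
        self.inner.read(buf)
    }
}

impl<R: Read> Drop for FlushingReader<R> {
    fn drop(&mut self) {
        // Best effort on teardown; errors would be logged, not propagated.
        if let Ok(n) = std::io::copy(&mut self.inner, &mut std::io::sink()) {
            DRAINED.fetch_add(n, Ordering::Relaxed);
        }
    }
}

fn main() {
    let mut r = FlushingReader { inner: Cursor::new(vec![7u8; 600]) };
    let mut partial = [0u8; 100];
    r.read_exact(&mut partial).unwrap();
    drop(r); // the remaining 500 bytes are drained automatically
    assert_eq!(DRAINED.load(Ordering::Relaxed), 500);
}
```

Tying the flush to Drop means every exit path, including early returns on error, leaves the pipe fully read.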
@jeckersb
Collaborator Author

jeckersb commented Apr 1, 2025

Followup PR should fix this

Successfully merging this pull request may close these issues.

broken pipe on fetches still