Seeking multiple times #279

197g · 2025-07-10T21:56:36Z

This fixes an issue with cycle tracking that got introduced when we allowed seeking to previous images. Since the next offset of an IFD had to be unique, seeking back would not allow iterating forwards again and bring the decoder into a rather unfortunate state that was not consistent with actually rewinding as intended.

The previous strategy could not keep track of all cycles such as reflexive ones at the very start. Additionally, it would fail to be able to iterate after seeking since it was overly cautious and verified the next IFD position before allowing the read of a current IFD. The new implementation is furthermore proof against having multiple independent chains of IFD which may occur when iterating SubIFDs for thumbnails. (Note that we do not verify the tree-structure, only a freedom from cycles).

197g · 2025-07-10T21:58:59Z

There's another motivation here: This new cycle tracking implementation will allow the decoder to switch the image chain to another one, that is not in the original IFD chain. This is required for tags such as SubIFD (thumbnails etc.) that refer one image to a separate chain of child images.

fintelia · 2025-07-11T18:46:03Z

I think letting the Decoder struct switch to a different chain would add a bunch of complexity that might be better handled by having a dedicated API for reading IFD's off the main chain. The core problem is that on the main chain the TIFF spec provides guarantees of things like each image having either strips or tiles but not both. These let us check invariants immediately when we seek to an IFD and then assume they hold in the various decoding methods. In sub-IFD's, the tags required for main chain IFDs may be missing. In certain cases, the tags in a sub-IFD could have could have entirely different meanings.

src/decoder/mod.rs

197g · 2025-07-11T19:04:51Z

I'll sketch this out in the PR for it but my impression is the decoder could also have a non-image iteration mode without that becoming overly complex.

For instance, when we first switch to another chain we may set a flag in the decoder to no longer read images ahead-of-time and then only deactivate that flag when explicitly asked. Or, offer a next_directory as an alternative to next_image with the separate behavior and expect the user to advance with the correct interface. For some chains such as EXIF we do not intend to read any image. The open question is how many different types of chains to support and how many to let the user do manually through read_directory_tags and Directory::next (#278).

Also, he current behavior has downsides. We should per specification be able to handle color channels beyond the rgba and simply skip bytes that do not belong to them according to the BitsPerSampe array. However, which bytes to ignore and which bytes to read is something only the user can choose. The current implementation makes it hard to perform that correctly since user's choice may depend on the directory—which they can only inspect after the seek, which already triggers image validation. So there is a bunch of work that may be ultimately pointless and really is a single call away.

next_image being the default with validation is good, but a next_directory alternative would not add much complexity cost while keeping the implementation more usable through Decoder.

fintelia · 2025-07-11T19:55:00Z

Ok, I think that makes sense.

I could also see adding methods like Decoder::as_image() -> Option<Image> and Decoder::as_directory() -> &mut Directory so that we could enable more sharing and have fewer methods directly on Decoder.

197g added 3 commits July 10, 2025 23:48

Add subsubifds test file

a1be18b

Test ordering directory IFD

97e2305

197g force-pushed the seeking-multiple-times branch from ac575fe to 97e2305 Compare July 10, 2025 21:57

197g merged commit 6275b7a into main Jul 11, 2025
15 checks passed

197g deleted the seeking-multiple-times branch July 11, 2025 11:21

fintelia reviewed Jul 11, 2025

View reviewed changes

src/decoder/mod.rs Show resolved Hide resolved

197g mentioned this pull request Jul 12, 2025

Expose methods related to reading directory #278

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Seeking multiple times #279

Seeking multiple times #279

Uh oh!

197g commented Jul 10, 2025

Uh oh!

197g commented Jul 10, 2025

Uh oh!

Uh oh!

fintelia commented Jul 11, 2025

Uh oh!

Uh oh!

197g commented Jul 11, 2025

Uh oh!

fintelia commented Jul 11, 2025

Uh oh!

Uh oh!

Seeking multiple times #279

Seeking multiple times #279

Uh oh!

Conversation

197g commented Jul 10, 2025

Uh oh!

197g commented Jul 10, 2025

Uh oh!

Uh oh!

fintelia commented Jul 11, 2025

Uh oh!

Uh oh!

197g commented Jul 11, 2025

Uh oh!

fintelia commented Jul 11, 2025

Uh oh!

Uh oh!