-
Looking at the code flow, if the file is malformed (with a negative dictionary_page_offset), we potentially fail to catch the negative offset on line 189.
Technically, for a valid Parquet file neither offset should be zero, because a Parquet file has a magic number as its first four bytes. If there are no data pages, the row group is empty, and in theory readers should skip it (e.g. this PR does this).
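To make the point concrete, here is a minimal sketch of a stricter check, assuming offsets are plain integers read from the column chunk metadata. The function name `validate_page_offsets` and the constant are hypothetical and not taken from the code under discussion; the only fact relied on is that a Parquet file begins with the four-byte magic `PAR1`, so no page can start before byte 4.

```python
from typing import Optional

# A Parquet file starts with the 4-byte magic "PAR1", so no page can begin before byte 4.
PARQUET_MAGIC_LEN = 4


def validate_page_offsets(
    data_page_offset: int,
    dictionary_page_offset: Optional[int] = None,
) -> None:
    """Raise ValueError if either offset is negative or points into the magic bytes.

    Hypothetical helper for illustration only; a real reader may choose to be
    more lenient (for example, to tolerate files from writers that emit 0).
    """
    offsets = (
        ("data_page_offset", data_page_offset),
        ("dictionary_page_offset", dictionary_page_offset),
    )
    for name, offset in offsets:
        if offset is None:
            # The dictionary page is optional, so an absent offset is fine.
            continue
        if offset < PARQUET_MAGIC_LEN:
            raise ValueError(
                f"{name}={offset} is invalid: must be >= {PARQUET_MAGIC_LEN}"
            )
```

Checking both offsets in the same loop means a negative dictionary_page_offset is rejected by the same rule as a negative data_page_offset, rather than relying on a separate branch to catch it.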
-
It does look like there was some strange behavior for offset = 0 (see the change that added this: 7a9ba61).
-
I was looking at this code and wondered about the checks.