@mattaubury Internally, Parquet does not regard dictionary as a "type"; it regards it as an "encoding". That is to say, writing a dictionary array to Parquet does not mean the column's encoding is dictionary. On the read side, the patch is not merged yet, so we are currently unable to read a large-binary dictionary from Parquet.
(tested on pyarrow-13.0.0, Linux x64)
When writing a dictionary-encoded column of `large_string` type to a Parquet file and reading it back, it is read back as a plain `string` type.

Repro in Python, but I see the same in C++:
I'd expect to get it back as a `dictionary<values=large_string, indices=int32, ordered=0>` type.

Component(s)
C++, Parquet, Python