You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Use custom thrift parser for parquet metadata (phase 1 of Thrift remodel) (#8530)
# Which issue does this PR close?
- Closes#5854.
- Closes#5775
- Related to #5853
# Rationale for this change
See issue, but this change is needed to allow greater control over
Parquet metadata handling. These changes speed up Thrift decoding about
2X, and enable further optimizations down the line, such as allowing
selective decoding of only that metadata that is needed.
# What changes are included in this PR?
In short, the `format` crate is now no longer necessary. This crate will
decode from Thrift encoded bytes straight to the metadata objects
defined by this crate (`ParquetMetaData` and children).
# Are these changes tested?
Yes, but more should be added by follow-on PRs
# Are there any user-facing changes?
Yes, many.
---------
Co-authored-by: Matthijs Brobbel <m1brobbel@gmail.com>
Co-authored-by: Andrew Lamb <andrew@nerdnetworks.org>
0 commit comments