Closed
Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
#6431 adds a new API for obtaining Parquet metadata. To reduce code duplication and confusion, older APIs should be deprecated and eventually removed. Uses of these older APIs within the parquet crate should be converted to the new API.
This issue will track progress of conversion to the new ParquetMetaDataReader
API.
Non breaking changes
- Deprecate methods in
parquet/src/file/footer.rs
(parse_metadata
,decode_metadata
,decode_footer
) Deprecate methods from footer.rs in favor ofParquetMetaDataReader
#6451 - Deprecate
MetadataLoader
DeprecateMetadataLoader
#6474 - Workaround missing page index inconsistencies (see comment) Workaround for missing Parquet page indexes in
ParquetMetadaReader
#6450
Breaking changes
- Add new enum value to
ParquetError
to signal need for more data AddParquetError::NeedMoreData
markParquetError
asnon_exhaustive
#6630 - Leave page index structures as
None
(rather than empty vectors) if page indexes are not present in the file ReturnNone
when Parquet page indexes are not present in file #6639 - Add async API that supports suffix reads (depends on Parquet/async: Default to suffix requests on supporting readers/object stores #6157) Parquet: Support reading Parquet metadata via suffix range requests #7334
- Deprecate
parquet::file::page_index::index_reader::{read_offset_indexes, read_columns_indexes}