[thrift-remodel] Write Parquet page indexes #8427
Some(column_index.to_thrift())
}
})
.map(|column_index| Some(column_index.clone()))
This clone is unfortunate. We define the page and offset indexes as Vec<Vec<index>>, but they start out as Vec<Vec<Option<index>>> and ultimately return to that form. It would be nice to keep that consistent. Once the remodel is finished we can revisit this.
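To illustrate the cost being described, here is a minimal sketch and not the PR's actual code. `IndexEntry` is a hypothetical stand-in for a per-column index type, and both helper functions are invented for illustration. It contrasts converting a borrowed Vec<Vec<index>> into the optional form, which needs a clone per entry, with converting an owned value, which can move each entry instead.

```rust
/// Hypothetical stand-in for a per-column-chunk index entry
/// (not a type from the parquet crate).
#[derive(Clone, Debug, PartialEq)]
struct IndexEntry {
    null_pages: Vec<bool>,
}

/// Borrowing conversion: the input is only borrowed, so every entry
/// has to be cloned to build the Vec<Vec<Option<_>>> form.
fn wrap_by_clone(indexes: &[Vec<IndexEntry>]) -> Vec<Vec<Option<IndexEntry>>> {
    indexes
        .iter()
        .map(|row_group| row_group.iter().map(|entry| Some(entry.clone())).collect())
        .collect()
}

/// Consuming conversion: the input is owned, so each entry is moved
/// into `Some` and no clone is needed.
fn wrap_by_move(indexes: Vec<Vec<IndexEntry>>) -> Vec<Vec<Option<IndexEntry>>> {
    indexes
        .into_iter()
        .map(|row_group| row_group.into_iter().map(Some).collect())
        .collect()
}
```

Both functions produce the same result; the difference is only whether the caller can give up ownership. Keeping a single representation throughout, as the comment suggests, would remove the need for either conversion.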
gogogogogogo Sorry I can't review these PRs fast enough. I am sure they are great.
Which issue does this PR close?
Note: this targets a feature branch, not main
Rationale for this change
Continues the remodel by implementing writing of the page index structures.
What changes are included in this PR?
This PR removes the old parquet::file::page_index::Index enum and replaces it with the new ColumnIndexMetaData struct.
Are these changes tested?
Covered by existing tests
Are there any user-facing changes?
Yes.