Skip to content

Conversation

srilman
Copy link
Contributor

@srilman srilman commented Mar 14, 2025

Changes included in this PR

Support Parquet partitioned write where some of the partition columns are index columns. That can be either the index or a subset of index columns in the case of a MultiIndex.

Testing strategy

  • Run PR CI with the test enabled
  • Ran locally

User facing changes

Closes #223, all Parquet features should be supported now.

Checklist

  • Pipelines passed before requesting review. To run CI you must include [run CI] in your commit message.
  • I am familiar with the Contributing Guide
  • I have installed + ran pre-commit hooks.

Copy link

codecov bot commented Mar 14, 2025

Codecov Report

Attention: Patch coverage is 92.24138% with 9 lines in your changes missing coverage. Please review.

Project coverage is 66.66%. Comparing base (6c7b124) to head (9ad201c).
Report is 92 commits behind head on main.

❌ Your project check has failed because the head coverage (66.66%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #339      +/-   ##
==========================================
+ Coverage   61.48%   66.66%   +5.18%     
==========================================
  Files         169      176       +7     
  Lines       61920    62712     +792     
  Branches     8663     8770     +107     
==========================================
+ Hits        38070    41806    +3736     
+ Misses      21430    18274    -3156     
- Partials     2420     2632     +212     

[run ci] fix tests

[run ci] another cleanup

[run ci] working

[run ci] tests

[run ci] working
@srilman srilman force-pushed the slade/index-partition branch from 9ad201c to 537bd0d Compare March 18, 2025 17:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Is there any plan to support MultiIndex DataFrames in Parquet I/O in the future?
1 participant