Skip to content

Conversation

clairemcginty
Copy link
Contributor

Parquet 0.14 supports setting extra file metadata via ParquetWriter: apache/parquet-java#1241

I added a new metadata output param to the scio-parquet Avro/Magnolify/Tensorflow bindings, which matches the naming convention we use in the scio-avro write APIs.

Copy link

codecov bot commented Jun 27, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 61.27%. Comparing base (1e88ee3) to head (5ef843f).

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5411      +/-   ##
==========================================
+ Coverage   61.24%   61.27%   +0.03%     
==========================================
  Files         310      310              
  Lines       11058    11067       +9     
  Branches      751      774      +23     
==========================================
+ Hits         6772     6781       +9     
  Misses       4286     4286              

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@clairemcginty
Copy link
Contributor Author

This has some breaking API changes and is not urgent, can probably wait for 0.15 release.

@clairemcginty clairemcginty modified the milestone: 0.15.0 Jul 3, 2024
@clairemcginty clairemcginty changed the base branch from main to v0.15.x January 15, 2025 14:57
@clairemcginty clairemcginty force-pushed the parquet-extra-metadata branch from 5ef843f to 9521c34 Compare January 16, 2025 21:12
@clairemcginty clairemcginty requested a review from kellen January 16, 2025 21:24
@kellen kellen merged commit 182abe3 into v0.15.x Feb 13, 2025
1 check passed
@kellen kellen deleted the parquet-extra-metadata branch February 13, 2025 15:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants