-
Notifications
You must be signed in to change notification settings - Fork 98
chore(deps): update arrow-rs to v57 (major) #5482
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
|
CodSpeed Performance ReportMerging #5482 will degrade performances by 51.14%Comparing Summary
Benchmarks breakdown
Footnotes
|
a0f5d8f to
f3fe6c4
Compare
f3fe6c4 to
ab3e18f
Compare
This PR contains the following updates:
56->5756->5756->5756->5756->5756->5756->5756->5756->5756->5756->57Release Notes
apache/arrow-rs (arrow-arith)
v57.1.0Compare Source
Full Changelog
Implemented enhancements:
unsafe#8848 [arrow]shred_variant#8795 [arrow]FilterBuilder::optimizeor not #8781 [arrow]arrow-jsonsupports encoding binary arrays, but not decoding #8736 [arrow]FilterPredicateinstances to be reused for RecordBatches #8692 [arrow]ParquetRecordBatchStream(async API) in terms of the PushDecoder #8677 [parquet]FixedSizeBinary(16)shredding #8665parse_data_typeforList,ListView,LargeList,LargeListView,FixedSizeList,Union,Map,RunEndCoded. #8648 [arrow]RleDecoder::resetrather than panic #8632 [parquet]BooleanBufferBuilderandMutableBufferthat mutate directly the buffer #8618 [arrow]Fixed bugs:
List(Int64)results in nullable list in 57.0.0 and a non-nullable list in 57.1.0 #8883ThriftMetadataWriter::write_column_indexescannot handle aColumnIndexMetaData::NONE#8815 [parquet]error: item_reader def levels are Nonewhen reading nested field with row filter #8657 [parquet]FixedSizeListArrays are not handled #8623 [arrow]Documentation updates:
MutableBufferfromBuffer#8853 [arrow] (alamb)PrimitiveArrayto aVec#8771 [arrow] (alamb)arrow_integration_testcrate #8738 [arrow] (phil-opp)Performance improvements:
RowConverter::from_binaryshould opportunistically take ownership of the buffer #8685 [arrow]collect_booland removeunsafe, optimizetake_bits,take_nativefor null values #8849 [arrow] (Dandandan)BooleanBuffer::append_packed_rangeto useapply_bitwise_binary_op#8812 [arrow] (alamb)LogicalTypeinColumnOrder::get_sort_order, deprecateget_logical_type#8789 [parquet] (alamb)ArrayIterdefault impl fornth,nth_back,lastandcount#8785 [arrow] (rluvaton)SerializedPageReader#8745 [parquet] (XiangpengHao)count,nth,nth_back,lastandmaxfor BitIterator #8696 [arrow] (rluvaton)FilterPredicate::filter_record_batch#8693 [arrow] (pepijnve)RowConverter::from_binary#8686 [arrow] (mzabaluev)apply_unary_opandapply_binary_opbitwise operations #8619 [arrow] (rluvaton)Closed issues:
Field#8784arrow-integration-testcrate is linked fromarrowdocs #8739 [arrow]VariantArray::valueto return aResult<Variant>#8672Merged pull requests:
List(non-null Int64)instead ofList(nullable Int64)#8890 [parquet] [arrow] (etseidl)ArrowReaderOptions::with_virtual_columnserror rather than panic on invalid input #8867 [parquet] (alamb)RowNumberReaderwhen not all row groups are selected #8863 [parquet] (vustef)ArrayFormatters when Pretty-Printing Record Batches #8829 [arrow] (tobixdev)ListArraydocumentation #8803 [arrow] (liamzwbao)ArrowReaderOptionsandArrowReaderMetadata#8798 [parquet] (alamb)shred_variant#8796 (liamzwbao)VariantPath::is_empty#8791 (friendlymatthew)ArrayIterbenchmarks #8774 [arrow] (rluvaton)DataType::{Binary/LargeBinary/BinaryView}#8768 [arrow] (klion26)DataType::Union,DataType::Map,DataType::RunEndEncoded#8765 [arrow] (dqkqd)test_cache_projection_excludes_nested_columnsto use high level APIs #8754 [parquet] (alamb)mergeandmerge_nkernels #8753 [arrow] (pepijnve)arrow-integration-testcrate from mainarrowcrate #8740 [arrow] (phil-opp)RleDecoder::reloadrather than panic #8729 [parquet] (liamzwbao)ArrayIterdoes not report size hint correctly after advancing from the iterator back #8728 [arrow] (rluvaton)zipnow treats nulls as false in provided mask regardless of the underlying bit value #8711 [arrow] (rluvaton)arrow-avroREADME.mdversion to 57 #8695 [arrow] (jecsand838)thrift_enummacro forConvertedType#8680 [parquet] (etseidl)OffsetBuffer::from_lengths(std::iter::repeat_n(<val>, <repeat>));withOffsetBuffer::from_repeated_length(<val>, <repeat>);#8669 [arrow] (rluvaton)shred_variantfor Uuids #8666 (friendlymatthew)create_test_variant_arrayhelper method #8664 (friendlymatthew)new_repeatedtoByteArray#8659 [arrow] (rluvaton)repeat_slice_n_timestoMutableBuffer#8658 [arrow] (rluvaton)rescale_decimalfollowup #8655 [arrow] (liamzwbao)List,ListView,LargeList,LargeListView,FixedSizeList#8649 [arrow] (dqkqd)try_new_with_lengthconstructor toFixedSizeList#8624 [arrow] (connortsui20)variant_to_arrowfor utf8 #8600 [arrow] (sdf-jkl)* This Changelog was automatically generated by github_changelog_generator
v57.0.0: arrow 57.0.0Compare Source
Changelog
57.0.0 (2025-10-19)
Full Changelog
Breaking changes:
Arc<FileEncryptionProperties>everywhere to be be consistent withFileDecryptionProperties#8626 [parquet] (alamb)RunEndEncoded#8596 [arrow] (Weijun-H)ArrowError::AvroError, remaining types and roundtrip tests toarrow-avro, #8595 [arrow] (jecsand838)Mapdisplay formatting in DataType #8570 [arrow] (Weijun-H)ListViewandLargeListViewvariants #8569 [arrow] (Weijun-H)Arc<FileDecryptionProperties>to reduce size of ParquetMetadata and avoid copying whenencryptionis enabled #8470 [parquet] (alamb)DataType::Structfield names inDisplayformatting #8291 [parquet] [arrow] (emilk)DisplayforDataTypeandField#8290 [parquet] [arrow] (emilk)Implemented enhancements:
arrow-avrocrate) #4886VariantArrayiterable #8612PartialEqforVariantArray#8610VariantArray#8609VarianttoVariantArray#8606RecordBatch::project#8591 [arrow]convert_row_groups#8517 [parquet]thrift-remodelbranch with DataFusion #8513 [parquet]UnionArray::is_denseMethod Public #8503 [arrow]append_nmethod toFixedSizeBinaryDictionaryBuilder#8497 [arrow]serdedependency toserde_corewhere applicable #8451 [arrow]ParquetMetadataReaderinto IO/decoder state machine and thrift parsing #8439 [parquet]arrow_jsonlackswith_timestamp_formatfunctions likearrow_csvhad offered #8398 [arrow]shred_variantfunction #8361List#8337 [parquet]Struct#8336 [parquet]Time64(Microsecond)#8334 [parquet]Decimal128#8332 [parquet]Timestamp(Microsecond, _)andTimestamp(Nanosecond, _)#8331 [parquet]Date32#8330 [parquet]DataType::FixedSizeListsupport forcast_to_variantkernel #8281Fixed bugs:
max_predicate_cache_sizeis 0 #8542 [parquet]arrow::compute::interleavefails with struct arrays with no fields #8533 [arrow]ulimit#8406 [parquet]VariantArray::data_typereturnsStructType, causingArray::as_structto panic #8319 [parquet]Documentation updates:
Performance improvements:
zipkernel benchmarks #8654 [arrow] (rluvaton)GenericByteBuilder::append_arrayto use SIMD for extending the offsets #8388 [arrow] (rluvaton)Closed issues:
metadatafield should be marked is non-nullable #8410 [parquet]parquet-geospatialcrate #8377parquet-geometrycrate #8374Merged pull requests:
AvroBinaryFormattoAvroSoeFormat#8633 [arrow] (jecsand838)PartialEqandFromIterator<Option<..>>forVariantArray#8627 (friendlymatthew)ArrowWriter::into_serialized_writerand add docs #8621 [parquet] (alamb)BitChunks::new#8620 [arrow] (rluvaton)get_type_name#8617 (liamzwbao)Configuration
📅 Schedule: Branch creation - Between 12:00 AM and 03:59 AM, only on Monday ( * 0-3 * * 1 ) (UTC), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about these updates again.
This PR was generated by Mend Renovate. View the repository job log.