Update parquet to v58.2.0 #6

Merged
renovate[bot] merged 1 commit into trunk from renovate/parquet on May 18, 2026

@renovate renovate Bot commented May 18, 2026

ℹ️ Note

This PR body was truncated due to platform limits.

This PR contains the following updates:

Package         Type          Update  Change           Pending
parquet         dependencies  minor   58.0.0 → 58.2.0  58.3.0
parquet_derive  dependencies  minor   58.0.0 → 58.2.0  58.3.0

Release Notes

apache/arrow-rs (parquet)

v58.2.0: arrow 58.2.0

Compare Source

Changelog
58.2.0 (2026-04-28)

Full Changelog

Implemented enhancements:

  • Expose ColumnCloseResult on ArrowColumnChunk #​9774 [parquet]
  • Expose FFI data structures fields #​9771 [arrow]
  • short-circuit last predicate in RowFilter when with_limit(N) is set #​9765 [parquet]
  • vectorise dict-index bounds check #​9747 [parquet]
  • Refactor RleEncoder::flush_bit_packed_run #​9734 [parquet]
  • Add benchmark for cast from/to decimals #​9728 [arrow]
  • Add a security policy for arrow-rs #​9727 [parquet] [arrow] [arrow-flight]
  • Support FixedSizeList in arrow-json reader #​9714 [arrow]
  • [Variant] Add VariantArrayBuilder::append_nulls API #​9684
  • [Json] RunEndEncoded decoder optimization #​9645 [arrow]
  • [Variant] variant_get(..., List<_>) non-Struct types support #​9615
  • [Variant] Add unshredded Struct fast-path for variant_get(..., Struct) #​9596
  • Allow setting custom line terminator for CSV writer #​9571 [arrow]
  • [Variant] Align cast logic for variant_get to cast kernel for numeric/bool types #​9564 [arrow]
  • ci: use ubuntu-slim where applicable #​9536
  • Publicly export arrow_string::Predicate and its methods? #​9480
  • Don't create CompressionContext when no compression is selected [IPC] #​9463 [arrow]
  • Parquet: Raw level buffering causes unbounded memory growth for sparse columns #​9446 [parquet]
  • Parallel Parquet Reading #​9381 [parquet]

Fixed bugs:

  • [Variant] unshred_variant panics on malformed bytes despite returning Result #​9740
  • RecordBatch::normalize() does not propagate top level null bitmap into the results #​9732 [arrow]
  • Incorrect accounting in DictEncoder::estimated_memory_size #​9719 [parquet]
  • arrow-ipc writer does not comply with spec for empty variable-size arrays #​9716 [arrow]
  • Panic when reading corrupt parquet file with truncated data instead of ParquetError #​9705 [parquet]
  • NOTICE.txt is inaccurate #​9703 [arrow]
  • Unnecessary dependency on regex crate #​9672
  • [arrow-avro] Avro reader produces incorrect results when reader schema and writer schema differ #​9655 [arrow]
  • parquet docs are broken on docs.rs #​9649
  • [Parquet] ArrowWriter with CDC panics on nested ListArrays #​9637 [parquet] [arrow] [arrow-flight]
  • Use release KEYS file for verification instead of dev KEYS #​9603
  • IPC reader: handling of dictionaries with only null values #​9595 [arrow]
  • Parquet RleDecoder::get_batch_with_dict panics on oob dictionary indices #​9434 [parquet]

Documentation updates:

Performance improvements:

Closed issues:

  • Incorrect buffer skipping for V4 Union types in IPC skip_field #​9828 [arrow]
  • Replace wildcard match in skip_field with explicit DataType handling #​9821 [arrow]
  • Column projection misalignment for ListView / LargeListView in IPC reader #​9805 [arrow]
  • Avoid panic on malformed compressed buffer prefix in IPC #​9801 [arrow]
  • DeltaByteArrayDecoder panics on invalid prefix lengths #​9796 [parquet]
  • Use NullBufferBuilder when reading json #​9781 [arrow]
  • Perfectly shredded arrays with top-level null values loss nullability when typed_value is extracted #​9701
  • [Parquet Metadata] API to determine page-index presence separately from page-index load #​9693
  • Union cast is incorrect for duplicate field names #​9664 [arrow]
  • List and ListView are missing take benchmarks #​9627 [arrow]
  • Support RunEndEncoded arrays in comparison kernels (eq, lt, etc.) #​9620 [arrow]
  • variant_get should follow JSONpath semantics #​9606
  • GenericByteViewArray: support finding total length of all strings #​9435 [arrow]

Merged pull requests:

This Changelog was automatically generated by github_changelog_generator

v58.1.0: arrow 58.1.0

Compare Source

Changelog
58.1.0 (2026-03-20)

Full Changelog

Implemented enhancements:

  • Reuse compression dict lz4_block #​9566
  • [Variant] Add variant_to_arrow Struct type support #​9529
  • [Variant] Add unshred_variant support for Binary and LargeBinary types #​9526
  • [Variant] Add shred_variant support for LargeUtf8 and LargeBinary types #​9525
  • [Variant] variant_get tests clean up #​9517
  • parquet_variant: Support LargeUtf8 typed value in unshred_variant #​9513
  • parquet-variant: Support string view typed value in unshred_variant #​9512
  • Deprecate ArrowTimestampType::make_value in favor of from_naive_datetime #​9490 [arrow]
  • Followup for support ['fieldName'] in VariantPath #​9478
  • Speedup DELTA_BINARY_PACKED decoding when bitwidth is 0 #​9476 [parquet]
  • Support CSV files encoded with charsets other than UTF-8 #​9465 [arrow]
  • Expose Avro writer schema when building the reader #​9460 [arrow]
  • Python: avoid importing pyarrow classes ever time #​9438
  • Add append_nulls to MapBuilder #​9431 [arrow]
  • Add append_non_nulls to StructBuilder #​9429 [arrow]
  • Add append_value_n to GenericByteBuilder #​9425 [arrow]
  • Optimize from_bitwise_binary_op #​9378 [arrow]
  • Configurable Arrow representation of UTC timestamps for Avro reader #​9279 [arrow]

Fixed bugs:

  • MutableArrayData::extend does not copy child values for ListView arrays #​9561 [arrow]
  • ListView interleave bug #​9559 [arrow]
  • Flight encoding panics with "no dict id for field" with nested dict arrays #​9555 [arrow] [arrow-flight]
  • "DeltaBitPackDecoder only supports Int32Type and Int64Type" but unsigned types are supported too #​9551 [parquet]
  • Potential overflow when calling util::bit_mask::set_bits (soundness issue) #​9543 [arrow]
  • handle Null type in try_merge for Struct, List, LargeList, and Union #​9523 [arrow]
  • Invalid offset in sparse column chunk data for multiple predicates #​9516 [parquet]
  • debug_assert_eq! in BatchCoalescer panics in debug mode when batch_size < 4 #​9506 [arrow]
  • Parquet Statistics::null_count_opt wrongly returns Some(0) when stats are missing #​9451 [parquet]
  • Error "Not all children array length are the same!" when decoding rows spanning across page boundaries in parquet file when using RowSelection #​9370 [parquet]
  • Avro schema resolution not properly supported for complex types #​9336 [arrow]

Documentation updates:

Performance improvements:

Closed issues:

  • Duplicate macro definition: partially_shredded_variant_array_gen #​9492
  • Enable LargeList / ListView / LargeListView for VariantArray::try_new #​9455
  • Support variables/expressions in record_batch! macro #​9245 [arrow]

Merged pull requests:

Note

PR body was truncated to here.


Configuration

📅 Schedule: (in timezone America/Detroit)

  • Branch creation
    • Only on Monday (* * * * 1)
  • Automerge
    • At any time (no schedule defined)

🚦 Automerge: Enabled.

Rebasing: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about these updates again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@renovate renovate Bot enabled auto-merge (squash) May 18, 2026 05:45
@renovate renovate Bot merged commit e165adc into trunk May 18, 2026
3 checks passed
@renovate renovate Bot deleted the renovate/parquet branch May 18, 2026 05:46