PARQUET-2432: Use ByteBufferAllocator over hardcoded heap allocation #1278
gszadovszky merged 4 commits into apache:master
Conversation
* Updated BytesInput implementations to rely on a ByteBufferAllocator instance for allocating/releasing ByteBuffer objects.
* Extended the usage of a ByteBufferAllocator instead of the hardcoded usage of heap allocation (e.g. byte[], ByteBuffer.allocate, etc.).
* parquet-cli related code parts, including ParquetRewriter and tests, are not changed in this effort.
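The pattern described above can be sketched as follows. This is a simplified, hypothetical illustration (the `Allocator` interface here mirrors the shape of Parquet's `ByteBufferAllocator` but is redefined so the sketch is self-contained; it is not the actual Parquet code):

```java
import java.nio.ByteBuffer;

// Hypothetical, simplified stand-in for org.apache.parquet.bytes.ByteBufferAllocator:
// callers ask the allocator for buffers instead of hardcoding ByteBuffer.allocate.
interface Allocator {
    ByteBuffer allocate(int size);
    void release(ByteBuffer buffer);
}

class HeapAllocator implements Allocator {
    public ByteBuffer allocate(int size) {
        return ByteBuffer.allocate(size); // heap-backed buffer
    }
    public void release(ByteBuffer buffer) {
        // heap buffers are reclaimed by the GC; a direct-memory
        // implementation would free native memory here instead
    }
}

public class AllocatorDemo {
    public static void main(String[] args) {
        Allocator allocator = new HeapAllocator();
        // Instead of hardcoding ByteBuffer.allocate(1024), go through the allocator,
        // so heap vs. direct allocation becomes the caller's choice:
        ByteBuffer buf = allocator.allocate(1024);
        System.out.println(buf.capacity()); // 1024
        allocator.release(buf);
    }
}
```

Swapping `HeapAllocator` for a direct-memory implementation then requires no change in the code that writes into the buffer.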
@wgtmac, if you have some time, could you check this out?

Sure, I will take a look by the end of this week.
wgtmac left a comment:
I didn't check the tests thoroughly, but overall this LGTM.
```java
public abstract void writeAllTo(OutputStream out) throws IOException;

/**
 * For internal use only. It is expected that the buffer is large enough to fit the content of this {@link BytesInput}
```
Should we add a comment for what to expect if the content does not fit into the ByteBuffer?
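For context on what an undocumented overflow would look like: absent an explicit contract, a plain relative `put` of oversized content into a `ByteBuffer` throws `BufferOverflowException`. A minimal, self-contained sketch (deliberately unrelated to the actual `BytesInput` implementation):

```java
import java.nio.BufferOverflowException;
import java.nio.ByteBuffer;

public class OverflowDemo {
    public static void main(String[] args) {
        ByteBuffer small = ByteBuffer.allocate(4); // capacity 4
        byte[] content = new byte[8];              // 8 bytes do not fit
        try {
            small.put(content); // relative bulk put: throws when remaining() < content.length
            System.out.println("fits");
        } catch (BufferOverflowException e) {
            System.out.println("BufferOverflowException");
        }
    }
}
```

Documenting whether the method throws, truncates, or leaves the buffer in an undefined state would spare callers this guesswork.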
```java
 * @return a text representation of the memory usage of this structure
 */
public String memUsageString(String prefix) {
  return format("%s %s %d slabs, %,d bytes", prefix, getClass().getSimpleName(), slabs.size(), size);
```
Suggested change:

```diff
- return format("%s %s %d slabs, %,d bytes", prefix, getClass().getSimpleName(), slabs.size(), size);
+ return format("%s %s %d slabs, %d bytes", prefix, getClass().getSimpleName(), slabs.size(), size);
```
I've just copy-pasted this from ConcatenatingByteArrayCollector, but it seems to be intentional: %,d adds grouping separators to the value representation (e.g. 123,456,789).
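A quick demonstration of the difference between the two format flags (Locale.US is pinned here so the grouping separator is deterministic; the actual Parquet code uses the default locale):

```java
import java.util.Locale;

public class FormatDemo {
    public static void main(String[] args) {
        // %,d inserts locale-specific grouping separators; %d does not
        System.out.println(String.format(Locale.US, "%,d", 123456789)); // 123,456,789
        System.out.println(String.format(Locale.US, "%d", 123456789));  // 123456789
    }
}
```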
```java
import java.nio.ByteBuffer;

/**
 * A special {@link ByteBufferAllocator} implementation that keeps one {@link ByteBuffer} object and reuse it at the
```
Suggested change:

```diff
- * A special {@link ByteBufferAllocator} implementation that keeps one {@link ByteBuffer} object and reuse it at the
+ * A special {@link ByteBufferAllocator} implementation that keeps one {@link ByteBuffer} object and reuses it at the
```
```java
this.allocator = allocator;
this.toRelease = toRelease;
```

```java
void setReleaser(ByteBufferReleaser releaser) {
  this.releaser = releaser;
```
Should we check if the passed releaser is null?
This is internal (both the method and the class are package-private). I wouldn't add additional checks.
```java
byte[] serializedFooter = new byte[combinedFooterLength - footerSignatureLength];
System.arraycopy(footerAndSignature, 0, serializedFooter, 0, serializedFooter.length);
// Resetting to the beginning of the footer
from.reset();
```
Should we check from.markSupported() before calling reset() and mark()?
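One way the suggested guard could look, sketched with a hypothetical `readTwice` helper (invented for illustration; not code from the PR): check `markSupported()` first and, if mark/reset is unavailable, wrap the stream in a `BufferedInputStream`, which always supports it.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

public class MarkResetDemo {
    // Hypothetical helper: reads the stream twice, guarding mark()/reset()
    // behind markSupported() instead of assuming support.
    static byte[] readTwice(InputStream in) throws IOException {
        if (!in.markSupported()) {
            // BufferedInputStream guarantees mark/reset support
            in = new BufferedInputStream(in);
        }
        in.mark(Integer.MAX_VALUE); // remember the current position
        in.readAllBytes();          // first pass
        in.reset();                 // rewind to the marked position
        return in.readAllBytes();   // second pass
    }

    public static void main(String[] args) throws IOException {
        byte[] data = {1, 2, 3};
        byte[] again = readTwice(new ByteArrayInputStream(data));
        System.out.println(again.length); // 3
    }
}
```

Calling `reset()` on a stream that does not support marks throws `IOException`, which is why checking (or wrapping) up front is safer.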
```java
    allocator);
}

@Deprecated
```
The argument list grows longer now. Should we use an options class instead to avoid frequent deprecation?
On the one hand, I completely agree. On the other hand, ParquetFileWriter should be an internal class; it is unfortunate that it is public. I would not create yet another parameters builder for ParquetFileWriter.
I'll think about a solution somewhere in between.
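The "options class" idea raised above could look roughly like this hypothetical sketch (names such as `WriterOptions` are invented for illustration and are not ParquetFileWriter's actual API): new settings get a new builder method instead of a new constructor overload, so nothing needs deprecating as the list of settings grows.

```java
public class WriterOptionsDemo {
    // Hypothetical immutable options class built via a builder.
    static final class WriterOptions {
        final int pageSize;
        final boolean useDirectBuffers;

        private WriterOptions(Builder b) {
            this.pageSize = b.pageSize;
            this.useDirectBuffers = b.useDirectBuffers;
        }

        static final class Builder {
            // Defaults; adding a new option later means adding one
            // field and one method here, not a new constructor overload.
            int pageSize = 1 << 20;
            boolean useDirectBuffers = false;

            Builder pageSize(int v) { pageSize = v; return this; }
            Builder useDirectBuffers(boolean v) { useDirectBuffers = v; return this; }
            WriterOptions build() { return new WriterOptions(this); }
        }
    }

    public static void main(String[] args) {
        WriterOptions opts = new WriterOptions.Builder()
                .pageSize(8192)
                .build();
        System.out.println(opts.pageSize); // 8192
    }
}
```

The trade-off, as noted in the reply, is yet another builder type on a class that arguably should not be public in the first place.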
Thank you, @wgtmac

Hi @gszadovszky @wgtmac, it seems this patch may cause a deadlock. Hadoop code path: https://github.com/apache/hadoop/blob/rel/release-3.3.3/hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java#L631