
[SPARK-55683][SQL][FOLLOWUP] Optimize VectorizedPlainValuesReader.readUnsignedLongs to reuse scratch buffer and avoid per-element allocations#54510

Open
LuciferYang wants to merge 3 commits into apache:master from LuciferYang:SPARK-55683-FOLLOWUP

Conversation

@LuciferYang (Contributor) commented Feb 26, 2026

What changes were proposed in this pull request?

This PR follows a suggestion from Copilot (#54479 (review)) and further optimizes VectorizedPlainValuesReader.readUnsignedLongs by introducing a reusable scratch buffer, eliminating the per-element byte[] allocations introduced in the previous refactoring.

The previous implementation allocated a new byte[] for each element's encoded output:

// Previous: new byte[totalLen] per element, plus new byte[]{0} for zero values
byte[] dest = new byte[totalLen];
...
c.putByteArray(rowId, dest, 0, totalLen);

The new implementation allocates a single byte[9] scratch buffer once per batch and reuses it across all elements. Since WritableColumnVector.putByteArray copies the bytes into its internal storage immediately, the scratch buffer can be safely overwritten on the next iteration:

// New: one byte[9] allocated per batch, reused for every element
byte[] scratch = new byte[9];
for (...) {
    putLittleEndianBytesAsBigInteger(c, rowId, src, offset, scratch);
}

The scratch buffer is sized at 9 bytes to accommodate the worst case: one 0x00 sign byte plus 8 value bytes. The zero-value special case is also handled through the scratch buffer, avoiding the previous new byte[]{0} allocation.
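To make the encoding concrete, here is a minimal standalone sketch of the conversion described above. The helper name and exact structure are assumptions for illustration, not Spark's actual putLittleEndianBytesAsBigInteger: it reverses the little-endian source bytes into big-endian order, prepends a 0x00 sign byte when the high bit of the most significant byte is set (so BigInteger reads the value as non-negative), and collapses an all-zero value to a single 0x00 byte.

```java
import java.math.BigInteger;
import java.util.Arrays;

public class UnsignedLongEncoding {
  // Hypothetical sketch: encode 8 little-endian bytes at src[offset..offset+7]
  // into scratch as a minimal big-endian, sign-safe byte sequence.
  // Returns the number of bytes written (1..9).
  static int encode(byte[] src, int offset, byte[] scratch) {
    // Find the most significant non-zero byte (src is little-endian).
    int msb = 7;
    while (msb >= 0 && src[offset + msb] == 0) msb--;
    if (msb < 0) {               // all zero: a single 0x00 byte suffices
      scratch[0] = 0;
      return 1;
    }
    int start = 0;
    // If the top bit is set, prepend a 0x00 sign byte so the value
    // is interpreted as positive rather than negative two's complement.
    if ((src[offset + msb] & 0x80) != 0) {
      scratch[0] = 0;
      start = 1;
    }
    // Copy the significant bytes in reverse (little-endian -> big-endian).
    for (int i = 0; i <= msb; i++) {
      scratch[start + i] = src[offset + msb - i];
    }
    return start + msb + 1;
  }

  public static void main(String[] args) {
    byte[] scratch = new byte[9];
    byte[] src = new byte[8];
    Arrays.fill(src, (byte) 0xFF);  // unsigned value 2^64 - 1
    int len = encode(src, 0, scratch);
    BigInteger v = new BigInteger(Arrays.copyOf(scratch, len));
    // Worst case uses all 9 bytes: sign byte + 8 value bytes.
    System.out.println(len + " " + v);
  }
}
```

For 0xFFFFFFFFFFFFFFFF the high bit is set, so all 9 scratch bytes are used, which is why 9 is the worst-case size.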

Why are the changes needed?

The previous implementation still allocated one byte[] per element for the encoded output. For a typical batch of 4096 values this means 4096 heap allocations per readUnsignedLongs call, creating GC pressure in workloads that read large UINT_64 columns. With the scratch buffer approach, the entire batch produces only 2 allocations regardless of batch size: the byte[9] scratch buffer and a byte[8] used as a fallback read buffer for direct ByteBuffers.
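The reuse is only safe because putByteArray copies the bytes into the vector's own storage before the next iteration overwrites the scratch buffer. The following sketch demonstrates this property with a stand-in class; SimpleVector is a hypothetical simplification of WritableColumnVector, not its real API.

```java
import java.util.ArrayList;
import java.util.List;

public class ScratchReuseDemo {
  // Hypothetical stand-in for WritableColumnVector: copies bytes on put,
  // which is the property that makes scratch-buffer reuse safe.
  static class SimpleVector {
    final List<byte[]> rows = new ArrayList<>();
    void putByteArray(byte[] src, int offset, int len) {
      byte[] copy = new byte[len];              // eager internal copy
      System.arraycopy(src, offset, copy, 0, len);
      rows.add(copy);
    }
  }

  public static void main(String[] args) {
    SimpleVector v = new SimpleVector();
    byte[] scratch = new byte[9];               // one allocation per batch
    for (int i = 0; i < 3; i++) {
      scratch[0] = (byte) (i + 1);              // overwrite scratch each time
      v.putByteArray(scratch, 0, 1);
    }
    // Each row retained its own value despite the shared buffer.
    System.out.println(v.rows.get(0)[0] + " " + v.rows.get(1)[0] + " " + v.rows.get(2)[0]);
  }
}
```

If the vector held a reference to the caller's array instead of copying, every row would end up seeing the last value written, and the optimization would be incorrect.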

Does this PR introduce any user-facing change?

No

How was this patch tested?

Java 17

[info] Benchmark                                                              (numValues)  Mode  Cnt       Score      Error  Units
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_offHeap_New     10000000  avgt   10  233820.658 ± 1888.523  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_offHeap_Old     10000000  avgt   10  255563.248 ± 3500.165  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_onHeap_New      10000000  avgt   10  228672.684 ± 2985.496  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_onHeap_Old      10000000  avgt   10  275756.804 ± 2065.405  us/op

Java 21

[info] Benchmark                                                              (numValues)  Mode  Cnt       Score       Error  Units
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_offHeap_New     10000000  avgt   10  241977.924 ± 15125.343  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_offHeap_Old     10000000  avgt   10  250343.470 ±  1342.509  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_onHeap_New      10000000  avgt   10  212929.948 ±  1387.671  us/op
[info] VectorizedPlainValuesReaderJMHBenchmark.readUnsignedLongs_onHeap_Old      10000000  avgt   10  274561.949 ±  1226.348  us/op

Based on these results, the onHeap path shows roughly a 17-22% improvement and the offHeap path roughly a 3-9% improvement across Java 17 and Java 21.

Was this patch authored or co-authored using generative AI tooling?

Yes. Claude Sonnet 4.6 was used to assist with writing the code.

Comment on lines +282 to +283
// putByteArray copies the bytes into arrayData(), so scratch can be safely reused
c.putByteArray(rowId, scratch, 0, totalLen);
@pan3793 (Member) commented Feb 26, 2026

Could you leave the same comment at line 262?

@LuciferYang (Contributor, Author) replied:
done
