
perf: use aligned pointer reads for SparkUnsafeRow field accessors #3670

Merged
andygrove merged 4 commits into apache:main from andygrove:perf/aligned-reads-spark-unsafe-row on Mar 12, 2026

Conversation

@andygrove
Member

Which issue does this PR close?

Performance optimization for native shuffle row-to-columnar conversion.

Rationale for this change

The SparkUnsafeObject trait previously used a one-size-fits-all approach for reading primitive values: creating a byte slice via from_raw_parts, then calling from_le_bytes(slice.try_into().unwrap()). This incurs unnecessary overhead from slice creation, try_into, and unwrap on every field access.

The two implementors of SparkUnsafeObject have fundamentally different alignment guarantees:

  • SparkUnsafeRow: All field offsets are always 8-byte aligned. The JVM guarantees 8-byte alignment on the base address, bitset_width is a multiple of 8, and each field slot is 8 bytes. This means aligned ptr::read() is safe and optimal.
  • SparkUnsafeArray: The array base address may be unaligned when nested within a row's variable-length region (accessed via arbitrary byte offset), so ptr::read_unaligned() is required for correctness.

What changes are included in this PR?

  • Move primitive accessor method implementations (get_int, get_long, get_float, get_double, etc.) out of the trait defaults and into each concrete impl block via an impl_primitive_accessors! macro parameterized on the read method (read vs read_unaligned).
  • SparkUnsafeRow uses ptr::read() (aligned) — avoids the from_le_bytes + slice overhead.
  • SparkUnsafeArray uses ptr::read_unaligned() — correct for potentially unaligned data.
  • Switch is_null_at and set_not_null_at in SparkUnsafeRow from read_unaligned/write_unaligned to aligned read/write, since the null bitset words are always at 8-byte aligned offsets within the row.

How are these changes tested?

Existing tests cover these code paths. The change is purely an optimization of pointer read methods — no behavioral change. cargo clippy and cargo check pass cleanly.

SparkUnsafeRow field offsets are always 8-byte aligned (the JVM
guarantees 8-byte alignment on the base address, bitset_width is a
multiple of 8, and each field slot is 8 bytes). This means we can
safely use ptr::read() instead of the from_le_bytes(slice) pattern
for all typed accesses, avoiding slice creation and try_into overhead.

Move primitive accessor implementations out of the SparkUnsafeObject
trait defaults and into each concrete impl via a macro parameterized
on the read method:
- SparkUnsafeRow uses ptr::read() (aligned)
- SparkUnsafeArray uses ptr::read_unaligned() (may be unaligned when
  nested in a row's variable-length region)

Also switch is_null_at/set_not_null_at in SparkUnsafeRow from
read_unaligned/write_unaligned to aligned read/write, since the null
bitset is always at 8-byte aligned offsets within the row.
@andygrove andygrove marked this pull request as ready for review March 11, 2026 19:36
The test_append_null_struct_field_to_struct_builder test used a plain
[u8; 16] stack buffer with no alignment guarantee. Since is_null_at
performs aligned i64 reads, Miri flags this as undefined behavior when
the buffer lands at a non-8-byte-aligned address.

Wrap the buffer in a #[repr(align(8))] struct to match the alignment
that real Spark UnsafeRow data always has from JVM memory.
@andygrove andygrove force-pushed the perf/aligned-reads-spark-unsafe-row branch from 729f044 to 8bc5761 on March 11, 2026 20:04
Contributor

@mbutrovich left a comment


Thanks @andygrove! Like I mentioned before, this seems to compile down to the same code, so I'm not expecting performance differences, but it's more readable/maintainable.

@andygrove
Member Author

Thanks for the review @mbutrovich

@andygrove andygrove merged commit a05c568 into apache:main Mar 12, 2026
206 of 213 checks passed
@andygrove andygrove deleted the perf/aligned-reads-spark-unsafe-row branch March 12, 2026 13:51
@martin-g
Member

I was just looking into this PR.
A note: ptr::read()/read_unaligned() read in native order. Spark Unsafe uses Little Endian. So, the new code won't work on Big Endian machines.

@mbutrovich
Contributor

I was just looking into this PR. A note: ptr::read()/read_unaligned() read in native order. Spark Unsafe uses Little Endian. So, the new code won't work on Big Endian machines.

AFAIK arrow-rs' position on big endian support has not changed (i.e., it does not support big endian). apache/arrow-rs#6917 (comment)

It's possible we've acquired some code that supports both targets, but we likely only need to consider little endian.
