perf(codegen): Eliminate `size_of_val == 0` for DSTs with Non-zero-sized Prefix via NUW and Assume by TKanX · Pull Request #152843 · rust-lang/rust

TKanX · 2026-02-19T11:40:00Z

Summary:

Problem:

size_of_val(p) == 0 fails to optimize away for DST types that have a statically-known non-zero-sized prefix:

pub struct Foo<T: ?Sized>(pub [u32; 3], pub T);

pub fn demo(p: &Foo<dyn std::fmt::Debug>) -> bool {
    std::mem::size_of_val(p) == 0  // always false, but LLVM can't prove it
}

Foo has a 12-byte prefix, so its total size is always ≥ 12. Yet the comparison persists as a runtime computation in LLVM IR. This matters because Box<dyn T> drop emits this exact check to guard the deallocation call — for types with a guaranteed non-zero prefix, the branch should vanish but doesn't.

The slice tail variant Foo<[i32]> already optimized correctly; Foo<dyn Trait> and Foo<[u8]> did not.

Root Cause:

In size_and_align_of_dst (the ADT/Tuple branch), the size computation is:

full_size = (offset + unsized_size + (align-1)) & -align

LLVM cannot prove full_size > 0 because:

offset + unsized_size used plain add — no NUW flag, so LLVM cannot conclude the result is ≥ offset.
(x + addend) & -align — LLVM has no information that alignment rounding never reduces the value below x.

Additionally, the vtable alignment range metadata was [1, u64::MAX] (only non-zero), despite the actual bound being [1, 1 << (ptr_width - 1)] (all alignments are powers of two with a tighter upper bound).

Solution:

Three minimal additions, each grounded in a precise invariant:

add nuw on offset + unsized_size — sound because both operands are ≤ isize::MAX for any valid Rust object, so unsigned overflow is impossible. Tells LLVM: unrounded_size ≥ offset.
assume(full_size ≥ unrounded_size) — round_up(x, a) ≥ x is a mathematical identity for power-of-two a. Tells LLVM: full_size ≥ unrounded_size ≥ offset. If offset > 0, the chain proves full_size > 0.
Tighten vtable alignment range from [1, u64::MAX] to [1, 1 << (ptr_width - 1)] — consistent with Rust's alignment constraints. Applied in both size_of_val.rs and the vtable_align intrinsic in mir/intrinsic.rs.

LLVM IR Comparison:

Foo<dyn Debug> — before (godbolt):

define noundef zeroext i1 @demo(ptr %p.0, ptr %p.1) {
start:
  %0 = getelementptr inbounds nuw i8, ptr %p.1, i64 8
  %1 = load i64, ptr %0, align 8, !range !3, !invariant.load !4
  %2 = getelementptr inbounds nuw i8, ptr %p.1, i64 16
  %3 = load i64, ptr %2, align 8, !range !5, !invariant.load !4
  %4 = tail call i64 @llvm.umax.i64(i64 %3, i64 4)
  %5 = add nuw i64 %1, 11
  %6 = add i64 %5, %4
  %7 = sub i64 0, %4
  %8 = and i64 %6, %7
  %_0 = icmp eq i64 %8, 0
  ret i1 %_0
}

Foo<dyn Debug> — after:

define noundef zeroext i1 @demo(ptr %p.0, ptr %p.1) {
start:
  ret i1 false
}

Foo<[u8]> — before:

define noundef zeroext i1 @demo_lessalignedslice(ptr %p.0, i64 %p.1) {
start:
  %0 = add i64 %p.1, 15
  %_0 = icmp ult i64 %0, 4
  ret i1 %_0
}

Foo<[u8]> — after:

define noundef zeroext i1 @demo_lessalignedslice(ptr %p.0, i64 %p.1) {
start:
  ret i1 false
}

Changes:

compiler/rustc_codegen_ssa/src/size_of_val.rs: add → unchecked_uadd (NUW) on offset + unsized_size; add assume(full_size ≥ unrounded_size); tighten vtable alignment range.
compiler/rustc_codegen_ssa/src/mir/intrinsic.rs: tighten alignment range on the vtable_align intrinsic, consistent with the above.
tests/codegen-llvm/dst-vtable-align-nonzero.rs: update FileCheck metadata expectation to match the new tighter range.
tests/codegen-llvm/dst-size-of-val-not-zst.rs: new codegen test verifying size_of_val == 0 folds to ret i1 false for Foo<dyn Debug>, Foo<[u8]>, and Foo<[i32]>.

Fixes #152788.

TKanX · 2026-02-20T19:33:41Z

@rustbot label +A-LLVM +A-codegen +C-optimization +T-compiler

fmease · 2026-02-21T22:14:41Z

r? codegen

compiler/rustc_codegen_ssa/src/size_of_val.rs

rustbot · 2026-02-22T00:16:08Z

Reminder, once the PR becomes ready for a review, use @rustbot ready.

…= 0` for non-ZST DSTs

… on non-ZST DSTs

rustbot · 2026-02-22T05:32:48Z

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

TKanX · 2026-02-22T05:34:22Z

@rustbot ready

scottmcm · 2026-02-22T19:00:51Z

compiler/rustc_codegen_ssa/src/size_of_val.rs

+            // Alignment rounding can only increase the size, never decrease it:
+            // `round_up(x, a) >= x` for power-of-two `a`. With the `nuw` on the
+            // addition above, LLVM can therefore deduce
+            // `full_size >= unrounded_size >= offset`, which proves `full_size > 0`
+            // for types with a non-zero-sized prefix (#152788).
+            let size_ge = bx.icmp(IntPredicate::IntUGE, full_size, unrounded_size);
+            bx.assume(size_ge);


Can you elaborate on which things you tried and why this is the best one? Was it not enough to say that the alignment is a power-of-two? Or...

I ask because most of the text in the OP is just useless LLM slop, and the updates to the tests make me suspicious.

dianqk · 2026-02-22T19:06:58Z

r? scottmcm

rustbot assigned fmease Feb 19, 2026

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Feb 19, 2026

This comment has been minimized.

Sign in to view

rustbot added A-codegen Area: Code generation A-LLVM Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues. C-optimization Category: An issue highlighting optimization opportunities or PRs implementing such labels Feb 20, 2026

rustbot assigned dianqk and unassigned fmease Feb 21, 2026

This comment has been minimized.

Sign in to view

scottmcm requested changes Feb 22, 2026

View reviewed changes

compiler/rustc_codegen_ssa/src/size_of_val.rs Outdated Show resolved Hide resolved

compiler/rustc_codegen_ssa/src/size_of_val.rs Outdated Show resolved Hide resolved

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 22, 2026

TKanX added 2 commits February 21, 2026 21:31

perf(codegen): Use nuw nsw and assume to eliminate `size_of_val =…

12a18ee

…= 0` for non-ZST DSTs

test(codegen): Add regression test for size_of_val == 0 elimination…

8339cfe

… on non-ZST DSTs

TKanX force-pushed the bugfix/152788-codegen-dst-size-nuw-assume branch from a9ec27f to 8339cfe Compare February 22, 2026 05:32

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Feb 22, 2026

TKanX requested a review from scottmcm February 22, 2026 05:34

scottmcm reviewed Feb 22, 2026

View reviewed changes

rustbot assigned scottmcm and unassigned dianqk Feb 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

perf(codegen): Eliminate `size_of_val == 0` for DSTs with Non-zero-sized Prefix via NUW and Assume#152843

perf(codegen): Eliminate `size_of_val == 0` for DSTs with Non-zero-sized Prefix via NUW and Assume#152843
TKanX wants to merge 2 commits intorust-lang:mainfrom
TKanX:bugfix/152788-codegen-dst-size-nuw-assume

TKanX commented Feb 19, 2026 •

edited by rustbot

Loading

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

TKanX commented Feb 20, 2026

Uh oh!

fmease commented Feb 21, 2026

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

rustbot commented Feb 22, 2026

Uh oh!

rustbot commented Feb 22, 2026

Uh oh!

TKanX commented Feb 22, 2026

Uh oh!

scottmcm Feb 22, 2026

Uh oh!

scottmcm Feb 22, 2026

Uh oh!

dianqk commented Feb 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Comments

Conversation

TKanX commented Feb 19, 2026 • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary:

Problem:

Root Cause:

Solution:

LLVM IR Comparison:

Changes:

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

TKanX commented Feb 20, 2026

Uh oh!

fmease commented Feb 21, 2026

Uh oh!

This comment has been minimized.

Uh oh!

Uh oh!

rustbot commented Feb 22, 2026

Uh oh!

rustbot commented Feb 22, 2026

Uh oh!

TKanX commented Feb 22, 2026

Uh oh!

scottmcm Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

scottmcm Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

dianqk commented Feb 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

TKanX commented Feb 19, 2026 •

edited by rustbot

Loading