codegen: use direct value PHI for pointer-free aggregate types #59914

KristofferC · 2025-10-20T10:46:42Z

Fixes #59906 from my testing, takes the runtime of the code in there from 4000 ns to 1000 ns on a M4 macbook.

Developed together with Claude 🤖

My understanding of the code here is rudimentary but I still put up this PR in case it is helpful.

vtjnash · 2025-10-20T13:11:53Z

It is correctly implemented, but has quite a few known catastrophic performance failures, so we should not do this.

KristofferC · 2025-10-20T13:13:21Z

It is correctly implemented, but has quite a few known catastrophic performance failures, so we should not do this.

Do you have an example (would nanosoldier show it)? Any other way to get back the performance lost in 25cbe00?

vtjnash · 2025-10-20T13:36:21Z

Yeah, looks like an LLVM bug (this particularly one has been a super common one, and was supposed to be fixed by opaque pointers). But we can add the hack back that usually does okay to often work around that

vtjnash · 2025-10-20T13:44:07Z

We just need to audit all calls to emit_static_alloca and make sure they use the old (pre-opaque pointer) GEP type instead of actually benefitting from LLVM's enormous amount of opaque pointer work. I suspect the SROA pass is still at fault here for the performance issues.

oscardssmith · 2025-10-20T17:34:21Z

wait, we want to use the old version?

gbaraldi · 2025-10-20T19:53:00Z

llvm/llvm-project#164308 I did some snooping around and opened that. I couldn't minimize it much further but the difference is LLVM thinks that an alloca of floats is meaningfully different than an alloca of ints. If anyone wants to take a stab the issue is probably in https://github.com/llvm/llvm-project/blob/e6b4a21849f0588b1c4fb39802a3999d7ac51dad/llvm/lib/Transforms/Scalar/SROA.cpp#L4885-L4966

codegen: use direct value PHI for pointer-free aggregate types

42b6510

KristofferC requested a review from vtjnash October 20, 2025 10:46

KristofferC added performance Must go faster compiler:codegen Generation of LLVM IR and native code backport 1.12 Change should be backported to release-1.12 labels Oct 20, 2025

KristofferC mentioned this pull request Oct 21, 2025

Backports for 1.12.2 #59920

Open

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

codegen: use direct value PHI for pointer-free aggregate types #59914

codegen: use direct value PHI for pointer-free aggregate types #59914

KristofferC commented Oct 20, 2025 •

edited

Loading

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

KristofferC commented Oct 20, 2025

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

oscardssmith commented Oct 20, 2025

Uh oh!

gbaraldi commented Oct 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Uh oh!

codegen: use direct value PHI for pointer-free aggregate types #59914

Are you sure you want to change the base?

codegen: use direct value PHI for pointer-free aggregate types #59914

Conversation

KristofferC commented Oct 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

KristofferC commented Oct 20, 2025

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

vtjnash commented Oct 20, 2025

Uh oh!

oscardssmith commented Oct 20, 2025

Uh oh!

gbaraldi commented Oct 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

KristofferC commented Oct 20, 2025 •

edited

Loading