Inlining loses ABI-relevant target feature information at `call` operations #70563

RalfJung · 2023-10-28T18:22:00Z

The behavior of the `call` instruction is (currently) very sensitive to the target features of the surrounding function: e.g. on x86-64, when an argument of type `<8 x float>` is passed, the exact way that argument is passed depends on whether the function that contains the `call` has the AVX feature enabled.

This means that the seemingly harmless operation of moving a `call` from one function to another (e.g. via inlining) can change the behavior of that `call`, which is obviously bad! This is what happens in this example, where the uninlined program would behave as intended (caller and callee use matching ABI everywhere), but the program produced by clang -emit-llvm calls `no_target_feature` in a context where the AVX feature is available, thus breaking the call.

Before:

; no target features
define internal void @no_target_feature_intermediate(float %dummy, ptr align 32 %x) unnamed_addr #0 {
start:
  %0 = load <8 x float>, ptr %x, align 32
  ; call uses "no-avx" ABI
  call void @no_target_feature(float %dummy, <8 x float> %0)
  ret void
}

; "target-features"="+avx"
define void @with_target_feature(ptr align 32 %x) unnamed_addr #2 {
start:
  %0 = alloca <8 x float>, align 32
  %1 = load <8 x float>, ptr %x, align 32
  store <8 x float> %1, ptr %0, align 32
  call void @no_target_feature_intermediate(float 0.000000e+00, ptr align 32 %0)
  ret void
}

After:

; "target-features"="+avx"
define void @with_target_feature(ptr align 32 %x) unnamed_addr #1 {
start:
  %0 = alloca <8 x float>, align 32
  %1 = load <8 x float>, ptr %x, align 32
  store <8 x float> %1, ptr %0, align 32
  %2 = load <8 x float>, ptr %0, align 32
  ; call uses "avx" ABI
  call void @no_target_feature(float 0.000000e+00, <8 x float> %2)
  ret void
}

(We are running into this in Rust, where it causes code that should be completely fine to misbehave, so it's a critical issue for us that we'd like to fix.)

The obvious way to avoid this is to not do inlining when the target features differ between caller and callee, but that seems like a big hammer that leaves a lot of optimization potential on the table -- and indeed, if I understood correctly, the most recent attempt to enforce this for all kinds of inlining (including alwaysinline) quickly led to problems.
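To make the trade-off concrete, here is a minimal sketch of that "big hammer" rule as a toy model in Rust. It is not LLVM's actual inlining legality check; the names `Func`, `Features` and `may_inline_conservative` are hypothetical and exist only for illustration.

```rust
use std::collections::BTreeSet;

/// Toy model: the target features enabled for one function, e.g. {"avx"}.
type Features = BTreeSet<&'static str>;

/// Hypothetical, heavily simplified stand-in for a function in the IR.
#[allow(dead_code)]
struct Func {
    features: Features,
    /// Number of `call` instructions in the body. The conservative rule
    /// below deliberately ignores this field.
    num_calls: usize,
}

/// The "big hammer": allow inlining only when caller and callee have
/// exactly the same target features, regardless of what the callee contains.
fn may_inline_conservative(caller: &Func, callee: &Func) -> bool {
    caller.features == callee.features
}
```

Under this rule, a callee without AVX is never inlined into a caller with AVX, even when the callee contains no calls at all and so has no call whose argument passing could change. That is the kind of optimization potential the rule gives up.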

An alternative would be to make `call` less context-dependent. After all, it's not the inlining itself that is the fundamental problem here; it's the fact that `call` doesn't know the ABI it has to use for the callee, and then uses the target features of the enclosing function to fill that information gap -- a fragile heuristic, as the inlining issue shows. So I wonder if it wouldn't be possible to still do the inlining, but end up with code like:

; "target-features"="+avx"
define void @with_target_feature(ptr align 32 %x) unnamed_addr #1 {
start:
  %0 = alloca <8 x float>, align 32
  %1 = load <8 x float>, ptr %x, align 32
  store <8 x float> %1, ptr %0, align 32
  %2 = load <8 x float>, ptr %0, align 32
  call void @no_target_feature(float 0.000000e+00, <8 x float> %2) !abi-target-features=""
  ret void
}

IOW, the `call` instruction would stop relying on context clues (which are fragile since inlining changes the context), and instead it would carry an explicit annotation saying which target features should be considered when determining how arguments must be passed for this call. This approach would enable arbitrary inlining from "less feature" to "more feature" functions, provided the inliner takes care to equip the `call` instructions it is copying with the right annotation.

(I don't know the LLVM internal data structures so I'm not sure how that annotation would best be represented, but LLVM already supports tons of annotations at various instructions so I hope adding one more isn't too hard.)
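The proposal can be sketched with a small toy model in Rust. This is an illustration under stated assumptions, not LLVM code: `Call`, `Function`, `effective_abi`, `inline_naive` and `inline_annotating` are hypothetical names, and the model only tracks which feature set decides argument passing for each call. It does not model removing the inlined call itself.

```rust
use std::collections::BTreeSet;

/// Toy model: the target features enabled for one function, e.g. {"avx"}.
type Features = BTreeSet<&'static str>;

/// Hypothetical model of a `call` instruction.
#[allow(dead_code)]
#[derive(Clone, Debug)]
struct Call {
    callee: &'static str,
    /// The proposed annotation: features to consider for argument passing.
    /// `None` models today's behavior (fall back to the enclosing function).
    abi_features: Option<Features>,
}

/// Hypothetical model of a function: its features and the calls in its body.
#[derive(Clone, Debug)]
struct Function {
    features: Features,
    calls: Vec<Call>,
}

/// Features that decide how this call passes its arguments.
fn effective_abi(call: &Call, enclosing: &Function) -> Features {
    call.abi_features
        .clone()
        .unwrap_or_else(|| enclosing.features.clone())
}

/// Naive inlining: copy the callee's calls into the caller unchanged.
fn inline_naive(caller: &mut Function, callee: &Function) {
    caller.calls.extend(callee.calls.iter().cloned());
}

/// Annotating inlining: before copying a call, record the features of the
/// function it came from, unless it already carries an annotation.
fn inline_annotating(caller: &mut Function, callee: &Function) {
    for call in &callee.calls {
        let mut copied = call.clone();
        if copied.abi_features.is_none() {
            copied.abi_features = Some(callee.features.clone());
        }
        caller.calls.push(copied);
    }
}
```

In this model, inlining `no_target_feature_intermediate` (no features) into `with_target_feature` (features `{"avx"}`) with `inline_naive` makes `effective_abi` for the copied call report `{"avx"}`, which is the mismatch described in this issue. With `inline_annotating` the copied call keeps the empty feature set it had before inlining.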


RalfJung · 2024-11-10T18:30:54Z

This limitation in LLVM also prevents us from supporting (without unnecessary overhead) some pretty natural-looking Rust programs; see rust-lang/rust#132865 for details.

cor3ntin · 2025-05-11T11:09:31Z

@dtcxzyw @thesamesam Do you know who could help look into that? Thanks!

cor3ntin · 2025-05-11T11:11:41Z

@nikic

nikic · 2025-05-11T14:31:49Z

@cor3ntin Context for your interest in this issue?

From the Rust side, I believe our understanding of this problem space has largely evolved towards a preference for target features not affecting the call ABI at all; instead, the ABI would be determined separately, at the module level (with certain ABIs requiring certain target features, of course). This is roughly the design more modern targets follow.

RalfJung · 2025-05-11T14:37:18Z

> This is roughly the design more modern targets follow.

Except that AFAIK they will still fall back to a different ABI if a target feature for the requested ABI is missing. At least some of them emit a warning on stderr when that happens...

But yeah, if all targets did this consistently and the warnings were made into hard errors then the inliner wouldn't have to be so careful around call operations any more.

cor3ntin · 2025-05-11T16:33:02Z

> @cor3ntin Context for your interest in this issue?

I was asked if I knew who to ping, no personal interest! Thanks for the answer

github-actions bot added the new issue label Oct 28, 2023

RalfJung mentioned this issue Oct 28, 2023

Inlining causes miscompilation of code that mixes target features rust-lang/rust#116573

Open

dtcxzyw added miscompilation llvm:optimizations and removed new issue labels Oct 29, 2023

thesamesam added the ABI Application Binary Interface label Oct 30, 2023

RalfJung mentioned this issue Oct 18, 2024

ABI for float builtins depends on target features #112885

Open

RalfJung mentioned this issue Nov 10, 2024

Support calling functions with SIMD vectors that couldn't be used in the caller rust-lang/rust#132865

Open
