[AArch64] Removed redundant FMOV instruction for truncstores of f64/f32 via bitcast to i64/i32/i8. #149997

Amichaxx · 2025-07-22T11:04:46Z

Previously, storing the low bits of a double, which was bitcast to i64 and truncated to i32 or i16, would emit a redundant FMOV. This patch introduces new TableGen patterns to avoid the unnecessary FMOV. Tests added: bitcast_truncstore.ll

github-actions · 2025-07-22T11:05:07Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2025-07-22T11:05:39Z

@llvm/pr-subscribers-backend-aarch64

Author: Amina Chabane (Amichaxx)

Changes

Previously, storing the low bits of a double, which was bitcast to i64 and truncated to i32 or i16, would emit a redundant FMOV. This patch introduces new TableGen patterns to avoid the unnecessary FMOV. Tests added: bitcast_truncstore.ll

Full diff: https://github.com/llvm/llvm-project/pull/149997.diff

2 Files Affected:

(modified) llvm/lib/Target/AArch64/AArch64InstrInfo.td (+8)
(added) llvm/test/CodeGen/AArch64/bitcast_truncstore.ll (+26)

diff --git a/llvm/lib/Target/AArch64/AArch64InstrInfo.td b/llvm/lib/Target/AArch64/AArch64InstrInfo.td
index 0cb7b02d84a6e..aa635b188da70 100644
--- a/llvm/lib/Target/AArch64/AArch64InstrInfo.td
+++ b/llvm/lib/Target/AArch64/AArch64InstrInfo.td
@@ -4649,6 +4649,14 @@ let Predicates = [IsLE] in {
             (STRQui FPR128:$Rt, GPR64sp:$Rn, uimm12s16:$offset)>;
 }
 
+// truncstorei32 of f64 bitcasted to i64
+def : Pat<(truncstorei32 (i64 (bitconvert (f64 FPR64:$Rt))), (am_indexed32 GPR64sp:$Rn, uimm12s4:$offset)),
+          (STRSui (EXTRACT_SUBREG FPR64:$Rt, ssub), GPR64sp:$Rn, uimm12s4:$offset)>;
+
+// truncstorei16 of f64 bitcasted to i64
+def : Pat<(truncstorei16 (i64 (bitconvert (f64 FPR64:$Rt))), (am_indexed16 GPR64sp:$Rn, uimm12s2:$offset)),
+          (STRHui (f16 (EXTRACT_SUBREG FPR64:$Rt, hsub)), GPR64sp:$Rn, uimm12s2:$offset)>;     
+
 // truncstore i64
 def : Pat<(truncstorei32 GPR64:$Rt,
                          (am_indexed32 GPR64sp:$Rn, uimm12s4:$offset)),
diff --git a/llvm/test/CodeGen/AArch64/bitcast_truncstore.ll b/llvm/test/CodeGen/AArch64/bitcast_truncstore.ll
new file mode 100644
index 0000000000000..8e0d0c2158090
--- /dev/null
+++ b/llvm/test/CodeGen/AArch64/bitcast_truncstore.ll
@@ -0,0 +1,26 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
+; RUN: llc -mtriple=aarch64-linux-gnu -o - %s | FileCheck %s
+
+define void @_Z10store_i64_from_f64Pjd(ptr %n, double noundef %x){
+; CHECK-LABEL: _Z10store_i64_from_f64Pjd:
+; CHECK:       // %bb.0: // %entry
+; CHECK-NEXT:    str s0, [x0]
+; CHECK-NEXT:    ret
+entry:
+  %0 = bitcast double %x to i64
+  %conv = trunc i64 %0 to i32
+  store i32 %conv, ptr %n, align 4
+  ret void
+}
+
+define void @_Z9store_i16Ptd(ptr %n, double noundef %x) {
+; CHECK-LABEL: _Z9store_i16Ptd:
+; CHECK:       // %bb.0: // %entry
+; CHECK-NEXT:    str h0, [x0]
+; CHECK-NEXT:    ret
+entry:
+  %0 = bitcast double %x to i64
+  %conv = trunc i64 %0 to i16
+  store i16 %conv, ptr %n, align 2
+  ret void
+}

davemgreen · 2025-07-23T08:46:15Z

Can we add patterns and tests for the other types, similar to #146920?

… (f64/f32 → i32/i16/i8X)

CarolineConcatto

Thank you Amina,

llvm/test/CodeGen/AArch64/bitcast_truncstore.ll

llvm/lib/Target/AArch64/AArch64InstrInfo.td

CarolineConcatto

Thank you Amina,
The patch looks good.
Can you align the stores before merging the patch please.

llvm/lib/Target/AArch64/AArch64InstrInfo.td

Amichaxx · 2025-09-08T08:51:39Z

@davemgreen Hi, just wondering if the changes look okay to you? Thanks.

davemgreen

Yep, LGTM. Do you want me to hit submit?

Amichaxx · 2025-09-08T09:08:47Z

Yes please :)

github-actions · 2025-09-08T09:35:26Z

@Amichaxx Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested by our build bots. If there is a problem with a build, you may receive a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as the builds can include changes from many authors. It is not uncommon for your change to be included in a build that fails due to someone else's changes, or infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself. This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

llvmbot added the backend:AArch64 label Jul 22, 2025

hstk30-hw requested a review from davemgreen July 23, 2025 01:45

davemgreen requested a review from CarolineConcatto July 23, 2025 08:46

Amichaxx force-pushed the aarch64-store-opt branch 2 times, most recently from f0b3edf to aec8bb3 Compare August 18, 2025 10:28

Amichaxx changed the title ~~[AArch64] Removed redundant FMOV instruction for truncstores of f64 via bitcast to i64.~~ [AArch64] Removed redundant FMOV instruction for truncstores of f64/f32 via bitcast to i64/i32/i8. Aug 18, 2025

[AArch64] Add TableGen patterns for truncstore of bitcasted FP values…

97f61b4

… (f64/f32 → i32/i16/i8X)

Amichaxx force-pushed the aarch64-store-opt branch from aec8bb3 to 97f61b4 Compare August 18, 2025 10:35

CarolineConcatto reviewed Aug 18, 2025

View reviewed changes

llvm/test/CodeGen/AArch64/bitcast_truncstore.ll Outdated Show resolved Hide resolved

llvm/test/CodeGen/AArch64/bitcast_truncstore.ll Outdated Show resolved Hide resolved

llvm/test/CodeGen/AArch64/bitcast_truncstore.ll Outdated Show resolved Hide resolved

Added tests for converting from int to fp, renamed tests

3faa302

davemgreen reviewed Aug 18, 2025

View reviewed changes

llvm/lib/Target/AArch64/AArch64InstrInfo.td Outdated Show resolved Hide resolved

llvm/lib/Target/AArch64/AArch64InstrInfo.td Outdated Show resolved Hide resolved

Removed predicates from patterns and feature flags from test

efa6fab

CarolineConcatto approved these changes Aug 19, 2025

View reviewed changes

llvm/lib/Target/AArch64/AArch64InstrInfo.td Show resolved Hide resolved

Alignment

a2169a9

davemgreen approved these changes Sep 8, 2025

View reviewed changes

davemgreen merged commit 3b19717 into llvm:main Sep 8, 2025
9 checks passed

[AArch64] Removed redundant FMOV instruction for truncstores of f64/f32 via bitcast to i64/i32/i8. #149997

[AArch64] Removed redundant FMOV instruction for truncstores of f64/f32 via bitcast to i64/i32/i8. #149997

Uh oh!

Conversation

Amichaxx commented Jul 22, 2025

Uh oh!

github-actions bot commented Jul 22, 2025

Uh oh!

llvmbot commented Jul 22, 2025

Uh oh!

davemgreen commented Jul 23, 2025

Uh oh!

CarolineConcatto left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CarolineConcatto left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Amichaxx commented Sep 8, 2025

Uh oh!

davemgreen left a comment

Choose a reason for hiding this comment

Uh oh!

Amichaxx commented Sep 8, 2025

Uh oh!

Uh oh!

github-actions bot commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants