[RISCV] Add load/store clustering in post machine schedule #111504

BoyaoWang430 · 2024-10-08T08:58:01Z

#73789 added load clustering and #73796 tried to add store clustering.
If post machine schedule is used, previous cluster of load/store which formed in machine schedule may break. In order to solve this, add load/sotre clustering to post machine schedule.

github-actions · 2024-10-08T08:58:21Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

asb · 2024-10-10T14:53:11Z

#73789 added load clustering and #73796 tried to add store clustering. If post machine schedule is used, previous cluster of load/store which formed in machine schedule may break. In order to solve this, add load/sotre clustering to post machine schedule.

Hi, I'm struggling to parse this a bit. I think you're saying that if load and store clustering aren't added to the post machine schedule in addition to being in createMachineScheduler, you see issues. Is that correct? It looks like PPC (which does store clustering only) also adds it in both functions, though AArch64 doesn't.

If you could update this PR to show any test differences that would really help with review. Thanks!

asb · 2024-10-10T15:44:45Z

FWIW, if I add a version of this on top of #73796 I get assertions in several llvm/test/CodeGen/RISCV tests

EDIT: SNIP - thanks wangpc-pp, the problem was indeed on my end!

wangpc-pp · 2024-10-10T15:50:05Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

asb · 2024-10-10T15:54:15Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

Thank you - the problem was indeed me being too eager with copy and paste and failing to switch to createGenericSchedPostRA. Now I've done that though, I see zero codegen changes within the in-tree unit tests.

wangpc-pp · 2024-10-10T16:02:15Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

Thank you - the problem was indeed me being too eager with copy and paste and failing to switch to createGenericSchedPostRA. Now I've done that though, I see zero codegen changes within the in-tree unit tests.

Thanks! I will ask @BoyaoWang430 to add some MIR tests. And thanks in advance if you can help to evaluate/review this PR. :-)

asb · 2024-10-10T16:23:17Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

Thank you - the problem was indeed me being too eager with copy and paste and failing to switch to createGenericSchedPostRA. Now I've done that though, I see zero codegen changes within the in-tree unit tests.

Thanks! I will ask @BoyaoWang430 to add some MIR tests. And thanks in advance if you can help to evaluate/review this PR. :-)

Thanks! And more generally, any note you have on how/if it affects codegen on external codebases very welcome. e.g. is this something that kicks in a lot in real-world code but we just don't trigger in our tests, or is it fairly rare it makes a difference (but of course worth addressing for the cases it helps)

BoyaoWang430 · 2024-10-11T02:15:27Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

Thank you - the problem was indeed me being too eager with copy and paste and failing to switch to createGenericSchedPostRA. Now I've done that though, I see zero codegen changes within the in-tree unit tests.

We also made the same mistake when we first tried to add this by copy-paste. I will add some MIR tests these days to demonstrate the difference that enabling load/store clustering in the post machine scheduler can make.

wangpc-pp · 2024-10-11T03:54:13Z

createGenericSchedLive

I think the reason is you are using pre-ra scheduler here.

Thank you - the problem was indeed me being too eager with copy and paste and failing to switch to createGenericSchedPostRA. Now I've done that though, I see zero codegen changes within the in-tree unit tests.

Thanks! I will ask @BoyaoWang430 to add some MIR tests. And thanks in advance if you can help to evaluate/review this PR. :-)

Thanks! And more generally, any note you have on how/if it affects codegen on external codebases very welcome. e.g. is this something that kicks in a lot in real-world code but we just don't trigger in our tests, or is it fairly rare it makes a difference (but of course worth addressing for the cases it helps)

The reason why there is no CodeGen change in in-tree tests is because postra scheduler is not enabled by default. To enable postra scheduler, we should use a CPU with FeaturePostRAScheduler or add +use-postra-scheduler to -mattr.

The problem is very common, maybe I didn't make it clear in sync-up meeting because of my bad speaking :-(.

We have added load/store clustering mutations to pre-ra scheduler, so when doing pre-ra scheduling, we can add dependencies between load/store instructions and make them together.
But, if we enable post-ra scheduling, and load/store clustering mutations are not added to post-ra scheduler, then there is no strong dependency between these load/store instructions and the scheduler may separate them.
For example:

# Before pre-ra scheduling
ld vreg0, 0(rs)
addi vreg1, vreg0, 1
ld vreg2, 8(rs)
addi vreg3, vreg2, 1

# After pre-ra scheduling
ld vreg0, 0(rs)
ld vreg2, 8(rs)
addi vreg1, vreg0, 1
addi vreg3, vreg2, 1

# After post-ra scheduling (possile current result)
ld a1, 0(a0)
addi a3, a1, 1
ld a2, 8(a0)
addi a4, a2, 1

# After post-ra scheduling (what we want)
ld a1, 0(a0)
ld a2, 8(a0)
addi a3, a1, 1
addi a4, a2, 1

I think this problem is common across targets and as what you have said PPC have already done it.

topperc · 2024-10-11T17:29:04Z

Why is this what we want?

ld a1, 0(a0)
ld a2, 8(a0)
addi a3, a1, 1
addi a4, a2, 1

What is problem if there is an add between them? Is there some fusion going on in the hardware? I understand why you want them loads ordered near each other if they access the same cache line, but I don't understand why arithmetic between them is bad?

wangpc-pp · 2024-10-12T03:40:52Z

Why is this what we want?
ld a1, 0(a0)
ld a2, 8(a0)
addi a3, a1, 1
addi a4, a2, 1
What is problem if there is an add between them? Is there some fusion going on in the hardware? I understand why you want them loads ordered near each other if they access the same cache line, but I don't understand why arithmetic between them is bad?

Oh, sorry, my example may not be exact. Several reasons:

If the result of enabling pre-ra scheduling is what we are expecting, then I think we should get the same result after post-ra scheduling, right?
Some μ-arch things require this:

As what you have pointed out, fusion is one thing.
Another thing is about store clustering. Store buffer may be impacted as we interleave arithmetic/load(which is not shown in my example) between stores.

Anyway, we saw the benefit and if it is not common for all μ-arch, we can make it a feature.

llvmbot · 2024-10-16T07:28:32Z

@llvm/pr-subscribers-backend-risc-v

Author: None (BoyaoWang430)

Changes

#73789 added load clustering and #73796 tried to add store clustering.
If post machine schedule is used, previous cluster of load/store which formed in machine schedule may break. In order to solve this, add load/sotre clustering to post machine schedule.

Full diff: https://github.com/llvm/llvm-project/pull/111504.diff

2 Files Affected:

(modified) llvm/lib/Target/RISCV/RISCVTargetMachine.cpp (+13)
(added) llvm/test/CodeGen/RISCV/misched-mem-clustering.mir (+64)

diff --git a/llvm/lib/Target/RISCV/RISCVTargetMachine.cpp b/llvm/lib/Target/RISCV/RISCVTargetMachine.cpp
index 089dc6c529193d..4c8faed5a4787a 100644
--- a/llvm/lib/Target/RISCV/RISCVTargetMachine.cpp
+++ b/llvm/lib/Target/RISCV/RISCVTargetMachine.cpp
@@ -363,6 +363,19 @@ class RISCVPassConfig : public TargetPassConfig {
     return DAG;
   }
 
+  ScheduleDAGInstrs *
+  createPostMachineScheduler(MachineSchedContext *C) const override {
+    ScheduleDAGMI *DAG = nullptr;
+    if (EnableMISchedLoadStoreClustering) {
+      DAG = createGenericSchedPostRA(C);
+      DAG->addMutation(createLoadClusterDAGMutation(
+          DAG->TII, DAG->TRI, /*ReorderWhileClustering=*/true));
+      DAG->addMutation(createStoreClusterDAGMutation(
+          DAG->TII, DAG->TRI, /*ReorderWhileClustering=*/true));
+    }
+    return DAG;
+  }
+  
   void addIRPasses() override;
   bool addPreISel() override;
   void addCodeGenPrepare() override;
diff --git a/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir b/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
new file mode 100644
index 00000000000000..4764e08969da5d
--- /dev/null
+++ b/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
@@ -0,0 +1,64 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
+# RUN: llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -mattr=+use-postra-scheduler -verify-misched -enable-post-misched=true \
+# RUN:     -riscv-misched-load-store-clustering=false -debug-only=machine-scheduler \
+# RUN:     -start-before=machine-scheduler -stop-after=postmisched -o - 2>&1 < %s \
+# RUN:   | FileCheck -check-prefix=NOCLUSTER %s
+# RUN: llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -mattr=+use-postra-scheduler -verify-misched -enable-post-misched=true \
+# RUN:     -debug-only=machine-scheduler \
+# RUN:     -start-before=machine-scheduler -stop-after=postmisched -o - 2>&1 < %s \
+# RUN:   | FileCheck -check-prefix=MEMCLUSTER %s
+
+...
+---
+name:            mem_clustering_1
+tracksRegLiveness: true
+body:             |
+  bb.0:
+    liveins: $x6, $x10, $x14, $x15, $x16, $x17
+    ; NOCLUSTER-LABEL: name: mem_clustering_1
+    ; NOCLUSTER: liveins: $x6, $x10, $x14, $x15, $x16, $x17
+    ; NOCLUSTER-NEXT: {{  $}}
+    ; NOCLUSTER-NEXT: renamable $x5 = LW renamable $x15, 0 :: (load (s32))
+    ; NOCLUSTER-NEXT: SW renamable $x14, renamable $x15, 0 :: (store (s32))
+    ; NOCLUSTER-NEXT: renamable $x11 = ADDW killed renamable $x6, killed renamable $x5
+    ; NOCLUSTER-NEXT: renamable $x7 = LW renamable $x15, 8 :: (load (s32))
+    ; NOCLUSTER-NEXT: renamable $x28 = LW renamable $x15, 16 :: (load (s32))
+    ; NOCLUSTER-NEXT: renamable $x29 = LW renamable $x15, 24 :: (load (s32))
+    ; NOCLUSTER-NEXT: renamable $x13 = ADDW killed renamable $x7, killed renamable $x28
+    ; NOCLUSTER-NEXT: SW renamable $x14, renamable $x15, 8 :: (store (s32))
+    ; NOCLUSTER-NEXT: SW renamable $x14, renamable $x15, 16 :: (store (s32))
+    ; NOCLUSTER-NEXT: SW killed renamable $x14, killed renamable $x15, 24 :: (store (s32))
+    ; NOCLUSTER-NEXT: renamable $x11 = ADDW killed renamable $x11, killed renamable $x13
+    ; NOCLUSTER-NEXT: renamable $x6 = ADDW killed renamable $x11, killed renamable $x29
+    ; NOCLUSTER-NEXT: PseudoRET
+    ;
+    ; MEMCLUSTER-LABEL: name: mem_clustering_1
+    ; MEMCLUSTER: liveins: $x6, $x10, $x14, $x15, $x16, $x17
+    ; MEMCLUSTER-NEXT: {{  $}}
+    ; MEMCLUSTER-NEXT: renamable $x5 = LW renamable $x15, 0 :: (load (s32))
+    ; MEMCLUSTER-NEXT: renamable $x7 = LW renamable $x15, 8 :: (load (s32))
+    ; MEMCLUSTER-NEXT: renamable $x28 = LW renamable $x15, 16 :: (load (s32))
+    ; MEMCLUSTER-NEXT: renamable $x29 = LW renamable $x15, 24 :: (load (s32))
+    ; MEMCLUSTER-NEXT: SW renamable $x14, renamable $x15, 0 :: (store (s32))
+    ; MEMCLUSTER-NEXT: SW renamable $x14, renamable $x15, 8 :: (store (s32))
+    ; MEMCLUSTER-NEXT: SW renamable $x14, renamable $x15, 16 :: (store (s32))
+    ; MEMCLUSTER-NEXT: SW killed renamable $x14, killed renamable $x15, 24 :: (store (s32))
+    ; MEMCLUSTER-NEXT: renamable $x11 = ADDW killed renamable $x6, killed renamable $x5
+    ; MEMCLUSTER-NEXT: renamable $x13 = ADDW killed renamable $x7, killed renamable $x28
+    ; MEMCLUSTER-NEXT: renamable $x11 = ADDW killed renamable $x11, killed renamable $x13
+    ; MEMCLUSTER-NEXT: renamable $x6 = ADDW killed renamable $x11, killed renamable $x29
+    ; MEMCLUSTER-NEXT: PseudoRET
+    renamable $x5 = LW renamable $x15, 0 :: (load (s32))
+    renamable $x7 = LW renamable $x15, 8 :: (load (s32))
+    renamable $x28 = LW renamable $x15, 16 :: (load (s32))
+    renamable $x29 = LW renamable $x15, 24 :: (load (s32))
+    SW renamable $x14, renamable $x15, 0 :: (store (s32))
+    SW renamable $x14, renamable $x15, 8 :: (store (s32))
+    SW renamable $x14, renamable $x15, 16 :: (store (s32))
+    SW renamable $x14, renamable $x15, 24 :: (store (s32))
+    renamable $x11 = ADDW killed renamable $x6, killed renamable $x5
+    renamable $x13 = ADDW killed renamable $x7, killed renamable $x28
+    renamable $x11 = ADDW killed renamable $x11, killed renamable $x13
+    renamable $x6 = ADDW killed renamable $x11, killed renamable $x29
+    PseudoRET
+...

wangpc-pp · 2024-10-16T08:04:26Z

llvm/lib/Target/RISCV/RISCVTargetMachine.cpp

+  ScheduleDAGInstrs *
+  createPostMachineScheduler(MachineSchedContext *C) const override {
+    ScheduleDAGMI *DAG = nullptr;
+    if (EnableMISchedLoadStoreClustering) {


Can we add a new debug option to control if we should enable postra clustering?

static cl::opt<bool> EnablePostMISchedLoadStoreClustering( "riscv-postmisched-load-store-clustering", cl::Hidden, cl::desc("Enable PostRA load and store clustering in the machine scheduler"), cl::init(true)); if (EnableMISchedLoadStoreClustering && EnablePostMISchedLoadStoreClustering) { ... }

llvm/test/CodeGen/RISCV/misched-mem-clustering.mir

wangpc-pp · 2024-11-05T10:00:02Z

Ping.

topperc

LGTM

If post machine schedule is used, previous cluster of load/store which formed in machine schedule may break. In order to solve this, add load/sotre clustering to post machine schedule.

github-actions · 2024-11-06T08:21:52Z

@BoyaoWang430 Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested by our build bots. If there is a problem with a build, you may receive a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as the builds can include changes from many authors. It is not uncommon for your change to be included in a build that fails due to someone else's changes, or infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself. This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

llvm-ci · 2024-11-06T08:54:03Z

LLVM Buildbot has detected a new failure on builder fuchsia-x86_64-linux running on fuchsia-debian-64-us-central1-a-1 while building llvm at step 4 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/11/builds/7711

Here is the relevant piece of the build log for the reference

Step 4 (annotate) failure: 'python ../llvm-zorg/zorg/buildbot/builders/annotated/fuchsia-linux.py ...' (failure)
...
[1339/1344] Building CXX object unittests/Transforms/Scalar/CMakeFiles/ScalarTests.dir/LoopPassManagerTest.cpp.o
clang++: warning: optimization flag '-ffat-lto-objects' is not supported [-Wignored-optimization-argument]
[1340/1344] Linking CXX executable unittests/Transforms/Scalar/ScalarTests
[1340/1344] Running the LLVM regression tests
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using ld.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/ld.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using lld-link: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/lld-link
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using ld64.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/ld64.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using wasm-ld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/wasm-ld
-- Testing: 56722 tests, 60 workers --
Testing:  0.. 10.. 20.. 30
FAIL: LLVM :: CodeGen/RISCV/misched-mem-clustering.mir (18365 of 56722)
******************** TEST 'LLVM :: CodeGen/RISCV/misched-mem-clustering.mir' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -verify-misched -enable-post-misched=false      -riscv-postmisched-load-store-clustering=false -debug-only=machine-scheduler      -start-before=machine-scheduler -stop-after=postmisched -o - 2>&1 < /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir    | /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/FileCheck -check-prefix=NOPOSTMISCHED /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -verify-misched -enable-post-misched=false -riscv-postmisched-load-store-clustering=false -debug-only=machine-scheduler -start-before=machine-scheduler -stop-after=postmisched -o -
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/FileCheck -check-prefix=NOPOSTMISCHED /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
/var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir:22:25: error: NOPOSTMISCHED-LABEL: expected string not found in input
 ; NOPOSTMISCHED-LABEL: name: mem_clustering_1
                        ^
<stdin>:1:1: note: scanning from here
llc: Unknown command line argument '-debug-only=machine-scheduler'. Try: '/var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc --help'
^
<stdin>:2:38: note: possible intended match here
llc: Did you mean '--debug-pass=machine-scheduler'?
                                     ^

Input file: <stdin>
Check file: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            1: llc: Unknown command line argument '-debug-only=machine-scheduler'. Try: '/var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc --help' 
label:22'0     X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
            2: llc: Did you mean '--debug-pass=machine-scheduler'? 
label:22'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
label:22'1                                          ?               possible intended match
>>>>>>

--

********************
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.. 
********************
Failed Tests (1):
Step 7 (check) failure: check (failure)
...
[1339/1344] Building CXX object unittests/Transforms/Scalar/CMakeFiles/ScalarTests.dir/LoopPassManagerTest.cpp.o
clang++: warning: optimization flag '-ffat-lto-objects' is not supported [-Wignored-optimization-argument]
[1340/1344] Linking CXX executable unittests/Transforms/Scalar/ScalarTests
[1340/1344] Running the LLVM regression tests
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using ld.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/ld.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using lld-link: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/lld-link
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using ld64.lld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/ld64.lld
llvm-lit: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/utils/lit/lit/llvm/config.py:506: note: using wasm-ld: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/wasm-ld
-- Testing: 56722 tests, 60 workers --
Testing:  0.. 10.. 20.. 30
FAIL: LLVM :: CodeGen/RISCV/misched-mem-clustering.mir (18365 of 56722)
******************** TEST 'LLVM :: CodeGen/RISCV/misched-mem-clustering.mir' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -verify-misched -enable-post-misched=false      -riscv-postmisched-load-store-clustering=false -debug-only=machine-scheduler      -start-before=machine-scheduler -stop-after=postmisched -o - 2>&1 < /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir    | /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/FileCheck -check-prefix=NOPOSTMISCHED /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc -mtriple=riscv64 -x mir -mcpu=sifive-p470 -verify-misched -enable-post-misched=false -riscv-postmisched-load-store-clustering=false -debug-only=machine-scheduler -start-before=machine-scheduler -stop-after=postmisched -o -
+ /var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/FileCheck -check-prefix=NOPOSTMISCHED /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir
/var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir:22:25: error: NOPOSTMISCHED-LABEL: expected string not found in input
 ; NOPOSTMISCHED-LABEL: name: mem_clustering_1
                        ^
<stdin>:1:1: note: scanning from here
llc: Unknown command line argument '-debug-only=machine-scheduler'. Try: '/var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc --help'
^
<stdin>:2:38: note: possible intended match here
llc: Did you mean '--debug-pass=machine-scheduler'?
                                     ^

Input file: <stdin>
Check file: /var/lib/buildbot/fuchsia-x86_64-linux/llvm-project/llvm/test/CodeGen/RISCV/misched-mem-clustering.mir

-dump-input=help explains the following input dump.

Input was:
<<<<<<
            1: llc: Unknown command line argument '-debug-only=machine-scheduler'. Try: '/var/lib/buildbot/fuchsia-x86_64-linux/build/llvm-build-5jw6xz06/bin/llc --help' 
label:22'0     X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
            2: llc: Did you mean '--debug-pass=machine-scheduler'? 
label:22'0     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
label:22'1                                          ?               possible intended match
>>>>>>

--

********************
Testing:  0.. 10.. 20.. 30.. 40.. 50.. 60.. 70.. 80.. 90.. 
********************
Failed Tests (1):

wangpc-pp · 2024-11-06T09:04:20Z

LLVM Buildbot has detected a new failure on builder fuchsia-x86_64-linux running on fuchsia-debian-64-us-central1-a-1 while building llvm at step 4 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/11/builds/7711
Here is the relevant piece of the build log for the reference

Should be fixed by 37ce189.

wangpc-pp requested review from lukel97, asb, topperc, wangpc-pp, michaelmaitland and dtcxzyw October 8, 2024 09:02

asb mentioned this pull request Oct 10, 2024

[RISCV] Enable store clustering by default #73796

Merged

BoyaoWang430 closed this Oct 15, 2024

BoyaoWang430 force-pushed the main branch from 78e22d5 to 29ec071 Compare October 15, 2024 12:51

BoyaoWang430 reopened this Oct 15, 2024

BoyaoWang430 force-pushed the main branch from cd8fc26 to 37a6084 Compare October 15, 2024 14:26

wangpc-pp marked this pull request as ready for review October 16, 2024 07:28

llvmbot added the backend:RISC-V label Oct 16, 2024

wangpc-pp reviewed Oct 16, 2024

View reviewed changes

BoyaoWang430 force-pushed the main branch from 37a6084 to fd87bd4 Compare October 16, 2024 08:35

topperc approved these changes Nov 5, 2024

View reviewed changes

[RISCV] Add load/store clustering in post machine schedule

13f4177

If post machine schedule is used, previous cluster of load/store which formed in machine schedule may break. In order to solve this, add load/sotre clustering to post machine schedule.

BoyaoWang430 force-pushed the main branch from fd87bd4 to 13f4177 Compare November 6, 2024 05:23

wangpc-pp merged commit 69d0bab into llvm:main Nov 6, 2024
5 of 6 checks passed

[RISCV] Add load/store clustering in post machine schedule #111504

[RISCV] Add load/store clustering in post machine schedule #111504

Uh oh!

Conversation

BoyaoWang430 commented Oct 8, 2024

Uh oh!

github-actions bot commented Oct 8, 2024

Uh oh!

asb commented Oct 10, 2024

Uh oh!

asb commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wangpc-pp commented Oct 10, 2024

Uh oh!

asb commented Oct 10, 2024

Uh oh!

wangpc-pp commented Oct 10, 2024

Uh oh!

asb commented Oct 10, 2024

Uh oh!

BoyaoWang430 commented Oct 11, 2024

Uh oh!

wangpc-pp commented Oct 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

topperc commented Oct 11, 2024

Uh oh!

wangpc-pp commented Oct 12, 2024

Uh oh!

llvmbot commented Oct 16, 2024

Uh oh!

wangpc-pp Oct 16, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wangpc-pp commented Nov 5, 2024

Uh oh!

topperc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions bot commented Nov 6, 2024

Uh oh!

llvm-ci commented Nov 6, 2024

Uh oh!

wangpc-pp commented Nov 6, 2024

Uh oh!

Uh oh!

asb commented Oct 10, 2024 •

edited

Loading

wangpc-pp commented Oct 11, 2024 •

edited

Loading