[mlir][vector] Clarify the semantics of BroadcastOp #101928

banach-space · 2024-08-05T07:32:42Z

Clarifies the semantics of vector.broadcast in the context of scalable
vectors. In particular, broadcasting a unit scalable dim, [1], is not
valid unless there's a match between the output and the input dims.
See the examples below for an illustration:

// VALID
 %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<4x[1]xf32>
// INVALID
 %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<[4]xf32>
// VALID FIXED-WIDTH EQUIVALENT
 %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32>

Documentation, the Op verifier and tests are updated accordingly.

Clarifies the semantics of `vector.broadcast` in the context of scalable vectors. In particular, broadcasting a unit scalable dim, `[1]`, is not valid unless there's a match between the output and the input dims. See the examples below for an illustration: ```mlir // VALID %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<4x[1]xf32> // INVALID %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<[4]xf32> // VALID FIXED-WIDTH EQUIVALENT %0 = vector.broadcast %arg0 : vector<1xf32> to vector<4xf32> ``` Documentation, the Op verifier and tests are updated accordingly.

llvmbot · 2024-08-05T07:33:12Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-vector

Author: Andrzej Warzyński (banach-space)

Changes

Clarifies the semantics of vector.broadcast in the context of scalable
vectors. In particular, broadcasting a unit scalable dim, [1], is not
valid unless there's a match between the output and the input dims.
See the examples below for an illustration:

// VALID
 %0 = vector.broadcast %arg0 : vector&lt;[1]xf32&gt; to vector&lt;4x[1]xf32&gt;
// INVALID
 %0 = vector.broadcast %arg0 : vector&lt;[1]xf32&gt; to vector&lt;[4]xf32&gt;
// VALID FIXED-WIDTH EQUIVALENT
 %0 = vector.broadcast %arg0 : vector&lt;1xf32&gt; to vector&lt;4xf32&gt;

Documentation, the Op verifier and tests are updated accordingly.

Full diff: https://github.com/llvm/llvm-project/pull/101928.diff

4 Files Affected:

(modified) mlir/include/mlir/Dialect/Vector/IR/VectorOps.h (+5-1)
(modified) mlir/include/mlir/Dialect/Vector/IR/VectorOps.td (+2)
(modified) mlir/lib/Dialect/Vector/IR/VectorOps.cpp (+35-10)
(modified) mlir/test/Dialect/Vector/invalid.mlir (+14)

diff --git a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.h b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.h
index ac55433fadb2f..9f61f7c866d3d 100644
--- a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.h
+++ b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.h
@@ -68,9 +68,13 @@ enum class BroadcastableToResult {
   DimensionMismatch = 2,
   SourceTypeNotAVector = 3
 };
+struct VectorDim {
+  int64_t dim;
+  bool scalableFlag;
+};
 BroadcastableToResult
 isBroadcastableTo(Type srcType, VectorType dstVectorType,
-                  std::pair<int, int> *mismatchingDims = nullptr);
+                  std::pair<VectorDim, VectorDim> *mismatchingDims = nullptr);
 
 /// Collect a set of vector-to-vector canonicalization patterns.
 void populateVectorToVectorCanonicalizationPatterns(RewritePatternSet &patterns,
diff --git a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
index 434ff3956c250..08bff3d5e1382 100644
--- a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
+++ b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
@@ -367,6 +367,8 @@ def Vector_BroadcastOp :
                                s_1     x .. x s_j x .. x s_k
                <duplication>         <potential stretch>
        ```
+    * a scalable unit dimeension, `[1]`, must match exactly.
+
     The source operand is duplicated over all the missing leading dimensions
     and stretched over the trailing dimensions where the source has a non-equal
     dimension of 1. These rules imply that any scalar broadcast (k=0) to any
diff --git a/mlir/lib/Dialect/Vector/IR/VectorOps.cpp b/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
index 5047bd925d4c5..673c128932893 100644
--- a/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
+++ b/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
@@ -2371,9 +2371,9 @@ Value BroadcastOp::createOrFoldBroadcastOp(
   return res;
 }
 
-BroadcastableToResult
-mlir::vector::isBroadcastableTo(Type srcType, VectorType dstVectorType,
-                                std::pair<int, int> *mismatchingDims) {
+BroadcastableToResult mlir::vector::isBroadcastableTo(
+    Type srcType, VectorType dstVectorType,
+    std::pair<VectorDim, VectorDim> *mismatchingDims) {
   // Broadcast scalar to vector of the same element type.
   if (srcType.isIntOrIndexOrFloat() && dstVectorType &&
       getElementTypeOrSelf(srcType) == getElementTypeOrSelf(dstVectorType))
@@ -2391,12 +2391,28 @@ mlir::vector::isBroadcastableTo(Type srcType, VectorType dstVectorType,
   // (all leading dimensions are simply duplicated).
   int64_t lead = dstRank - srcRank;
   for (int64_t r = 0; r < srcRank; ++r) {
+    bool mismatch = false;
+
+    // Check fixed-width dims
     int64_t srcDim = srcVectorType.getDimSize(r);
     int64_t dstDim = dstVectorType.getDimSize(lead + r);
-    if (srcDim != 1 && srcDim != dstDim) {
+    if ((srcDim != 1 && srcDim != dstDim))
+      mismatch = true;
+
+    // Check scalable flags
+    bool srcDimScalableFlag = srcVectorType.getScalableDims()[r];
+    bool dstDimScalableFlag = dstVectorType.getScalableDims()[lead + r];
+    if ((srcDim == 1 && srcDimScalableFlag && dstDim != 1) ||
+        (srcDimScalableFlag && !dstDimScalableFlag))
+      mismatch = true;
+
+    if (mismatch) {
       if (mismatchingDims) {
-        mismatchingDims->first = srcDim;
-        mismatchingDims->second = dstDim;
+        mismatchingDims->first.dim = srcDim;
+        mismatchingDims->first.scalableFlag = srcDimScalableFlag;
+
+        mismatchingDims->second.dim = dstDim;
+        mismatchingDims->second.scalableFlag = dstDimScalableFlag;
       }
       return BroadcastableToResult::DimensionMismatch;
     }
@@ -2406,16 +2422,25 @@ mlir::vector::isBroadcastableTo(Type srcType, VectorType dstVectorType,
 }
 
 LogicalResult BroadcastOp::verify() {
-  std::pair<int, int> mismatchingDims;
+  std::pair<VectorDim, VectorDim> mismatchingDims;
   BroadcastableToResult res = isBroadcastableTo(
       getSourceType(), getResultVectorType(), &mismatchingDims);
   if (res == BroadcastableToResult::Success)
     return success();
   if (res == BroadcastableToResult::SourceRankHigher)
     return emitOpError("source rank higher than destination rank");
-  if (res == BroadcastableToResult::DimensionMismatch)
-    return emitOpError("dimension mismatch (")
-           << mismatchingDims.first << " vs. " << mismatchingDims.second << ")";
+  if (res == BroadcastableToResult::DimensionMismatch) {
+    std::string msg =
+        (Twine("dimension mismatch (") +
+         (mismatchingDims.first.scalableFlag ? "[" : "") +
+         std::to_string(mismatchingDims.first.dim) +
+         (mismatchingDims.first.scalableFlag ? "]" : "") + " vs. " +
+         (mismatchingDims.second.scalableFlag ? "[" : "") +
+         std::to_string(mismatchingDims.second.dim) +
+         (mismatchingDims.second.scalableFlag ? "]" : "") + ")")
+            .str();
+    return emitOpError(msg);
+  }
   if (res == BroadcastableToResult::SourceTypeNotAVector)
     return emitOpError("source type is not a vector");
   llvm_unreachable("unexpected vector.broadcast op error");
diff --git a/mlir/test/Dialect/Vector/invalid.mlir b/mlir/test/Dialect/Vector/invalid.mlir
index 00914c1d1baf6..6dd690be032c7 100644
--- a/mlir/test/Dialect/Vector/invalid.mlir
+++ b/mlir/test/Dialect/Vector/invalid.mlir
@@ -35,6 +35,20 @@ func.func @broadcast_dim2_mismatch(%arg0: vector<4x8xf32>) {
 
 // -----
 
+func.func @broadcast_scalable_unit_dim(%arg0: vector<[1]xf32>) {
+  // expected-error@+1 {{'vector.broadcast' op dimension mismatch ([1] vs. [4])}}
+  %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<[4]xf32>
+}
+
+// -----
+
+func.func @broadcast_scalable_to_fixed(%arg0: vector<[1]xf32>) {
+  // expected-error@+1 {{'vector.broadcast' op dimension mismatch ([1] vs. 1)}}
+  %0 = vector.broadcast %arg0 : vector<[1]xf32> to vector<4x1xf32>
+}
+
+// -----
+
 func.func @broadcast_unknown(%arg0: memref<4x8xf32>) {
   // expected-error@+1 {{'vector.broadcast' op source type is not a vector}}
   %1 = vector.broadcast %arg0 : memref<4x8xf32> to vector<1x8xf32>

nujaa

Hi, I think there's a missed opportunity here. 😃

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

nujaa · 2024-08-05T08:33:57Z

mlir/include/mlir/Dialect/Vector/IR/VectorOps.h

+struct VectorDim {
+  int64_t dim;
+  bool scalableFlag;
+};


There was this MR from @MacDue , implementing similar features. I dont know why it got closed though.
#96236

This PR is unrelated to that discussion. I'm only adding this here to avoid adding new set of params to isBroadcastableTo.

I believe that before we commit to any new wider API, we should discuss the internal representation of VectorType and how scalable dimensions are represented. I am working on a proposal, but that's not yet ready to share 😅 I'm hoping to have something in the coming weeks.

Ouuh, exciting.

nujaa · 2024-08-05T09:28:51Z

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

+    bool srcDimScalableFlag = srcVectorType.getScalableDims()[r];
+    bool dstDimScalableFlag = dstVectorType.getScalableDims()[lead + r];
+    if ((srcDim == 1 && srcDimScalableFlag && dstDim != 1) ||
+        (srcDimScalableFlag && !dstDimScalableFlag))


It got me thinking, what would be the expected behaviour of something like:

%0 = vector.broadcast %arg0 : vector<nxf32> to vector<[n]xf32>

IMO it should not be supported as physically equivalent to a usecase

%1 = vector.broadcast %arg0 : vector<nxf32> to vector<vscale*nxf32>

Which is not invalid for fixed dimensions. Do you think this handles the cases ?

Suggested change

(srcDimScalableFlag && !dstDimScalableFlag))

(srcDimScalableFlag != dstDimScalableFlag))

If you have e.g. [2] and [4] (i.e. vscale * 2 and vscale * 4), then that's already "rejected" as "mismatching dims":

llvm-project/mlir/lib/Dialect/Vector/IR/VectorOps.cpp

Line 2396 in 0dcada9

if (srcDim != 1 && srcDim != dstDim) {

Is that the case you had in mind?

The case I pointed out was more src = 2 and dest = [2]. srcDim == dstDim, so no mismatch on line 2399. and we have !srcDimScalableFlag so no mismatch on line 2406. Whereas I think this is wrong.

%0 = vector.broadcast %arg0 : vector<2xf32> to vector<[2]xf32>

Ah, nice, great catch! In my head I had one case that wouldn't work with !=, but now I am failing to recall that 😂

Let me send an update - thanks very much for pointing this out 🙏🏻

Address comments from Hugo - thank you!

mlir/include/mlir/Dialect/Vector/IR/VectorOps.h

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

github-actions · 2024-08-06T09:48:56Z

✅ With the latest revision this PR passed the C/C++ code formatter.

Address comments from Jakub

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

More comments and simplifications

…roadcastOp Avoid using llvm::Twine

banach-space · 2024-08-07T16:07:19Z

Ping @kuhar :)

banach-space · 2024-08-07T19:23:11Z

@nujaa Any other suggestions from you or shall I land it?

banach-space requested review from dcaballe, nicolasvasilache and kuhar as code owners August 5, 2024 07:32

llvmbot added mlir:vectorops mlir mlir:vector labels Aug 5, 2024

banach-space requested review from c-rhodes and MacDue August 5, 2024 07:33

banach-space requested a review from nujaa August 5, 2024 07:53

banach-space mentioned this pull request Aug 5, 2024

[mlir][vector] Add more tests for ConvertVectorToLLVM (1/n) #101936

Merged

nujaa reviewed Aug 5, 2024

View reviewed changes

fixup! [mlir][vector] Clarify the semantics of BroadcastOp

74d843c

Address comments from Hugo - thank you!

kuhar reviewed Aug 6, 2024

View reviewed changes

fixup! fixup! [mlir][vector] Clarify the semantics of BroadcastOp

89be55b

Address comments from Jakub

banach-space force-pushed the andrzej/restrict_vec_bcast branch from 587f08e to 89be55b Compare August 6, 2024 09:54

kuhar reviewed Aug 6, 2024

View reviewed changes

mlir/lib/Dialect/Vector/IR/VectorOps.cpp Outdated Show resolved Hide resolved

mlir/lib/Dialect/Vector/IR/VectorOps.cpp Outdated Show resolved Hide resolved

mlir/lib/Dialect/Vector/IR/VectorOps.cpp Show resolved Hide resolved

banach-space added 2 commits August 6, 2024 16:28

fixup! fixup! fixup! [mlir][vector] Clarify the semantics of BroadcastOp

001a348

More comments and simplifications

fixup! fixup! fixup! fixup! [mlir][vector] Clarify the semantics of B…

ab40831

…roadcastOp Avoid using llvm::Twine

banach-space mentioned this pull request Aug 6, 2024

[mlir][vector] Add more tests for ConvertVectorToLLVM (2/n) #102203

Merged

kuhar approved these changes Aug 7, 2024

View reviewed changes

nujaa merged commit 1919db9 into llvm:main Aug 8, 2024
7 checks passed

	(srcDimScalableFlag && !dstDimScalableFlag))
	(srcDimScalableFlag != dstDimScalableFlag))

[mlir][vector] Clarify the semantics of BroadcastOp #101928

[mlir][vector] Clarify the semantics of BroadcastOp #101928

Uh oh!

Conversation

banach-space commented Aug 5, 2024

Uh oh!

llvmbot commented Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nujaa left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nujaa Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

banach-space Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

nujaa Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

nujaa Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

banach-space Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

nujaa Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

banach-space Aug 5, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Aug 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

banach-space commented Aug 7, 2024

Uh oh!

banach-space commented Aug 7, 2024

Uh oh!

Uh oh!

Uh oh!

llvmbot commented Aug 5, 2024 •

edited

Loading

nujaa left a comment •

edited

Loading

github-actions bot commented Aug 6, 2024 •

edited

Loading