[CIR] Vector types - part 1 #347

dkolsen-pgi · 2023-12-07T01:07:12Z

This is the first part of implementing vector types and vector operations in ClangIR, issue #284. This is enough to compile this test program. I haven't tried to do anything beyond that yet.

typedef int int4 __attribute__((vector_size(16)));
int main(int argc, char** argv) {
  int4 a = { 1, argc, argc + 1, 4 };
  int4 b = { 5, argc + 2, argc + 3, 8 };
  int4 c = a + b;
  return c[1];
}

This change includes:

Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, !cir.vector<s32i x 4>. (No scalable vector types yet; those will come later.)
New operation cir.vec which creates an object of a vector type with the given operands.
New operation cir.vec_elem which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.)
Basic binary arithmetic operations on vector types, though only addition has been tested.

There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, !cir.vector<s32i x 4>. (No scalable vector types yet; those will come later.) * New operation cir.vec which creates an object of a vector type with the given operands. * New operation cir.vec_elem which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

dkolsen-pgi · 2023-12-07T03:46:48Z

One of the tests failed because apparently the order in which CIR code gen emits the operands is not guaranteed. So I need to change one of the tests that checks the CIR output to be more flexible.

The order in which CIR is generated is not always guaranteed, so the expected results for code gen tests need to be flexible about the order of operations and the names of MLIR values.

dkolsen-pgi · 2023-12-07T20:35:30Z

I updated the expected results for the test that failed, and the test now passes.

bcardosolopes · 2023-12-08T22:55:19Z

One of the tests failed because apparently the order in which CIR code gen emits the operands is not guaranteed. So I need to change one of the tests that checks the CIR output to be more flexible.

Is this due to some non-determinism we should fix or something else? I lost the context!

dkolsen-pgi · 2023-12-09T00:03:45Z

The automated CI tests are run on Linux, Windows, and MacOS. On the first two the generated MLIR for the cir.vec_elem op in the vector code gen test was:

    %17 = cir.const(#cir.int<1> : !s32i) : !s32i
    %18 = cir.load %3 : cir.ptr <!cir.vector<!s32i x 4>>, !cir.vector<!s32i x 4>
    %19 = cir.vec_elem %18[%17 : !s32i] <!s32i x 4> -> !s32i

On MacOS the generated CIR was slightly different:

    %17 = cir.load %3 : cir.ptr <!cir.vector<!s32i x 4>>, !cir.vector<!s32i x 4>
    %18 = cir.const(#cir.int<1> : !s32i) : !s32i
    %19 = cir.vec_elem %17[%18 : !s32i] <!s32i x 4> -> !s32i

I realized just now that this non-determinism is the result of this code in ScalarExprEmitter::VisitArraySubscriptExpr:

      return CGF.builder.create<mlir::cir::VecElemOp>(
          CGF.getLoc(E->getSourceRange()), Visit(E->getBase()),
          Visit(E->getIdx()));

The C++ standard does not guarantee the order of the two calls to Visit It seems that the CI compilers on Linux and Windows evaluate function arguments right-to-left while the MacOS compiler evaluates them left-to-right.

I should probably change the code so the calls to Visit are in their own statements, since those function calls have noticeable side effects.

bcardosolopes

This is great, excited to have vector support! Comments inline

clang/test/CIR/CodeGen/vectype.cpp

clang/lib/CIR/Dialect/IR/CIRDialect.cpp

clang/include/clang/CIR/Dialect/IR/CIROps.td

clang/lib/CIR/CodeGen/CIRGenExprScalar.cpp

clang/include/clang/CIR/Dialect/IR/CIROps.td

clang/test/CIR/CodeGen/vectype.cpp

clang/test/CIR/IR/invalid.cir

bcardosolopes · 2023-12-09T00:42:30Z

Thanks for the detailed explanation, very subtle!

I should probably change the code so the calls to Visit are in their own statements, since those function calls have noticeable side effects.

That works for me! I wonder if we can find a better way to wrap/abstract these things so we can guarantee some determinism. We could also use the "DAG" stuff from FileCheck, but still annoying!

Rename `cir.vec` op to `cir.vec.create`, and rename `cir.vec_elem` to `cir.vec.extract`. Rename any associated class types similarly. Add `vectorConstants()` and `scalableVectors()` to class `UnimplementedFeature`. Now that vector types are being implemented, `cirVectorType()` is too broad, and flags are needed for specific vector type features that haven't been implemented yet. When doing CodeGen for `cir.vec.extract`, call `Visit(E->getBase())` and `Visit(E->getIdx())` in separate statements so that their MLIR is always generated in a consistent order.

… into feature/vector-types

dkolsen-pgi · 2023-12-13T22:19:20Z

I pushed another commit that resolves most of the code review comments, along with promises to resolve the other code review comments in later PRs.

bcardosolopes

Awesome. One minor comment and one missing bit: the LLVM testcases!

bcardosolopes · 2023-12-14T15:33:20Z

clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp

@@ -1110,6 +1110,48 @@ class CIRConstantLowering
  }
 };

+class CIRVectorCreateLowering


Nice, great to already add LLVM lowering - can you please add a testcase to either clang/test/CIR/Lowering or add an extra RUN step to CodeGen/vectype.cpp and check for LLVM output?

bcardosolopes · 2023-12-14T15:42:08Z

clang/lib/CIR/CodeGen/CIRGenExpr.cpp

-  }
+  if (!CGM.getCodeGenOpts().PreserveVec3Type && Ty->isVectorType() &&
+      Ty->castAs<clang::VectorType>()->getNumElements() == 3)
+    llvm_unreachable("NYI: Special treatment of 3-element vectors");


Can you move this to after the atomic check and wrap it around if (const auto *ClangVecTy = Ty->getAs<VectorType>()) {? No problem if this comes in a later PR, but I'd prefer if the skeleton is a bit more similar. Similar for buildStoreOfScalar on the if (const auto *ClangVecTy = Ty->getAs<VectorType>()) { part.

bcardosolopes · 2023-12-14T15:46:29Z

Actually, since @lanza is going to do a rebase very soon, I'm gonna merge this so you don't have to deal with rebase fall out. Please address both comments in follow up PRs!

This is the first part of implementing vector types and vector operations in ClangIR, issue #284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue llvm#284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue #284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue llvm#284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue #284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue llvm#284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

This is the first part of implementing vector types and vector operations in ClangIR, issue #284. This is enough to compile this test program. I haven't tried to do anything beyond that yet. ``` typedef int int4 __attribute__((vector_size(16))); int main(int argc, char** argv) { int4 a = { 1, argc, argc + 1, 4 }; int4 b = { 5, argc + 2, argc + 3, 8 }; int4 c = a + b; return c[1]; } ``` This change includes: * Fixed-sized vector types which are parameterized on the element type and the number of elements. For example, `!cir.vector<s32i x 4>`. (No scalable vector types yet; those will come later.) * New operation `cir.vec` which creates an object of a vector type with the given operands. * New operation `cir.vec_elem` which extracts an element from a vector. (The array subscript operation doesn't work here because the result is an rvalue, not an lvalue.) * Basic binary arithmetic operations on vector types, though only addition has been tested. There are no unary operators, comparison operators, casts, or shuffle operations yet. Those will all come later.

[CIR] Update code gen test vectype.cpp to be more flexible

8d10690

The order in which CIR is generated is not always guaranteed, so the expected results for code gen tests need to be flexible about the order of operations and the names of MLIR values.

dkolsen-pgi requested a review from bcardosolopes December 7, 2023 20:34

bcardosolopes requested changes Dec 9, 2023

View reviewed changes

dkolsen-pgi added 3 commits December 13, 2023 13:51

Merge branch 'llvm:main' into feature/vector-types

68c1dd2

Merge branch 'feature/vector-types' of github.com:dkolsen-pgi/clangir…

249ea1c

… into feature/vector-types

bcardosolopes reviewed Dec 14, 2023

View reviewed changes

bcardosolopes merged commit 54a37ba into llvm:main Dec 14, 2023
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CIR] Vector types - part 1 #347

[CIR] Vector types - part 1 #347

dkolsen-pgi commented Dec 7, 2023

dkolsen-pgi commented Dec 7, 2023

dkolsen-pgi commented Dec 7, 2023

bcardosolopes commented Dec 8, 2023

dkolsen-pgi commented Dec 9, 2023

bcardosolopes left a comment

bcardosolopes commented Dec 9, 2023

dkolsen-pgi commented Dec 13, 2023

bcardosolopes left a comment

bcardosolopes Dec 14, 2023

bcardosolopes Dec 14, 2023

bcardosolopes commented Dec 14, 2023

[CIR] Vector types - part 1 #347

[CIR] Vector types - part 1 #347

Conversation

dkolsen-pgi commented Dec 7, 2023

dkolsen-pgi commented Dec 7, 2023

dkolsen-pgi commented Dec 7, 2023

bcardosolopes commented Dec 8, 2023

dkolsen-pgi commented Dec 9, 2023

bcardosolopes left a comment

Choose a reason for hiding this comment

bcardosolopes commented Dec 9, 2023

dkolsen-pgi commented Dec 13, 2023

bcardosolopes left a comment

Choose a reason for hiding this comment

bcardosolopes Dec 14, 2023

Choose a reason for hiding this comment

bcardosolopes Dec 14, 2023

Choose a reason for hiding this comment

bcardosolopes commented Dec 14, 2023