[SMT] Add bitvector type, attribute, and constant operation #6804

maerhart · 2024-03-09T21:04:19Z

To clearly separate semantics, define a bit-vector type and attribute instead of reusing the built-in integer attribute. The built-in integer is usually encoded using two SMT bit-vectors to model poison and the regular bit values.

Stacked PR, only review top commit.

math-fehr

Nice!
I'm just wondering if keeping the same syntax as SMT-LIB makse sense or not.
My reasonning would be that keeping it the same as MLIR/CIRCT would make more sense, but that's a matter of preferences I think, and I would be okay for both.
I'm just not quite sure what we would gain from the SMT-LIB way!

math-fehr · 2024-03-10T00:51:24Z

include/circt/Dialect/SMT/SMTAttributes.td

+    ```mlir
+    #smt.bv<"#b0101"> : !smt.bv<4>
+    #smt.bv<"#x5c"> : !smt.bv<8>
+    ```


Is there a reason we want this compared to #smt.bv<15> : !smt.bv<8> ?
I feel that this removes the "complexity" of choosing between #x and #b, and this follows better what MLIR integer attributes, and hw.constant is doing?

I think I just went for this syntax for the more SMT-like feel, but using regular decimal numbers is also a good approach. It's probably just a matter of taste. The only disadvantages of using regular decimals I see are

bit-vector width cannot be inferred from the constant, i.e., we need the explicit type annotation: smt.bv.constant #smt.bv<5> : !smt.bv<8> vs smt.bv.constant "#x05" which basically means we always need the quantified version of the attribute in any operation assembly formats, on the other hand it saves typing a lot of zeros for big bit-vectors just storing small numbers

need to do some APInt zext and trunc with sanity checks, i.e., we just replace the complexity with some other complexity

I don't really understand why MLIR doesn't allow integer constants of the format 0b1010 which is the most basic way of representing bit-vectors

I might disagree with the first point, because while #x05 is easy to read and to understand, I cannot tell you from a glance what's the bitwidth of #b010101010101101100011101010, so having an explicit bitwidth makes a lot of sense to me!
What is the difference with zext and trunc? My understanding is that this would be the same? At least, I do not recommend changing the storage of #smt.bv.

And I would agree for the third point, though the problem is knowing when to use that format, and when to use the decimal format.

When parsing an APInt, the bitwidth does not necessarily match the one required for the given bitvector type and has to be adjusted in the attribute parser with sext and trunc and before truncating it should be checked if the truncated bits are significant/meaningful. The storage of #smt.bv is the autogenerated one with just a single APInt since that stores both the value and the bitwidth.

include/circt/Dialect/SMT/SMTAttributes.td

include/circt/Dialect/SMT/SMTBitVectorOps.td

lib/Dialect/SMT/SMTAttributes.cpp

lib/Dialect/SMT/SMTOps.cpp

math-fehr · 2024-03-10T17:17:35Z

test/Dialect/SMT/bitvectors.mlir

+  %c5_bv32 = smt.bv.constant #smt.bv<5> : !smt.bv<32> {smt.some_attr}
+  // CHECK: %c92_bv8 = smt.bv.constant #smt.bv<92> : !smt.bv<8> {smt.some_attr}
+  %c92_bv8 = smt.bv.constant #smt.bv<0x5c> : !smt.bv<8> {smt.some_attr}
+  // CHECK: %c-1_bv8 = smt.bv.constant #smt.bv<-1> : !smt.bv<8>


I'm also wondering, should we print this as -1, or should we print this as 255?
I do not have a good answer, but that's probably a question we should ask ourselves.

I just went with the same as the HW dialect does. In the end it's a signless bitvector, whether we interpret it here as a signed or unsigned value is probably equally good or bad.

fabianschuiki

LGTM! Nice to see so much stuff done via TableGen 🥳 And 👍 for adding unit tests 💯

maerhart force-pushed the maerhart-smt-bitvector-constants branch 3 times, most recently from bd9d336 to c89bf61 Compare March 9, 2024 22:18

math-fehr approved these changes Mar 10, 2024

View reviewed changes

maerhart force-pushed the maerhart-smt-bitvector-constants branch 2 times, most recently from 52b0d1c to 7d80fd8 Compare March 10, 2024 11:48

math-fehr reviewed Mar 10, 2024

View reviewed changes

fabianschuiki approved these changes Mar 10, 2024

View reviewed changes

Base automatically changed from maerhart-smt-dialect to main March 12, 2024 07:12

maerhart added 2 commits March 12, 2024 08:15

[SMT] Add bitvector type, attribute, and constant operation

8b59934

Address comments

228bfdf

maerhart force-pushed the maerhart-smt-bitvector-constants branch from 7d80fd8 to 228bfdf Compare March 12, 2024 07:15

maerhart merged commit 7e0ae45 into main Mar 12, 2024
4 checks passed

maerhart deleted the maerhart-smt-bitvector-constants branch March 12, 2024 07:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SMT] Add bitvector type, attribute, and constant operation #6804

[SMT] Add bitvector type, attribute, and constant operation #6804

maerhart commented Mar 9, 2024

math-fehr left a comment

math-fehr Mar 10, 2024

maerhart Mar 10, 2024

math-fehr Mar 10, 2024 •

edited

Loading

maerhart Mar 10, 2024

math-fehr Mar 10, 2024

maerhart Mar 10, 2024

fabianschuiki left a comment

[SMT] Add bitvector type, attribute, and constant operation #6804

[SMT] Add bitvector type, attribute, and constant operation #6804

Conversation

maerhart commented Mar 9, 2024

math-fehr left a comment

Choose a reason for hiding this comment

math-fehr Mar 10, 2024

Choose a reason for hiding this comment

maerhart Mar 10, 2024

Choose a reason for hiding this comment

math-fehr Mar 10, 2024 • edited Loading

Choose a reason for hiding this comment

maerhart Mar 10, 2024

Choose a reason for hiding this comment

math-fehr Mar 10, 2024

Choose a reason for hiding this comment

maerhart Mar 10, 2024

Choose a reason for hiding this comment

fabianschuiki left a comment

Choose a reason for hiding this comment

math-fehr Mar 10, 2024 •

edited

Loading