C++: Handle casts to `void` in IR #67

dave-bartolomeo · 2018-08-17T08:49:48Z

Casts to void did not have a semantic conversion type in the AST, so they also weren't getting generated correctly in the IR. I've added a VoidConversion class to the AST, along with tests. I've also added IR translation for such conversions, using a new ConvertToVoid opcode. I'm not sure if it's really necessary to generate an instruction to represent this, but it may be useful for detecting values that are explicitly unused (e.g. return value from a call).
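For context, a minimal sketch of the kind of source code this affects. The function names `tryOpen` and `discardResults` are made up for illustration and do not come from the test suite in this PR.

```cpp
#include <cassert>

// Hypothetical function whose result a caller might want to ignore.
// [[nodiscard]] asks the compiler to warn when the result is dropped silently.
[[nodiscard]] int tryOpen(int handle) { return handle >= 0 ? 0 : -1; }

// Calls tryOpen twice and explicitly discards the result each time.
// Returns the number of explicit discards performed.
int discardResults(int handle) {
    int discards = 0;
    (void)tryOpen(handle);               // C-style cast to void
    ++discards;
    static_cast<void>(tryOpen(handle));  // equivalent named cast
    ++discards;
    return discards;
}
```

Both statements are explicit conversions to `void`, which is the construct that previously had no semantic conversion type in the AST.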

I added two new sanity queries for the IR to detect the following:

- IR blocks with no successors, which usually indicates bad IR translation
- Phi instruction without an operand for one of the predecessor blocks.

These sanity queries found another subtle IR translation bug. If an expression that is normally translated as a condition (e.g. &&, ||, or parens in certain contexts) has a constant value, we were not creating a TranslatedExpr for the expression at all. I changed it to always treat a constant condition as a non-condition expression.
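As a rough illustration, the expressions below are the sort of constant-valued conditions described here. They are assumed examples; the PR does not say which exact expressions triggered the bug.

```cpp
#include <cassert>

// `&&` is normally translated as control flow, but here the value is a constant.
bool constantAnd() { return true && false; }

// The same applies to `||` with constant operands.
bool constantOr() { return false || true; }

// A parenthesized constant expression used directly as a branch condition.
int constantParenCondition() {
    if ((1 && 1)) {
        return 1;
    }
    return 0;
}
```

With the fix, each of these would be treated as an ordinary non-condition expression rather than being skipped.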

Note that the ".expected" files for the IR and AST dumps are now human-readable.

jbj

Otherwise LGTM

jbj · 2018-08-17T11:48:37Z

cpp/ql/src/semmle/code/cpp/ir/internal/Opcode.qll

@@ -33,6 +33,7 @@ private newtype TOpcode =
  TPointerSub() or
  TPointerDiff() or
  TConvert() or
+  TConvertToVoid() or


Given that we already have a TConvert opcode, why is it necessary to add another one? Or could these conversions just have TConvert with a result type of VoidType?

So far, I've been trying to use a different opcode whenever two operations are "qualitatively different", for some fuzzy definition of "qualitatively different" that exists only in my head. The fact that ConvertToVoid has no result, whereas Convert always has a result, makes it different enough in my view.

The void type is special in many ways in C/C++, but the main purpose of the IR is to paper over syntactic details. To me, void semantically corresponds to the unit type of functional programming languages. It's like a struct with no members. This is a useful type in template programming, and C++17 added it as std::monostate. There has even been a proposal to turn void into a proper object type in C++.

It's probably true that the translation of (void)e will never appear as an operand of an instruction, but isn't that just the consequence of an arbitrary syntactic restriction in the current standard?
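A small sketch of the unit-type analogy, assuming a C++17 compiler. The helper names `doNothing`, `sameUnit`, and `emptyOrInt` are hypothetical; only `std::monostate` and `std::variant` are standard library names.

```cpp
#include <cassert>
#include <type_traits>
#include <variant>

// std::monostate is an empty class with exactly one value.
static_assert(std::is_empty<std::monostate>::value, "monostate has no members");

// Hypothetical helper that returns a unit value instead of void.
std::monostate doNothing() { return {}; }

// Unlike a void expression, a unit value can be passed around as an operand.
bool sameUnit(std::monostate a, std::monostate b) { return a == b; }

// A typical template-programming use: an "empty" alternative in a variant.
std::variant<std::monostate, int> emptyOrInt(bool hasValue) {
    if (hasValue) {
        return 42;
    }
    return std::monostate{};
}
```

All `std::monostate` values compare equal, which is what makes it behave like a unit type.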

I think I'm actually buying your argument here. A conversion from, say, int to void doesn't seem much different from a conversion from int to float, which is just a Convert instruction. I've removed ConvertToVoid and now just use Convert.
The only place I found in the IR that needed to change to support void as a first-class type was in Instruction.getResultSize(). Previously, that predicate did not hold for a void result, meaning the result had unknown size. I now just return sizeof(void), which is zero.

jbj · 2018-08-17T12:21:27Z

cpp/ql/src/semmle/code/cpp/ir/internal/Instruction.qll

+      not exists(PhiOperand operand |
+        exists(instr.getOperand(operand)) and
+        operand.getPredecessorBlock() = pred
+      )


This predicate doesn't implement what its QLDoc says. It checks to make sure phi instructions have operands for all of their predecessors. But why should that always hold? After int x = 0; if (b) { x = 1; } else { f(); } I'd expect a phi node with two operands, one for each assignment to x, but none for the f() block.

PhiOperand.getPredecessorBlock() specifies the immediate predecessor block from which the value flowed, not the block that contains the definition. This is important for anything trying to ignore unreachable edges, or anything vaguely path sensitive, because we need to know which definitions flowed in on which edge, even if the same definition came via multiple edges.

I see. Then I'm happy with the check but not with its QLDoc. Could you add a separate check that any block with a phi node has at least two predecessors?

Fixed the comment and added unnecessaryPhiInstruction.
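The two checks discussed in this thread can be sketched with a toy model in C++. This is not the CodeQL IR or its QL implementation; `PhiInstruction`, `ToyFunction`, and the helper functions are invented here purely to show the logic on the `if`/`else` example above.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <vector>

// Toy phi instruction: for each immediate predecessor block it records
// which definition flows in along that edge.
struct PhiInstruction {
    std::string block;
    std::map<std::string, std::string> definitionByPredecessor;
};

// Toy function: each block name maps to its immediate predecessors.
struct ToyFunction {
    std::map<std::string, std::set<std::string>> predecessors;
    std::vector<PhiInstruction> phis;
};

// First check: predecessor blocks for which the phi has no operand.
std::vector<std::string> missingPhiOperands(const ToyFunction& f,
                                            const PhiInstruction& phi) {
    std::vector<std::string> missing;
    auto it = f.predecessors.find(phi.block);
    if (it == f.predecessors.end()) return missing;
    for (const std::string& pred : it->second) {
        if (phi.definitionByPredecessor.count(pred) == 0) missing.push_back(pred);
    }
    return missing;
}

// Second check: a phi in a block with fewer than two predecessors is unnecessary.
bool isUnnecessaryPhi(const ToyFunction& f, const PhiInstruction& phi) {
    auto it = f.predecessors.find(phi.block);
    std::size_t count = (it == f.predecessors.end()) ? 0 : it->second.size();
    return count < 2;
}

// Models: int x = 0; if (b) { x = 1; } else { f(); }
ToyFunction buildIfElseExample() {
    ToyFunction f;
    f.predecessors["entry"] = std::set<std::string>();
    f.predecessors["then"].insert("entry");
    f.predecessors["else"].insert("entry");
    f.predecessors["join"].insert("then");
    f.predecessors["join"].insert("else");

    PhiInstruction phi;
    phi.block = "join";
    phi.definitionByPredecessor["then"] = "x = 1";
    // The else block does not define x, but "x = 0" still flows in along its edge.
    phi.definitionByPredecessor["else"] = "x = 0";
    f.phis.push_back(phi);
    return f;
}
```

In this model the phi at the join block has an operand for the `else` edge even though that block only calls `f()`, because operands are keyed by the edge the value arrives on, not by the block containing the definition.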

dave-bartolomeo added the C++ label Aug 17, 2018

dave-bartolomeo assigned jbj Aug 17, 2018

jbj reviewed Aug 17, 2018

dave-bartolomeo added 2 commits August 17, 2018 15:37

C++: IR sanity query unnecessaryPhiInstruction

650539d

Have `Instruction.getResultSize()` return zero for `void`.

C++: Remove ConvertToVoid, replace with Convert

332e944

dave-bartolomeo mentioned this pull request Aug 18, 2018

C++: Make InitializeParameter and Uninitialized return memory results #72

Merged

jbj approved these changes Aug 20, 2018

jbj merged commit b931e88 into github:master Aug 20, 2018

dave-bartolomeo deleted the dave/CastToVoid branch September 5, 2018 18:48

kamarcum unassigned jbj Apr 28, 2020

aibaars pushed a commit that referenced this pull request Oct 14, 2021

Merge pull request #67 from github/getAPrimaryQlClass

53a1cbc

Rename describeQlClass to getAPrimaryQlClass

smowton pushed a commit to smowton/codeql that referenced this pull request Nov 1, 2021

Merge pull request github#67 from github/igfoo/typeop2

50994b6

Kotlin: Support for more type operators

dbartol pushed a commit that referenced this pull request Dec 18, 2024

Merge pull request #67 from github/bash_script_parsing

4d7c985

feat(bash): Improve bash command parsing
