C++: Value number performance fix #2835

MathiasVP · 2020-02-13T17:05:26Z

This PR fixes a performance problem observed when an FieldAddressInstruction's getField predicate returns more than one field.

The fix is the same as in the AST library: we simply filter out those FieldAddressInstructions that have too many fields.

The PR also fixes a problem caused by a ConstantInstruction having multiple value numbers due to the instruction having more than one IR return type.

This is not enough to get genome/breakdancer working.

MathiasVP · 2020-02-13T20:41:40Z

3 tests are failing now due to the change in value numbering for ConstantInstructions. This snippet is representative of all the failing testcases:

 #   56|     r56_21(int)           = Constant[1]          :
-#   56|         valnum = unique
+#   56|         valnum = r56_21, r62_3
...
 #   62|     r62_3(unsigned int)        = Constant[1]             :
-#   62|         valnum = unique
+#   62|         valnum = r56_21, r62_3

I'm guessing we do want to distinquish these two values with different value numbers.

Edit: Fixed this in 57613d5

…stant with different signed-ness the same value number. Instead filter those with more than one type out.

Instructions that are removed from the normal value numbering recursion because they have a duplicated type or AST element get unique value numbers rather than going unnumbered. This ensures comparisons of value numbers using `!=` hold for filtered instructions.

jbj · 2020-02-14T13:29:31Z

...ql/src/semmle/code/cpp/ir/implementation/aliased_ssa/gvn/internal/ValueNumberingInternal.qll

-  ) {
-    loadTotalOverlapValueNumber(_, irFunc, type, memOperand, operand)
+  TLoadTotalOverlapValueNumber(IRFunction irFunc, TValueNumber memOperand, TValueNumber operand) {
+    loadTotalOverlapValueNumber(_, irFunc, memOperand, operand)


Will the removal of type here make us confused about the following (inspired by #2772 (comment))?

double foo(int* p, int choice) { if (choice == 1) return *(int*)p; // 1 else if (choice == 2) return (int)*(char*)p; // 2 else return *(float*)p; // 3 }

I think one could argue that 1 and 3 should have the same value number before their conversion to double since they're the same bit pattern, but it seems wrong to give 1 and 2 the same value number. It seems especially wrong to give the same value number to the * in 1 and the (int) in 2.

Adding the following test function to ir_gvn.ql:

void foo(int* p) { double d1 = *(int*)p; double d2 = (int)*(char*)p; double d3 = *(float*)p; }

gives the following value numbers with this PR:

r162_5 = *(int*)p // r162_5, r163_5, r164_5 r163_5 = *(char*)p // r162_5, r163_5, r164_5 r162_6 = (double)*(int*)p // r162_6, r163_6, r164_6 r163_6 = (int)*(char*)p // r162_6, r163_6, r164_6 r164_5 = *(float*)p // r162_5, r163_5, r164_5 r164_6 = (double)*(float*)p // r162_6, r163_6, r164_6

So the expressions *(int*)p and *(float*)p have the same value number. (i.e., // 1 and // 3 have the same value number before conversions to double).
But d1 and d2 are assigned the same value number, sadly.

At least based on the current design, *(int*)p and *(float*)p should not have the same value number. The intention is that two results will only have the same value number if they have the same bit pattern and have the same IRType.

Can someone check if the removal of type here is necessary for performance of bloomberg/bde?

The removal is unnecessary for performance on bloomberg/bde, but we also need #2844 for it.

jbj

This LGTM. I only contributed an autoformat commit, so I'll allow myself to merge it.

jbj and others added 4 commits February 13, 2020 18:02

WIP: Switch on IR

8054cde

WIP: Try to reduce ambiguous value numbers

2439690

This is not enough to get genome/breakdancer working.

C++: Perf fix for value numbering

04c5f1c

C++: Sync up identical files and restore imports

cb510ed

MathiasVP added the C++ label Feb 13, 2020

MathiasVP requested a review from jbj February 13, 2020 17:05

MathiasVP requested a review from a team as a code owner February 13, 2020 17:05

MathiasVP and others added 3 commits February 13, 2020 21:49

C++: Reintroduce the type in TConstantValueNumber to avoid giving con…

57613d5

…stant with different signed-ness the same value number. Instead filter those with more than one type out.

C++: Sync identical files

ed7888c

rdmarsh2 assigned dbartol Feb 13, 2020

rdmarsh2 mentioned this pull request Feb 13, 2020

C++/C#: Fix sync config file for value numbering sharing #2839

Merged

C++: sync identical files

b4ff121

jbj reviewed Feb 14, 2020

View reviewed changes

rdmarsh2 and others added 2 commits February 14, 2020 13:34

C++: reinclude IRType in total load value numbers

7abd289

C++: autoformat

49d2f5a

jbj approved these changes Feb 15, 2020

View reviewed changes

jbj merged commit 0628625 into github:master Feb 15, 2020

jbj mentioned this pull request Mar 3, 2020

C++: Tests for variables with ambiguous types #2970

Merged

kamarcum unassigned dbartol Apr 28, 2020

MathiasVP mentioned this pull request Jul 21, 2021

C++: Fix FP in cpp/uninitialized-local #6342

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

C++: Value number performance fix #2835

C++: Value number performance fix #2835

Uh oh!

MathiasVP commented Feb 13, 2020 •

edited

Loading

Uh oh!

MathiasVP commented Feb 13, 2020 •

edited

Loading

Uh oh!

jbj Feb 14, 2020

Uh oh!

MathiasVP Feb 14, 2020 •

edited

Loading

Uh oh!

dbartol Feb 14, 2020

Uh oh!

jbj Feb 14, 2020

Uh oh!

rdmarsh2 Feb 14, 2020

Uh oh!

jbj left a comment

Uh oh!

Uh oh!

C++: Value number performance fix #2835

C++: Value number performance fix #2835

Uh oh!

Conversation

MathiasVP commented Feb 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MathiasVP commented Feb 13, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jbj Feb 14, 2020

Choose a reason for hiding this comment

Uh oh!

MathiasVP Feb 14, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dbartol Feb 14, 2020

Choose a reason for hiding this comment

Uh oh!

jbj Feb 14, 2020

Choose a reason for hiding this comment

Uh oh!

rdmarsh2 Feb 14, 2020

Choose a reason for hiding this comment

Uh oh!

jbj left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MathiasVP commented Feb 13, 2020 •

edited

Loading

MathiasVP commented Feb 13, 2020 •

edited

Loading

MathiasVP Feb 14, 2020 •

edited

Loading