Cleanup: Only have one query for must-equality between expressions #156

michael-schwarz · 2020-12-16T10:19:14Z

We currently have two different queries for must equality of expressions:

ExpEq (answered by var_eq and octagon and used in base)
MustBeEqual (answered by base, expRelation, and octagon and used in base and for arrays)

(I am to blame here, I introduced the second one in my Master's thesis).

With this PR:

We use MustBeEqual everywhere where ExpEq was used
Fix expRelation to always answer top for floats (see Unsoundness in SV-COMP Prerun #141)

This means that we use all analyses to answer the question whether to expressions must be equal, rather than different ones in different scenarios.

sim642 · 2020-12-16T10:54:31Z

As this is also a query with boolean results, it brings me flashbacks from when I implemented #130 and #137, concerning all the Must, May, Not and no-Not scheme of them. Specifically, how results of boolean queries are supposed to combine if answered by multiple analyses. The boolean result lattice for queries has false leq top.

For example, if one analysis answers MustBeEqual with true and the other one with false, then MCP2 meets them (as reminded by #155), giving false in that lattice. Shouldn't the point of such query be that if one analysis is really precise and can prove the equality, we should trust it, instead of dropping down to the level of the most imprecise analysis, which answered that query? In that sense, it seems to me like the query should be named, answered and used the other way around, like "not equal", so that they meet according to the more precise analysis.
(Maybe the less confusing alternative would be to have two different boolean result lattices with opposite orders and the queries give the one which matches their semantics for combining.)

Also, in some places, e.g. MustBeEqual in base, things have been abused to instead return the top of the query result lattice, which is larger than the top of the boolean lattice. This is such that the imprecise false result wouldn't override others. I feel like this should never be done and instead queries like this should always return results of the correct type, while having their ordering for meet thought through instead of hacked around.

This is in no way a criticism of just MustBeEqual but likely affects many of the other queries too. For example, it's the reason I added a NotSingleThreaded query even though a SingleThreaded query exists, but I wanted the results from different analyses combined in the other way.

michael-schwarz · 2020-12-16T11:37:35Z

For example, if one analysis answers MustBeEqual with true and the other one with false, then MCP2 meets them (as reminded by #155), giving false in that lattice. Shouldn't the point of such query be that if one analysis is really precise and can prove the equality, we should trust it, instead of dropping down to the level of the most imprecise analysis, which answered that query?

I agree.

Maybe the less confusing alternative would be to have two different boolean result lattices with opposite orders and the queries give the one which matches their semantics for combining.

I think this the cleaner solution, as a negation of MustBeEqual would have to be MayBeUnequal which causes all sorts of knots when thinking about it.
Maybe a better way to handle this is to not have this boolean lattice, but instead two different lattices
T: \bot \leq True \leq \top and F: \bot \leq False \leq \top. This would mean having more queries, but would make it impossible to get confused about the ordering:

E.g. MayBeEqual would return sth from F whereas MustBeEqual would return something from T. I think it is only in rare circumstances that we are interested in both possible answers at the same time.

michael-schwarz · 2020-12-16T12:30:00Z

I experimented with MayHold with false \leq true and MustHold with true \leq false. I think this is nicer as it forces ne to think about the order when defining such a query. I'm not too happy with the names though, any better suggestions?

If this is what we want to do, I can replace all instance of the Bool result type by this one.

sim642 · 2020-12-16T12:33:08Z

I think this the cleaner solution, as a negation of MustBeEqual would have to be MayBeUnequal which causes all sorts of knots when thinking about it.

That's true, especially if you need to figure out what result false means for MayBeUnequal.

When these issues occurred to me, I think I dug a bit in the git history and found that some things were at some point renamed and flipped to avoid the Not in their name. I suspect these changes might have been done without any consideration about differing meet behavior. I think it often hasn't mattered either because many queries are just answered by one (activated) analysis. But anyway, if we want to clarify this in general, we might have to go over all of the boolean queries anyway.

Maybe a better way to handle this is to not have this boolean lattice, but instead two different lattices
T: \bot \leq True \leq \top and F: \bot \leq False \leq \top. This would mean having more queries, but would make it impossible to get confused about the ordering:

How would it mean more queries? I think the same queries would suffice but their return values would be somewhat weird, e.g. MustBeEqual would return True or \top (of the entire query result lattice). It would actually avoid duplicate meaning of the two tops (of the boolean lattice and the query result lattice), because all analyses being imprecise enough not to be able to prove equality is just as good as no analysis answering at all. Right now many places where queries are made have such special logic for Top also.

But then the natural continuation would be to also want to do the same to all the other variants in the query result type as many of them also have their own specific bottom and top. That might be too much work again.

So in that sense I feel like each result variant should be a standalone lattice (and thus there be two separate variants for boolean and reverse-boolean lattice). Each query (if implemented by an analysis) should then always return something within the appropriate variant. Top of the result lattice should be reserved for "not answered" instead of "not answered or unknown". And bottom of the result lattice should be reserved for "conflicting result types" instead of "conflicting result types or conflicting values of the same type".

EDIT: And then the queries themselves should probably always have "Must" or "May" in their name, to be very clear about their behavior and result variant. At least given the lack of #33.

michael-schwarz · 2020-12-16T12:41:01Z

How would it mean more queries?

At least now MustBeEqual in base.ml seems to have been with understanding that both Bool true and Bool false are definite answers. But i guess this was only confused usage.

So in that sense I feel like each result variant should be a standalone lattice (and thus there be two separate variants for boolean and reverse-boolean lattice). Each query (if implemented by an analysis) should then always return something within the appropriate variant. Top of the result lattice should be reserved for "not answered" instead of "not answered or unknown". And bottom of the result lattice should be reserved for "conflicting result types" instead of "conflicting result types or conflicting values of the same type".

That's pretty much the thing I now have, right? Minus the \top and \bot in the boolean lattices, but I guess there is no need for them?

sim642 · 2020-12-16T12:48:07Z

I experimented with MayHold with false \leq true and MustHold with true \leq false. I think this is nicer as it forces ne to think about the order when defining such a query. I'm not too happy with the names though, any better suggestions?

Maybe MayBool and MustBool instead, so they still refer to the type as previously and as everything else. Because it isn't unthinkable that similar may/must ordering issues could affect queries of other types as well (although maybe not among the ones we currently have, I never checked).

That's pretty much the thing I now have, right? Minus the \top and \bot in the boolean lattices, but I guess there is no need for them?

The change you committed in the meanwhile matches what I originally had in mind and as I said above, I think I'd prefer it that way for now anyway.

…tBeSingleThreaded

michael-schwarz · 2020-12-16T13:49:29Z

Ok, I switched them all over to MustBool or MayBool now and removed the plain Bool one.

During this, I noticed that SingleThreaded and NotSingleThreaded seem to express exactly the same thing, i.e. are MustBeSingleThreaded and MayNotBeSingleThreaded. So I removed the MayNotBeSingleThreaded one. Would be good if someone else can confirm that this is indeed the case.

sim642 · 2020-12-17T10:10:31Z

During this, I noticed that SingleThreaded and NotSingleThreaded seem to express exactly the same thing, i.e. are MustBeSingleThreaded and MayNotBeSingleThreaded. So I removed the MayNotBeSingleThreaded one. Would be good if someone else can confirm that this is indeed the case.

That seems correct. When I added the negated variant to get the meet right, I initially didn't want to remove the non-negated one in case something actually depended on it working the other way for whatever reason.

I made a few additional commits here: one about flipping a MayBeNot query to MustBe for better clarity and the other for adding some missing operations which I think the default case didn't cover like the old boolean lattice.

michael-schwarz · 2020-12-17T10:17:11Z

I started a run of this on some larger programs, if this doesn't lead to any curious changes, I'll merge it later today.

michael-schwarz · 2020-12-17T12:46:48Z

The only thing that changed is two more cases of issue #153.

Cleanup: Only one query for MustEq

d155396

sim642 added the cleanup Refactoring, clean-up label Dec 16, 2020

Introduce MustHold and MayHold for queries

41f00cc

Use MustBeEqual correctly

3ab7e7e

michael-schwarz added 11 commits December 16, 2020 13:51

Rename (May\Must)Hold -> (May|Must)Bool

e412401

Use `MayBool for MayEscape

184ff9e

Rename IsPublic to MayBePublic, use `MayBool

22959ef

use `MayBool for IsHeapVar

735e981

Remove unused InInterval query

8c4d6f2

Rename IsUnique to MayBeNotUnique, use MayBol

453823d

Rename NotSingleThreaded to MayBeNotSingleThreaded & use `MayBool

32d08ec

Rename SingleThreaded to MustBeSingleThreaded & use `MustBool

30417f2

Rename IsThreadReturn to MayBeThreadReturn & use `MayBool

db214d3

Remove `Bool from queries

7c343c9

Remove superflous MayBeNotSingleThreaded, is the same as negating Mus…

807b7e7

…tBeSingleThreaded

sim642 self-requested a review December 16, 2020 13:51

sim642 added 2 commits December 17, 2020 11:57

Implement missed MustBool and MayBool operations in Queries.Result

65786ca

Replace MayBeNotUnique with MustBeUniqueThread

e066ca9

sim642 approved these changes Dec 17, 2020

View reviewed changes

michael-schwarz merged commit f4444a0 into master Dec 17, 2020

michael-schwarz deleted the cleanups/one_eq_query branch December 17, 2020 12:47

sim642 mentioned this pull request May 10, 2021

Make queries type safe using GADTs #221

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cleanup: Only have one query for must-equality between expressions #156

Cleanup: Only have one query for must-equality between expressions #156

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 16, 2020 •

edited

Loading

Uh oh!

michael-schwarz commented Dec 16, 2020 •

edited

Loading

Uh oh!

sim642 commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 17, 2020

Uh oh!

michael-schwarz commented Dec 17, 2020

Uh oh!

michael-schwarz commented Dec 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Cleanup: Only have one query for must-equality between expressions #156

Cleanup: Only have one query for must-equality between expressions #156

Uh oh!

Conversation

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

michael-schwarz commented Dec 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sim642 commented Dec 16, 2020

Uh oh!

michael-schwarz commented Dec 16, 2020

Uh oh!

sim642 commented Dec 17, 2020

Uh oh!

michael-schwarz commented Dec 17, 2020

Uh oh!

michael-schwarz commented Dec 17, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sim642 commented Dec 16, 2020 •

edited

Loading

michael-schwarz commented Dec 16, 2020 •

edited

Loading