Making rule engine execution order more deterministic #207

mrrodriguez · 2016-07-12T11:16:56Z

Clara currently uses clara.rules.platform/tuned-group-by in a number of places. The grouping this function produces is unordered and can vary from one JVM to the next. In particular, I've noticed this in clara.rules.engine/flush-updates, where it caused non-deterministic insertion order across rules. This makes it harder for us to track down bad performance cases, since they keep fluctuating as we run in different processes.

A side effect of flush-updates not behaving deterministically was that insertion order across rules in the same salience group was not deterministic and didn't necessarily respect the rule load order implemented in #192. In my preliminary testing, I see a slight improvement in our execution times with this change in place. This is likely because the rule order is now also the insertion order. I've demonstrated the ordering fix with a test that fails before this change and passes with it.

My proposal is to introduce (on the JVM Clojure side) a new clara.rules.platform/group-by-seq function that replaces the usages of platform/tuned-group-by. For now, I've left platform/tuned-group-by in place in case it ends up having some usages later. It is also still used in one CLJS place, which I don't want to change as part of this PR. platform/group-by-seq returns a seq of [key, group] tuples, which is the same shape you get from seq'ing over the map that platform/tuned-group-by returned. In every place where we used platform/tuned-group-by, we never used the map itself, only a seq over it.

I use a java.util.LinkedHashMap to maintain insertion order while building the groupings, then convert it into an immutable seq structure. If I tried to keep returning a map here, I'd have to return the java.util.LinkedHashMap itself or come up with some (expensive) way to port it into a Clojure sorted map. This was not worthwhile to me, since we do not need a map anyway. A java.util.LinkedHashMap would also be harder to work with across the rest of the codebase, since there are issues with the default Clojure seq implementations over mutable object iterators (http://dev.clojure.org/jira/browse/CLJ-1738). I see these come up when running this on JDK6 (oddly not on JDK8, which makes no sense, and I want to investigate more for my own benefit).
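The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the name `group-by-seq-sketch` is hypothetical, and the real platform/group-by-seq may differ in details such as its use of transients.

```clojure
(defn group-by-seq-sketch
  "Hypothetical sketch: groups coll by (f x) and returns a vector of
  [key items] tuples, ordered by the first appearance of each key."
  [f coll]
  (let [^java.util.LinkedHashMap m (java.util.LinkedHashMap.)]
    ;; Accumulate items per key. LinkedHashMap iterates in insertion
    ;; order, and re-putting an existing key does not move it.
    (doseq [x coll]
      (let [k (f x)]
        (.put m k (conj (or (.get m k) []) x))))
    ;; Copy the entries out eagerly into immutable vectors so that no
    ;; mutable map or iterator escapes this function.
    (let [^java.util.Iterator it (.iterator (.entrySet m))]
      (loop [acc []]
        (if (.hasNext it)
          (let [^java.util.Map$Entry e (.next it)]
            (recur (conj acc [(.getKey e) (.getValue e)])))
          acc)))))

;; Keys come back in first-seen order, regardless of hash ordering.
(group-by-seq-sketch odd? [1 2 3 4 5])
;; => [[true [1 3 5]] [false [2 4]]]
```

Because the result is built eagerly, callers only ever see persistent vectors, which sidesteps the mutable-iterator seq concerns mentioned above.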

Along with this, since I was already touching clara.rules.compiler/create-get-alphas-fn, I sorted the result of the ancestors-fn so that it behaves deterministically. This is more of an edge case, but since we cache this result and should hit this path infrequently, I figured it'd still be worthwhile.
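As an illustration of why sorting helps here: `clojure.core/ancestors` returns a set, so its iteration order is not something to rely on. A minimal sketch, assuming a name-based sort key (the key actually used in this PR is not shown here, and `sorted-ancestors` is a hypothetical helper):

```clojure
(defn sorted-ancestors
  "Hypothetical helper: returns the ancestors of type t as a seq with a
  stable order. Classes are ordered by name; any other ancestor value
  (for example a keyword from a custom hierarchy) falls back to str."
  [t]
  (sort-by (fn [a]
             (if (class? a)
               (.getName ^Class a)
               (str a)))
           (ancestors t)))

;; The same input now always yields the same ordering.
(= (sorted-ancestors Long) (sorted-ancestors Long))
```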

rbrush · 2016-07-12T12:27:26Z

Looks good to me after an early review. I'll merge later today unless there are unexpected problems.

WilliamParker · 2016-07-12T14:30:08Z

I'll plan to look at this today.

WilliamParker · 2016-07-12T16:41:37Z

src/main/clojure/clara/rules/compiler.clj

+                ;; by the :node-id of their :children.
+                new-nodes (sort-by #(mapv :node-id (:children %))
+                                   (into []
+                                         (comp (map #(get (get merged-rules :alpha-roots) %))


nit: I think this might be a bit more readable if you let-bound `(get merged-rules :alpha-roots)`.

mrrodriguez (2016-07-13): I'll do that.

WilliamParker · 2016-07-12T19:38:21Z

+1 apart from my comments.

mrrodriguez · 2016-07-13T11:12:18Z

I've addressed all of the review comments made by @WilliamParker and updated this PR.

- Added `platform/group-by-seq` to (mostly) replace `platform/tuned-group-by`
- Made ancestors-fn contribution to the alphas-fn map behave deterministically

WilliamParker · 2016-07-13T15:10:53Z

+1

rbrush · 2016-07-13T19:17:32Z

Merged. Thanks!

WilliamParker reviewed Jul 12, 2016
mrrodriguez force-pushed the stable-inserts branch from b90c1a4 to 21be09b on July 13, 2016 11:11

Making rule engine execution order more deterministic

7839e1d

- Added `platform/group-by-seq` to (mostly) replace `platform/tuned-group-by`
- Made ancestors-fn contribution to the alphas-fn map behave deterministically

mrrodriguez force-pushed the stable-inserts branch from 21be09b to 7839e1d on July 13, 2016 15:01

rbrush merged commit 13537db into oracle-samples:master Jul 13, 2016

mrrodriguez mentioned this pull request May 21, 2018

clara.rules.platform/group-by-seq used in engine may cause incorrect semantics #393

Open
