Improve Escape Analysis under OSR #5737

hzongaro · 2019-05-11T01:02:22Z

This change defines new optimization passes, preEscapeAnalysis and postEscapeAnalysis, that help Escape Analysis (EA) do better under voluntary OSR.

The new preEscapeAnalysis optimization looks for calls to OSRInductionHelper and adds a fake prepareForOSR call that references all live auto symrefs and pending pushes. Any object that ends up as a candidate for stack allocation that appears to be used by such a fake prepareForOSR call can be heapified at that point.

During EA itself, those references on prepareForOSR calls are marked as ignoreable for the purposes of determining whether an object can be stack allocated.

After EA, the postEscapeAnalysis pass will remove the fake calls to prepareForOSR.
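As a rough illustration of the three steps described above, here is a toy model of the idea. It is only a sketch: `Tree`, `pre`, `canStackAllocate`, `post` and `sample` are hypothetical names invented for this example, liveness is simply passed in as a list, and none of this is the actual OpenJ9 IL or optimizer code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Toy model only: Tree and these method names are hypothetical stand-ins,
// not the OpenJ9 IL classes or optimizer entry points.
final class FakePrepareSketch {
    static final class Tree {
        final String op;          // e.g. "new", "osrInductionHelper", "prepareForOSR"
        final List<String> refs;  // autos referenced by this tree
        final boolean fake;       // true only for calls added by the pre pass

        Tree(String op, List<String> refs, boolean fake) {
            this.op = op;
            this.refs = refs;
            this.fake = fake;
        }
    }

    // Pre pass: after each induction helper call, add a fake prepareForOSR
    // call that references the autos assumed to be live at that point.
    static List<Tree> pre(List<Tree> trees, List<String> liveAutos) {
        List<Tree> out = new ArrayList<>();
        for (Tree t : trees) {
            out.add(t);
            if (t.op.equals("osrInductionHelper")) {
                out.add(new Tree("prepareForOSR", liveAutos, true));
            }
        }
        return out;
    }

    // EA stand-in: a candidate is treated as stack allocatable unless a
    // prepareForOSR call uses it. Uses on fake calls are skipped when
    // ignoreFakeUses is set, which models marking them as ignoreable.
    static boolean canStackAllocate(String candidate, List<Tree> trees, boolean ignoreFakeUses) {
        for (Tree t : trees) {
            if (t.fake && ignoreFakeUses) {
                continue;
            }
            if (t.op.equals("prepareForOSR") && t.refs.contains(candidate)) {
                return false;
            }
        }
        return true;
    }

    // Post pass: drop the fake calls again.
    static List<Tree> post(List<Tree> trees) {
        List<Tree> out = new ArrayList<>();
        for (Tree t : trees) {
            if (!t.fake) {
                out.add(t);
            }
        }
        return out;
    }

    // A tiny method body: allocate "a", reach an OSR induction point, return.
    static List<Tree> sample() {
        return Arrays.asList(
            new Tree("new", Arrays.asList("a"), false),
            new Tree("osrInductionHelper", new ArrayList<String>(), false),
            new Tree("return", new ArrayList<String>(), false));
    }
}
```

In this model, an analysis that does not ignore the fake uses simply sees one more escape point for `a`, while an analysis that does ignore them can keep `a` as a candidate and treat the fake call as the place where it would be heapified.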

Submitted on behalf of Andrew Craik ajcraik@ca.ibm.com

Signed-off-by: Henry Zongaro zongaro@ca.ibm.com

hzongaro · 2019-05-11T01:15:23Z

This change is dependent upon OMR pull request 3841

Vijay @vijaysun-omr, may I ask you to review this change? Andrew @andrewcraik, may I ask you to verify my descriptions of your improvements?

2019-05-13: Removed WIP from title, as OMR pull request 3841 has been merged.

hzongaro · 2019-05-11T01:26:37Z

Force pushed to c31e473 in order to rectify copyright problem

runtime/compiler/optimizer/PreEscapeAnalysis.hpp

hzongaro · 2019-05-13T20:32:44Z

CI builds were failing because I had failed to include PreEscapeAnalysis.cpp and PostEscapeAnalysis.cpp in the list of files to be compiled with cmake. Fixed in 1e74777

andrewcraik · 2019-05-15T14:05:12Z

description is correct - review from @vijaysun-omr is required since I was involved with the development of this change.

runtime/compiler/optimizer/PostEscapeAnalysis.hpp

andrewcraik

Note - we need @vijaysun-omr to review since I was involved with the implementation. I verify the docs are correct and the implementation looks reasonable.

runtime/compiler/optimizer/EscapeAnalysis.cpp

vijaysun-omr · 2019-05-18T03:14:48Z

Looks reasonable to me. What kind of testing has been done on this change ?

andrewcraik · 2019-05-21T17:55:09Z

@hzongaro can you comment on the testing?

hzongaro · 2019-05-21T20:40:24Z

@vijaysun-omr Vijay asked,

What kind of testing has been done on this change?

Thanks for reviewing, Vijay. I ran internal "gold" testing (level.sanity, level.regression, level.promotion) on all platforms.

I also tested this change with OMR pull request #3842 against the benchmark referenced in issue #2072. Unfortunately, even after this change, there still seems to be lingering heap allocation of Double objects in the loop. I'm tracking down the reason for that, but its resolution might have to be the subject of a subsequent pull request.

hzongaro · 2019-05-21T20:45:25Z

Squashed commits down to 09d6692 and 5c9af1b

hzongaro · 2019-05-22T12:14:59Z

I wrote,

Unfortunately, even after this change, there still seems to be lingering heap allocation of Double objects in the loop. I'm tracking down the reason for that, but its resolution might have to be the subject of a subsequent pull request.

I think the lingering object allocation that I mentioned was due to a "fix" I made for an infinite recursion. I will mark this pull request as WIP while I fix the problem, so the intended improvement to the Escape Analysis optimization isn't hobbled.

charliegracie · 2019-05-24T15:23:32Z

Is this PR functionally correct? Does it improve EA and the number of objects that get stack allocated? If the answer to those questions is yes, then I do not see why we should not merge this PR and let the investigation into further refinements happen in another PR.

vijaysun-omr · 2019-05-24T19:26:52Z

@hzongaro could you please answer ?

hzongaro · 2019-05-27T05:34:11Z

@charliegracie Charlie, I believe the PR is functionally correct, improves EA and increases the number of objects that get stack allocated. My concern is that Andrew's primary motivation for these changes was to improve on the number of objects stack allocated for issue #2072.

In testing the change with the benchmark snippet that appeared in the description from issue #2072 -- which I've reproduced below -- I verified that the objects in the loop were stack allocated with Andrew's changes.

    public void run() {
      Double result = 1d;
      Double bound = Main.until;
      for (Double f = 0d; f < bound; f++) {
        result += f / (result + 1);
      }
      if (log) System.out.println(result);
    }
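
To make the allocations in that loop easier to see, the boxing can be written out by hand. The sketch below assumes the usual javac lowering of boxing conversions to `Double.valueOf` and `doubleValue` calls; the class and method names are made up for this illustration, and `bound` is passed as a parameter instead of being read from `Main.until`.

```java
// Illustration only: BoxedLoopSketch is a hypothetical class, not part of
// the benchmark. Both methods are intended to compute the same value.
final class BoxedLoopSketch {
    // The loop as written in the issue, relying on autoboxing.
    static Double boxed(Double bound) {
        Double result = 1d;
        for (Double f = 0d; f < bound; f++) {
            result += f / (result + 1);
        }
        return result;
    }

    // The same loop with each boxing and unboxing conversion spelled out.
    // Every Double.valueOf call here is a potential object allocation.
    static Double explicit(Double bound) {
        Double result = Double.valueOf(1d);
        for (Double f = Double.valueOf(0d);
             f.doubleValue() < bound.doubleValue();
             f = Double.valueOf(f.doubleValue() + 1)) {
            result = Double.valueOf(
                result.doubleValue() + f.doubleValue() / (result.doubleValue() + 1));
        }
        return result;
    }
}
```

Written this way, each iteration produces two boxed values, one for `f` and one for `result`, and those are the objects that Escape Analysis would need to stack allocate for the loop to run without heap allocation.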

However, last week I went back to the original benchmark link (which now seems to be broken) in the issue. There, the relevant code in the benchmark was slightly different:

    public void run() {
      final Long startTime = System.currentTimeMillis();
            
      Double result = 1d;
      Double bound = Main.until;

      for (Double f = 0d; f < bound; f++) {
        result += f / (result + 1);
      }
      
      final Long endTime = System.currentTimeMillis();
      if (log) System.out.println(result);
      if (log) System.out.println(Thread.currentThread().getName() + " finished in " + (endTime - startTime) + "ms");    
    }

With the addition of those boxed Long values, the object allocations within the loop remain there. One reason is a fix that I made in commit eb5edef, which turns out to be too conservative, and ends up interfering with performing the stack allocations in this seemingly very similar method.

So, although the stack allocations now happen with the version of the code that appeared in issue #2072, the fact that there was no improvement in stack allocation using the original version of the benchmark led me to move this pull request back to WIP.

If you feel it's still worthwhile merging this change while the work on ensuring the objects in the original version of the benchmark can be stack allocated continues, I would be happy to remove the WIP from the title.

Please accept my apologies for failing to notice this difference earlier.

andrewcraik · 2019-05-27T13:33:33Z

@hzongaro I think it is worth putting this in. Getting the Double to stack allocate requires loosening a conservatively correct check to handle some additional situations, and is not likely to require reworking the other changes made here. There may be an additional enhancement needed to get this other version working, but what we have so far is an improvement and won't be a regression, so we should merge it and work on the further enhancement separately.

hzongaro · 2019-05-27T13:36:25Z

OK. Removing WIP from the title.

andrewcraik · 2019-05-27T13:37:07Z

Jenkins test sanity xlinux,plinux,win jdk8,jdk11

hzongaro · 2019-06-11T15:40:29Z

Hi, Vijay @vijaysun-omr - I'm still working on tracking down the source of the intermittent crash from TR_PreEscapeAnalysis::perform. It's difficult to estimate how quickly I can find the source of that problem, but it may well be a preexisting problem that this pull request happens to trip over.

hzongaro · 2019-07-30T17:21:49Z

Sorry for the long delay on updating this. I've marked this change as a work in progress. As was mentioned in #5737 (comment), there is an intermittent failure that occurs with this change. The problem is in unrelated code, but happens to be exposed by this change.

OpenJ9 pull request #6626, OMR pull request #4177 and OMR pull request #4182 address that problem. Once they are delivered, I will remove the WIP prefix from this pull request.

hzongaro · 2019-08-01T13:46:48Z

Just force pushed rebasing the branch on the latest master. No changes otherwise.

andrewcraik · 2019-08-01T14:44:53Z

Jenkins test sanity xlinux,win,plinux jdk8,jdk11

andrewcraik · 2019-08-01T14:46:26Z

have started sanity since the sequence of changes needed to fix the OSR metadata bug exposed by this work has now landed.

hzongaro · 2019-08-01T14:55:02Z

Removed WIP prefix, because as Andrew mentioned, the bug this change exposes has been fixed.

andrewcraik · 2019-08-01T15:07:52Z

@hzongaro I don't think we should merge this until the new Pre/Post are added to the opt strategy in this commit - the danger is that we would keep running regular EA and ignore the 'real' prepares thinking the fake prepares will save us, but we won't have added them

hzongaro · 2019-08-01T15:50:01Z

@hzongaro I don't think we should merge this until the new Pre/Post are added to the opt strategy in this commit - the danger is that we would keep running regular EA and ignore the 'real' prepares thinking the fake prepares will save us, but we won't have added them

Thanks, Andrew @andrewcraik. So I think that means that I need to separate out the part of this change that adds PreEscapeAnalysis and PostEscapeAnalysis. Then after that's merged, OMR pull request #3842, which adds them to the optimization strategy can be merged, and finally, the remainder of this change, with the actual EA changes, can be delivered. Do I have that right?

andrewcraik · 2019-08-01T15:54:39Z

Yes I think so - we can't change main EA until we have pre/post running, but it is safe to run pre/post with existing EA - you just see more escapes (escapes that would have happened anyway)

hzongaro · 2019-08-01T17:56:24Z

Reorganized changes so that PreEscapeAnalysis and PostEscapeAnalysis are available and running when the Escape Analysis changes are delivered, as Andrew recommends above. This makes this change now dependent upon pull request #6654 and OMR pull request #3842.

hzongaro · 2019-08-09T15:29:55Z

Just force pushed to bring the ea-staging-branch-step2 up to date with changes from pull request #6654 - no substantive changes here.

andrewcraik · 2019-08-21T13:14:35Z

@hzongaro now that #6654 has merged can you rebase so we can sanity this one?

The preEscapeAnalysis optimization, which was introduced in a previous change, looks for calls to OSRInductionHelper and adds a fake prepareForOSR call that references all live auto symrefs and pending pushes. Any object that ends up as a candidate for stack allocation that appears to be used by such a fake prepareForOSR call can be heapified at that point by Escape Analysis.

During EA itself, those references on prepareForOSR calls are marked as ignoreable for the purposes of determining whether an object can be stack allocated.

Submitted on behalf of Andrew Craik <ajcraik@ca.ibm.com>

Signed-off-by: Henry Zongaro <zongaro@ca.ibm.com>

The code in findIgnoreableUses that populates the TR_BitVector, _ignoreableUses, uses the global indices of nodes to index the TR_BitVector. Code in checkDefsAndUses was using the value numbers of nodes to check the entries in _ignoreableUses. Fixed checkDefsAndUses to use nodes' global indices instead. Also, reorganized the code checking _ignoreableUses to pull the invariant check out of a loop.

Signed-off-by: Henry Zongaro <zongaro@ca.ibm.com>
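
The indexing mismatch described in that commit message can be shown with a small stand-alone model. This is only a sketch using `java.util.BitSet`: the `Node` class, its two fields and the method names are assumptions made for the example, not the `TR_BitVector` or node APIs from the compiler.

```java
import java.util.BitSet;

// Toy model: Node and these methods are hypothetical; only the indexing
// mismatch between the writer and the reader of the bit set is the point.
final class IgnoreableUsesSketch {
    static final class Node {
        final int globalIndex;  // unique per node
        final int valueNumber;  // can be shared by nodes computing the same value

        Node(int globalIndex, int valueNumber) {
            this.globalIndex = globalIndex;
            this.valueNumber = valueNumber;
        }
    }

    // Populate the set using global indices, as the commit message describes.
    static BitSet findIgnoreableUses(Node... ignoreable) {
        BitSet bits = new BitSet();
        for (Node n : ignoreable) {
            bits.set(n.globalIndex);
        }
        return bits;
    }

    // Mismatched lookup: queries with a value number against a set that was
    // populated with global indices.
    static boolean isIgnoreableByValueNumber(BitSet bits, Node use) {
        return bits.get(use.valueNumber);
    }

    // Consistent lookup: queries with the same kind of index that was stored.
    static boolean isIgnoreableByGlobalIndex(BitSet bits, Node use) {
        return bits.get(use.globalIndex);
    }
}
```

With a node whose global index is 7 and value number is 3 marked as ignoreable, the value-number lookup misses it, and it can also report an unrelated node whose value number happens to be 7.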

hzongaro · 2019-08-21T14:07:06Z

Andrew @andrewcraik, I've brought this branch up to date with the latest changes in OpenJ9. Before this change is sanity tested, we'll need OMR pull request 3842 to make it through the OpenJ9 OMR acceptance testing. I will leave this marked as a work in progress until then.

andrewcraik · 2019-08-27T13:16:00Z

@hzongaro I believe 3842 has merged and been accepted. If you could confirm and un-WIP I'll restart testing on this change.

hzongaro · 2019-08-27T14:16:57Z

Removed WIP prefix now that OMR pull request 3842 has been merged and passed OpenJ9 OMR acceptance testing.

andrewcraik · 2019-09-03T16:21:48Z

Jenkins test sanity all jdk8,jdk11

hzongaro force-pushed the ea-staging-branch-step2 branch from e450af0 to c31e473 May 11, 2019 01:25

hzongaro mentioned this pull request May 11, 2019

Enable Pre-/PostEscapeAnalysis optimizations for OpenJ9 eclipse-omr/omr#3842

Merged

fjeremic reviewed May 11, 2019

hzongaro changed the title ~~WIP: Improve Escape Analysis under OSR~~ Improve Escape Analysis under OSR May 13, 2019

fjeremic added the comp:jit label May 14, 2019

fjeremic reviewed May 15, 2019

fjeremic approved these changes May 15, 2019

andrewcraik approved these changes May 15, 2019

fjeremic assigned vijaysun-omr May 16, 2019

fjeremic requested a review from vijaysun-omr May 16, 2019 01:13

vijaysun-omr reviewed May 16, 2019

vijaysun-omr approved these changes May 18, 2019

hzongaro force-pushed the ea-staging-branch-step2 branch from 5aedfd0 to 5c9af1b May 21, 2019 20:43

hzongaro changed the title ~~Improve Escape Analysis under OSR~~ WIP: Improve Escape Analysis under OSR May 22, 2019

hzongaro changed the title ~~WIP: Improve Escape Analysis under OSR~~ Improve Escape Analysis under OSR May 27, 2019

hzongaro changed the title ~~Improve Escape Analysis under OSR~~ WIP: Improve Escape Analysis under OSR Jul 30, 2019

hzongaro force-pushed the ea-staging-branch-step2 branch from bfef953 to 69739bb August 1, 2019 13:38

hzongaro changed the title ~~WIP: Improve Escape Analysis under OSR~~ Improve Escape Analysis under OSR Aug 1, 2019

andrewcraik changed the title ~~Improve Escape Analysis under OSR~~ WIP: Improve Escape Analysis under OSR Aug 1, 2019

hzongaro mentioned this pull request Aug 1, 2019

Define new Pre- and PostEscapeAnalysis passes #6654

Merged

hzongaro force-pushed the ea-staging-branch-step2 branch from 69739bb to 5791293 August 1, 2019 17:52

hzongaro force-pushed the ea-staging-branch-step2 branch 2 times, most recently from 394c9ea to f3f7383 August 9, 2019 15:14

hzongaro added 2 commits August 21, 2019 09:51

hzongaro force-pushed the ea-staging-branch-step2 branch from f3f7383 to 3fb2b2a August 21, 2019 14:03

hzongaro changed the title ~~WIP: Improve Escape Analysis under OSR~~ Improve Escape Analysis under OSR Aug 27, 2019

andrewcraik merged commit 31fcc88 into eclipse-openj9:master Sep 4, 2019

hzongaro deleted the ea-staging-branch-step2 branch September 4, 2019 20:55
