Sampling allocation bytes precisely without compromising the performance #9745

LinHu2016 · 2020-05-29T18:59:20Z

in order to sampling heap allocation bytes precisely without
compromising the performance, we have the below changes.

Handle instrumentableAllocateHook and
VM_OBJECT_ALLOCATE_WITHIN_THRESHOLD is still via disabling inline
allocation
Handle smapling for tracepoint is still during out of line allocation
Handle smapling for JEP331 is via setTLHSamplingTop(size)

Using fake Heap Top instead of fake Heap Alloc for disabling inline
allocation (realHeapAlloc-->realHeapTop,
set/getRealAlloc()-->set/getRealTop(), getRealSize(), getUsedSize())
Using fake Heap Top to force out of line allocation at sampling thresold
for sampling heap allocation (setTLHSamplingTop()/resetTLHSamplingTop())
setTLHSamplingTop(size) are only called in the below 3 cases
1, sampling threshold has been changed via GC-VM api
j9gc_set_allocation_sampling_interval()
2, TLH is refreshed
3, after sampling is done

Counting trace allocation byte includes allocation bytes inside TLH
Cache before flushing(_stats.bytesAllocated(true),
stats->_tlhAllocatedUsed, )
Handle traceAllocationByte for Health
Center(_oolTraceAllocationBytesForTracepoint,
oolObjectSamplingBytesGranularityForTracepoint) and traceAllocationByte
for JEP331(_traceAllocationBytesForHook,
objectSamplingBytesGranularityForHook) independently

depend on eclipse-omr/omr#5260
fix: #7740

Signed-off-by: Lin Hu linhu@ca.ibm.com

LinHu2016 · 2020-05-29T19:05:41Z

@amicic @dmitripivkine please review changes, thanks

dmitripivkine · 2020-06-01T17:35:12Z

runtime/gc_glue_java/EnvironmentDelegate.cpp

+			_vmThread->nonZeroHeapTop =  tlh->realHeapTop;
+			tlh->realHeapTop = NULL;
+		}
+	}


Looks like the question I asked about support for dual TLH mode applies here: Am I reading correctly that if we set size for both TLHs we can get 2 * size to be allocated potentially?

yes, we might not get very accurate result on nonzero case,have not found the better way to handle the non zero case.

Do we have a platform that can actually have both active (btw, is X using just nonZeroHeapAlloc/Top?)? I'm willing to ignore that issue for now, if we think it will take more than a day to do/test it and we can follow up after the upcoming release (by as you said disabling one of two TLHs).

I believe non-Zeroed TLH is using on pLinux (LE and BE) and AIX. I am not sure about current status of zLinux.

Non-zero TLH can be used for primitive arrays only. So to see a mismatch an application should include primitive arrays to the allocation mix.

I agree that proper handling for dual TLH case can be done later in separate change

in order to sampling heap allocation bytes precisely without compromising the performance, we have the below changes. Handle instrumentableAllocateHook and VM_OBJECT_ALLOCATE_WITHIN_THRESHOLD is still via disabling inline allocation Handle smapling for tracepoint is still during out of line allocation Handle smapling for JEP331 is via setTLHSamplingTop(size) Using fake Heap Top instead of fake Heap Alloc for disabling inline allocation (realHeapAlloc-->realHeapTop, set/getRealAlloc()-->set/getRealTop(), getRealSize(), getUsedSize()) Using fake Heap Top to force out of line allocation at sampling thresold for sampling heap allocation (setTLHSamplingTop()/resetTLHSamplingTop()) setTLHSamplingTop(size) are only called in the below 3 cases 1, sampling threshold has been changed via GC-VM api j9gc_set_allocation_sampling_interval() 2, TLH is refreshed 3, after sampling is done Counting trace allocation byte includes allocation bytes inside TLH Cache before flushing(_stats.bytesAllocated(true), stats->_tlhAllocatedUsed, ) Handle traceAllocationByte for Health Center(_oolTraceAllocationBytesForTracepoint, oolObjectSamplingBytesGranularityForTracepoint) and traceAllocationByte for JEP331(_traceAllocationBytesForHook, objectSamplingBytesGranularityForHook) independently Signed-off-by: Lin Hu <linhu@ca.ibm.com>

dmitripivkine · 2020-06-04T17:22:19Z

Jenkins test sanity all jdk11

LinHu2016 · 2020-06-04T17:56:57Z

has verified the latest personal build with customer's JEP331 test
https://hyc-runtimes-jenkins.swg-devops.com/view/OpenJ9%20-%20Personal/job/Pipeline-Build-Test-Personal/6218/
/team/linhu/JEP331/result1
/team/linhu/JEP331/result2
/team/linhu/JEP331/result3

keithc-ca · 2020-06-04T21:29:56Z

debugtools/DDR_VM/src/com/ibm/j9ddr/vm29/j9/gc/GCObjectHeapIteratorAddressOrderedList_V1.java

-						U8Pointer realHeapAlloc = adjustedToRange(vmThread.allocateThreadLocalHeap().realHeapAlloc(), base, top);
-						if(realHeapAlloc.notNull() && isSomethingToAdd(realHeapAlloc, heapTop)) {
-							excludedRangeList.add(new U8Pointer[] {realHeapAlloc, heapTop});
+						U8Pointer realHeapTop = adjustedToRange(vmThread.allocateThreadLocalHeap().realHeapTop(), base, top);


This will fail with NoSuchFieldError when examining core files created before the addition of realHeapTop.

Good point, thank you very much.

LinHu2016 mentioned this pull request May 29, 2020

Sampling allocation bytes precisely without compromising the performance eclipse-omr/omr#5260

Merged

dmitripivkine reviewed Jun 1, 2020

View reviewed changes

LinHu2016 force-pushed the JEP331_update branch 9 times, most recently from 3e18140 to 86eb958 Compare June 4, 2020 13:49

LinHu2016 force-pushed the JEP331_update branch from 86eb958 to d2dd559 Compare June 4, 2020 13:54

amicic approved these changes Jun 4, 2020

View reviewed changes

dmitripivkine approved these changes Jun 4, 2020

View reviewed changes

amicic added the comp:gc label Jun 4, 2020

dmitripivkine merged commit f07d574 into eclipse-openj9:master Jun 4, 2020

keithc-ca reviewed Jun 4, 2020

View reviewed changes

pshipton mentioned this pull request Jun 5, 2020

samplingObjectAllocation.soae001 failed, expected 1+ but got: 0 #9808

Closed

keithc-ca mentioned this pull request Jun 5, 2020

gccheck fails on older core file #9810

Closed

This was referenced Jun 5, 2020

DDR Back compatible for TLH enable/disable change #9813

Merged

(0.21.0) DDR Back compatible for TLH enable/disable change #9850

Merged

keithc-ca mentioned this pull request Jun 19, 2020

Fix access to J9ModronThreadLocalHeap.realHeapAlloc in older core files #9956

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sampling allocation bytes precisely without compromising the performance #9745

Sampling allocation bytes precisely without compromising the performance #9745

LinHu2016 commented May 29, 2020 •

edited

Loading

LinHu2016 commented May 29, 2020

dmitripivkine Jun 1, 2020

LinHu2016 Jun 1, 2020

amicic Jun 1, 2020

dmitripivkine Jun 1, 2020

dmitripivkine Jun 1, 2020

dmitripivkine Jun 1, 2020

dmitripivkine commented Jun 4, 2020

LinHu2016 commented Jun 4, 2020

keithc-ca Jun 4, 2020 •

edited

Loading

dmitripivkine Jun 4, 2020

Sampling allocation bytes precisely without compromising the performance #9745

Sampling allocation bytes precisely without compromising the performance #9745

Conversation

LinHu2016 commented May 29, 2020 • edited Loading

LinHu2016 commented May 29, 2020

dmitripivkine Jun 1, 2020

Choose a reason for hiding this comment

LinHu2016 Jun 1, 2020

Choose a reason for hiding this comment

amicic Jun 1, 2020

Choose a reason for hiding this comment

dmitripivkine Jun 1, 2020

Choose a reason for hiding this comment

dmitripivkine Jun 1, 2020

Choose a reason for hiding this comment

dmitripivkine Jun 1, 2020

Choose a reason for hiding this comment

dmitripivkine commented Jun 4, 2020

LinHu2016 commented Jun 4, 2020

keithc-ca Jun 4, 2020 • edited Loading

Choose a reason for hiding this comment

dmitripivkine Jun 4, 2020

Choose a reason for hiding this comment

LinHu2016 commented May 29, 2020 •

edited

Loading

keithc-ca Jun 4, 2020 •

edited

Loading