Adjust initial tlab size #25423

kdnilsen · 2025-05-23T20:13:21Z

We have found with certain workloads that the initial and maximum TLAB sizes result in very high latencies for the first few invocations of particular methods on certain threads. The root cause is that TLABs are too large, which depletes allocatable memory too quickly. When large numbers of threads try to start up at the same time, some of them end up with no TLABs or very small TLABs, and their work runs hundreds of times slower than that of the threads that were able to grab very large TLABs.

This PR reduces the maximum TLAB size and adjusts the initial TLAB size in order to reduce the impact of this problem.
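
To make the depletion effect concrete, here is a rough back-of-the-envelope sketch in shell arithmetic. The 4 MiB region size and the 500 customer threads come from the small workload script below. Everything else is an illustrative guess rather than a confirmed value: the assumption that a thread could previously claim a TLAB as large as a whole region, the divisors 8 and 128 (read off the "MaxRSWby8" and "TLABisRSBby128" labels used in this PR), and the helper name tlab_demand_kib.

```shell
#!/usr/bin/env bash
# Back-of-the-envelope TLAB demand, in KiB. All sizing rules here are
# assumptions for illustration, not values confirmed by the PR.
region_kib=$((4 * 1024))      # -XX:ShenandoahMinRegionSize=4M from the small workload
threads=500                   # -dCustomerThreads=500 from the small workload

# Hypothetical helper: total KiB needed if every thread holds one TLAB
# of the given size.
tlab_demand_kib() {
  local tlab_kib=$1 thread_count=$2
  echo $((tlab_kib * thread_count))
}

old_max_kib=$region_kib                # assumed: one TLAB could span a whole region
new_max_kib=$((region_kib / 8))        # assumed from the "MaxRSWby8" label
new_initial_kib=$((region_kib / 128))  # assumed from the "TLABisRSBby128" label

echo "old max:     $(( $(tlab_demand_kib "$old_max_kib" "$threads") / 1024 )) MiB"
echo "new max:     $(( $(tlab_demand_kib "$new_max_kib" "$threads") / 1024 )) MiB"
echo "new initial: $(( $(tlab_demand_kib "$new_initial_kib" "$threads") / 1024 )) MiB"
```

Under these assumptions, 500 threads each holding a region-sized TLAB would tie up roughly 2000 MiB of the 4 GiB heap, against about 250 MiB at the reduced maximum and about 15 MiB at the reduced initial size.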

This PR also changes the value of TLABAllocationWeight from 90 to 35 when we are running in generational mode. 35 is the default value used for G1 GC, which is also generational. The default value of 90 was established years ago for non-generational Shenandoah because it tends to have less frequent GC cycles than generational collectors.
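
As a rough illustration of why the weight matters, the sketch below applies a percentage-weighted exponential average, new = ((100 - weight) * old + weight * sample) / 100. This is the general form assumed here for how TLABAllocationWeight blends a new sample into the running estimate. The function name exp_avg and the sample values are hypothetical, and any warm-up handling the JVM applies to early samples is ignored.

```shell
#!/usr/bin/env bash
# Hypothetical helper: blend one new sample into a running average.
# weight is the percentage (0-100) given to the newest sample.
exp_avg() {
  local avg=$1 sample=$2 weight=$3
  awk -v a="$avg" -v s="$sample" -v w="$weight" \
    'BEGIN { printf "%.1f\n", ((100 - w) * a + w * s) / 100 }'
}

# A running estimate of 100 units, followed by one spike of 1000 units.
echo "weight 90: $(exp_avg 100 1000 90)"   # prints "weight 90: 910.0"
echo "weight 35: $(exp_avg 100 1000 35)"   # prints "weight 35: 415.0"
```

With a weight of 90 a single sample almost replaces the estimate, while a weight of 35 moves it only about a third of the way toward the new sample, so the estimate stays smoother when samples arrive frequently.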

With a "small" workload, the most significant benefit of this change is seen with p99.99 (66.1% latency improvement) and p99.999 (62.6% latency improvement). At other percentiles, the latency slightly increased (0.6% at p50, 1.7% at p100).

The small workload is represented by the following execution script:

            ~/github/jdk.adjust-initial-tlab-size/build/linux-x86_64-server-release/images/jdk/bin/java \
                -XX:ActiveProcessorCount=2 \
                -XX:+UnlockExperimentalVMOptions \
                -XX:-ShenandoahPacing \
                -XX:+AlwaysPreTouch -XX:+DisableExplicitGC -Xms4g -Xmx4g \
                -XX:+UseShenandoahGC -XX:ShenandoahGCMode=generational \
                -XX:ShenandoahFullGCThreshold=1024 \
                -XX:ShenandoahMinRegionSize=4M \
                -Xlog:"gc*=info,ergo" \
                -Xlog:safepoint=trace -Xlog:safepoint=debug -Xlog:safepoint=info \
                -XX:+UnlockDiagnosticVMOptions \
                -jar ~/github/heapothesys.fix-two-bugs/Extremem/src/main/java/extremem.jar \
                -dDictionarySize=3000000 \
                -dNumCustomers=30000 \
                -dNumProducts=30000 \
                -dCustomerThreads=500 \
                -dCustomerPeriod=5s \
                -dCustomerThinkTime=1s \
                -dKeywordSearchCount=1 \
                -dSelectionCriteriaCount=3 \
                -dProductReviewLength=12 \
                -dServerThreads=5 \
                -dServerPeriod=10s \
                -dProductNameLength=10 \
                -dBrowsingHistoryQueueCount=5 \
                -dSalesTransactionQueueCount=5 \
                -dProductDescriptionLength=40 \
                -dProductReplacementPeriod=60s \
                -dProductReplacementCount=25 \
                -dCustomerReplacementPeriod=60s \
                -dCustomerReplacementCount=1500 \
                -dBrowsingExpiration=1m \
                -dPhasedUpdates=true \
                -dPhasedUpdateInterval=180s \
                -dSimulationDuration=25m \
                -dResponseTimeMeasurements=100000 \
                >$t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.out 2>$t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.err &
            job_pid=$!
            sleep 1500
            cpu_percent=$(ps -o cputime -o etime -p $job_pid)
            rss_kb=$(ps -o rss= -p $job_pid)
            rss_mb=$((rss_kb / 1024))
            wait $job_pid
            echo "RSS: $rss_mb MB" >>$t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.out 2>>$t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.err
            echo "$cpu_percent" >>$t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.out
            gzip $t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.out $t.genshen.MaxRSWby8-TLABisRSBby128.small.overrides.err

With a "medium" workload, the impact is mixed, ranging from 9% improvement at p100 to 22.4% degradation at p99.999.

The medium workload is represented by this execution script:

            ~/github/jdk.adjust-initial-tlab-size/build/linux-x86_64-server-release/images/jdk/bin/java \
                -XX:+UnlockExperimentalVMOptions \
                -XX:-ShenandoahPacing \
                -XX:+AlwaysPreTouch -XX:+DisableExplicitGC -Xms31g -Xmx31g \
                -XX:+UseShenandoahGC -XX:ShenandoahGCMode=generational \
                -XX:ShenandoahFullGCThreshold=1024 \
                -Xlog:"gc*=info,ergo" \
                -Xlog:safepoint=trace -Xlog:safepoint=debug -Xlog:safepoint=info \
                -XX:+UnlockDiagnosticVMOptions \
                -jar ~/github/heapothesys/Extremem/src/main/java/extremem.jar \
                -dDictionarySize=3000000 \
                -dNumCustomers=8000000 \
                -dNumProducts=1800000 \
                -dCustomerThreads=500 \
                -dCustomerPeriod=5s \
                -dCustomerThinkTime=1s \
                -dKeywordSearchCount=1 \
                -dSelectionCriteriaCount=2 \
                -dProductReviewLength=32 \
                -dServerThreads=5 \
                -dServerPeriod=10s \
                -dProductNameLength=10 \
                -dBrowsingHistoryQueueCount=5 \
                -dSalesTransactionQueueCount=5 \
                -dProductDescriptionLength=34 \
                -dProductReplacementPeriod=60s \
                -dProductReplacementCount=25 \
                -dCustomerReplacementPeriod=60s \
                -dCustomerReplacementCount=1500 \
                -dBrowsingExpiration=1m \
                -dPhasedUpdates=true \
                -dPhasedUpdateInterval=180s \
                -dSimulationDuration=25m \
                -dResponseTimeMeasurements=100000 \
                >$t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.out 2>$t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.err &
            job_pid=$!
            sleep 1500
            cpu_percent=$(ps -o cputime -o etime -p $job_pid)
            rss_kb=$(ps -o rss= -p $job_pid)
            rss_mb=$((rss_kb / 1024))
            wait $job_pid
            echo "RSS: $rss_mb MB" >>$t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.out
            echo "$cpu_percent" >>$t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.out
            gzip $t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.out $t.genshen.medium.MaxTLABisRSWby8-TLABisRSBby128.err

The huge workload comparisons are still being tested...

The huge workload is represented by this execution script:

            ~/github/jdk.adjust-initial-tlab-size/build/linux-x86_64-server-release/images/jdk/bin/java \
                -XX:ActiveProcessorCount=16 \
                -XX:+UnlockExperimentalVMOptions \
                -XX:-ShenandoahPacing \
                -XX:+AlwaysPreTouch -XX:+DisableExplicitGC -Xms512g -Xmx512g \
                -XX:+UseShenandoahGC -XX:ShenandoahGCMode=generational \
                -XX:ShenandoahFullGCThreshold=1024 \
                -XX:ShenandoahGuaranteedGCInterval=0 \
                -XX:ShenandoahGuaranteedOldGCInterval=0 \
                -XX:ShenandoahGuaranteedYoungGCInterval=0 \
                -Xlog:"gc*=info,ergo" \
                -Xlog:safepoint=trace -Xlog:safepoint=debug -Xlog:safepoint=info \
                -XX:+UnlockDiagnosticVMOptions \
                -jar ~/github/heapothesys/Extremem/src/main/java/extremem.jar \
                -dDictionarySize=3000000 \
                -dNumCustomers=210000000 \
                -dNumProducts=18000000 \
                -dCustomerThreads=2000 \
                -dCustomerPeriod=2000ms \
                -dCustomerThinkTime=300ms \
                -dKeywordSearchCount=2 \
                -dAllowAnyMatch=false \
                -dSelectionCriteriaCount=3 \
                -dProductReviewLength=96 \
                -dBuyThreshold=0.5 \
                -dSaveForLaterThreshold=0.15 \
                -dBrowsingExpiration=5m \
                -dServerThreads=20 \
                -dServerPeriod=10s \
                -dProductNameLength=6 \
                -dProductDescriptionLength=70 \
                -dBrowsingHistoryQueueCount=1 \
                -dSalesTransactionQueueCount=1 \
                -dProductReplacementPeriod=60s \
                -dProductReplacementCount=25 \
                -dCustomerReplacementPeriod=60s \
                -dCustomerReplacementCount=150 \
                -dBrowsingExpiration=1m \
                -dSimulationDuration=25m \
                -dResponseTimeMeasurements=100000 \
                -dPhasedUpdates=true \
                -dPhasedUpdateInterval=180s \
                >$t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.out 2>$t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.err &
            job_pid=$!
            sleep 3000
            cpu_percent=$(ps -o cputime -o etime -p $job_pid)
            rss_kb=$(ps -o rss= -p $job_pid)
            rss_mb=$((rss_kb / 1024))
            wait $job_pid
            echo "RSS: $rss_kb KB" >>$t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.out
            echo "RSS: $rss_mb MB" >>$t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.out
            echo "$cpu_percent" >>$t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.out
            gzip $t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.out $t.genshen.huge.MaxTLABisRSWby8-TLABisRSBisRSBby128.err

We also tested the impact of this change on one of our current development branches, identified as adaptive-evac-with-surge. The performance of this development branch, which we are in the process of merging upstream, is what originally motivated the effort to explore improved TLAB sizes.

For the same small workload described above running on a c6a.2xlarge host, the most significant benefits are seen at the p99.99, p99.999, and p100 percentiles, with 50.1%, 17.6%, and 98.2% improvement respectively.

When this small workload is run on a m5.4xlarge host, we still see very significant benefits at p100, but degradation at p99.999.

The medium workload performed especially poorly without the improvements provided by this PR. All percentiles except p50 show very large improvement.

The huge workload is roughly neutral with this PR.

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/25423/head:pull/25423
$ git checkout pull/25423

Update a local copy of the PR:
$ git checkout pull/25423
$ git pull https://git.openjdk.org/jdk.git pull/25423/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 25423

View PR using the GUI difftool:
$ git pr show -t 25423

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/25423.diff

bridgekeeper · 2025-05-23T20:14:12Z

👋 Welcome back kdnilsen! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-05-23T20:15:01Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

openjdk · 2025-05-23T20:15:19Z

⚠️ @kdnilsen This pull request contains merges that bring in commits not present in the target repository. Since this is not a "merge style" pull request, these changes will be squashed when this pull request is integrated. If this is your intention, then please ignore this message. If you want to preserve the commit structure, you must change the title of this pull request to Merge <project>:<branch> where <project> is the name of another project in the OpenJDK organization (for example Merge jdk:master).

openjdk · 2025-05-23T20:15:28Z

@kdnilsen The following labels will be automatically applied to this pull request:

hotspot-gc
shenandoah

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing lists. If you would like to change these labels, use the /label pull request command.

kdnilsen · 2025-05-23T20:28:29Z

Leaving this in draft while I prepare details for review.

kdnilsen added 30 commits January 12, 2024 01:06

Improve documentation of how Evac-OOM Protocol works

702710e

Merge branch 'openjdk:master' into master

61b575f

Revert "Improve documentation of how Evac-OOM Protocol works"

51d056f

This reverts commit 702710e.

Merge branch 'openjdk:master' into master

ba98e42

Merge branch 'openjdk:master' into master

441487c

Merge branch 'openjdk:master' into master

dafc363

Merge branch 'openjdk:master' into master

c4c252e

Merge branch 'openjdk:master' into master

41ba86a

Merge branch 'openjdk:master' into master

f215a70

Merge branch 'openjdk:master' into master

4d6b5cd

Merge branch 'openjdk:master' into master

7fe605f

Merge branch 'openjdk:master' into master

2e224f6

Merge branch 'openjdk:master' into master

46ad5c6

Merge branch 'openjdk:master' into master

9a1989d

Merge branch 'openjdk:master' into master

4126c22

Merge branch 'openjdk:master' into master

981692e

Make GC logging less verbose

3a67b1f

Revert "Make GC logging less verbose"

3692312

This reverts commit 3a67b1f.

Merge branch 'openjdk:master' into master

045590b

Merge branch 'openjdk:master' into master

fbbd88c

Merge branch 'openjdk:master' into master

7e0edf0

Merge branch 'openjdk:master' into master

3525369

Merge branch 'openjdk:master' into master

fe0da51

Merge branch 'openjdk:master' into master

db12fe5

Merge branch 'openjdk:master' into master

0440bae

Merge branch 'openjdk:master' into master

3bdc022

Merge branch 'openjdk:master' into master

1ee2ff1

Merge branch 'openjdk:master' into master

e6e772f

Merge branch 'openjdk:master' into master

c5a159e

Merge branch 'openjdk:master' into master

e7ca4f8

kdnilsen added 16 commits March 27, 2025 15:59

Merge branch 'openjdk:master' into master

42a93c7

Merge branch 'openjdk:master' into master

3841ca6

Merge branch 'openjdk:master' into master

9386e90

Merge branch 'openjdk:master' into master

0252a5c

Merge branch 'openjdk:master' into master

e029b8c

Reduce max tlab size for better startup

457af8b

Make test allocate faster to avoid timeouts

adc9b31

Override TLABSize and TLABWeight defaults

b888d75

Use same TLABWeight as G1 for GenShen

5040311

MaxTLABisRSWby2 TLABisRSBby128

04dd163

MaxTLABisRSWby2 TLABisDefault

543aa4e

MaxTLABisRSWby1-TLABisDefault

f6bf43a

MaxTLABisRSWby1 TLABSisDefault TLABAllocationWeight=90

8dc04fb

MaxTLABisRSWby1 TLABSisRSBby128 TLABAllocationWeight=35

3f758df

Add constraints for very large heap sizes with MaxTLABisRSWby8 TLABis…

3f5fa85

…RSBby128

Merge branch 'adjust-initial-tlab-size' of https://github.com/kdnilse…

1446519

…n/jdk into adjust-initial-tlab-size

openjdk bot added hotspot-gc hotspot-gc-dev@openjdk.org shenandoah shenandoah-dev@openjdk.org labels May 23, 2025

Tidy up for review

50890ad

kdnilsen marked this pull request as draft May 23, 2025 20:27

MaxRSWby32-TLABisRSBby128

a3452af
