8314599: [GenShen] Couple adaptive tenuring and generation size budgeting #27632

earthling-amzn · 2025-10-03T23:19:58Z

Notable changes:

Improvements to logging
More accurate tracking of promotion failures
Use shared allocation for promotions only when the size is above the maximum plab size (not the minimum size)
Use census information gathered during mark to size promotion reserves and old generation

With these changes, GenShen is expected to have fewer promotion failures and this is indeed the case. As a result of this, we expect less time to be spent in concurrent marking and update refs for young collections. We may also expect shorter concurrent evacuation phases because GenShen will have fewer densely packed regions stuck in the young generation. With more objects being promoted, we also expect to see longer remembered set scan times. This is generally the case across all benchmarks, but we do also see some counter-intuitive results.

Here we are comparing 20 executions (10 on x86, 10 on aarch64) of the changes in the PR (experiment) against 20 executions of the same benchmarks results from tip. This is a summary of statistically significant changes of more than 5% across all benchmarks:

Concurrent Evacuation: 7 improvements, 3 regressions
• Best improvements: extremem-large-45g (-29.6%), neo4j-analytics (-26.9%)
• Worst regression: xalan (+53.7%)

Concurrent Marking: 15 improvements, 1 regression  
• Best improvements: hyperalloc_a2048_o4096 (-30.1%), crypto.rsa (-27.3%)
• Only regression: serial (+8.9%)

Concurrent Scan Remembered Set: 7 improvements, 2 regressions
• Best improvements: xalan (-49.4%), pmd (-49.0%), crypto.rsa (-41.8%)
• Worst regression: extremem-phased (+52.4%)

Concurrent Update Refs: 5 improvements, 4 regressions
• Best improvements: crypto.rsa (-36.4%), mnemonics (-28.4%)
• Worst regression: xalan (+89.4%)

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Issue

JDK-8314599: [GenShen] Couple adaptive tenuring and generation size budgeting (Task - P4)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/27632/head:pull/27632
$ git checkout pull/27632

Update a local copy of the PR:
$ git checkout pull/27632
$ git pull https://git.openjdk.org/jdk.git pull/27632/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 27632

View PR using the GUI difftool:
$ git pr show -t 27632

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/27632.diff

Using Webrev

Link to Webrev Comment

…he first cohort that would be tenured

Some of the code is left behind to continue collecting the census when adaptive tenuring is disabled.

…shold

…ntime-option

…tenure

… size?

Added tag jdk-26+12 for changeset 02fe095

…ed to collection set

…ng excess old regions to young generation)

…ld' into make-evac-tracking-runtime-option

…time-option

…vements

…es when computing excess old regions

bridgekeeper · 2025-10-03T23:21:09Z

👋 Welcome back wkemper! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-10-03T23:21:44Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

openjdk · 2025-10-03T23:22:11Z

⚠️ @earthling-amzn This pull request contains merges that bring in commits not present in the target repository. Since this is not a "merge style" pull request, these changes will be squashed when this pull request in integrated. If this is your intention, then please ignore this message. If you want to preserve the commit structure, you must change the title of this pull request to Merge <project>:<branch> where <project> is the name of another project in the OpenJDK organization (for example Merge jdk:master).

openjdk · 2025-10-03T23:22:26Z

@earthling-amzn The following labels will be automatically applied to this pull request:

hotspot-gc
shenandoah

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing lists. If you would like to change these labels, use the /label pull request command.

mlbridge · 2025-10-03T23:26:05Z

Webrevs

kdnilsen

I'm thinking that there should be a change to heap->rebuild_freeset() that adds get_promotion_potential() to the reserve for OldCollector. We did not previously make this reserve because we were only counting data that was to be promoted in place.

This may have the effect of triggering a bit sooner than in existing master. We could subtract out the pip_promo_reserve() if we want to keep track of that separately, because that doesn't need to be reserved.

Once the cset is constructed, we will shrink the reserves because at that point, we'll know better how much we really plan to evacuate into old.

kdnilsen · 2025-10-06T22:35:22Z

src/hotspot/share/gc/shenandoah/shenandoahCollectionSet.inline.hpp


-size_t ShenandoahCollectionSet::get_young_bytes_reserved_for_evacuation() const {
+size_t ShenandoahCollectionSet::get_live_bytes_in_young_regions() const {
  return _young_bytes_to_evacuate - _young_bytes_to_promote;


I'm wondering if these new names properly reflect the intention. It seems get_live_byte_in_young_regions() really means get_live_bytes_that_we_intend_to_evacuate_to_young(). (This number does not include _live_bytes_in_young_regions() that we expect to evacuate to old.)

I called the complementary method get_live_bytes_in_tenurable_regions. How about get_live_bytes_in_untenurable_regions?

I'm still maybe a bit confused. Is it get_untenurable_live_bytes_in_young_regions()? Are we distinguishing?

So we have a total of N live bytes within young regions that have been placed into the collection set. We expect that P bytes (P < N) will be promoted, and the remaining S bytes (S + P == N) will be evacuated to "survivor space" within young.

Does get_live_bytes_in_untenurable_regions() equal P + S?

The tenure here refers to the regions the objects reside in, not the objects themselves. get_live_bytes_in_tenurable_regions is the sum of live bytes in all regions with an age above the tenuring threshold (we expect to promote all of these, though some promotions may fail). It's complement get_live_bytes_in_untenurable_regions is the sum of live bytes in all regions with an age less than the tenuring threshold (we expect to promote some of these, but we don't really know how many). This was part of the reason I wanted to rename these methods. They represent the provenance of the objects in the collection set, not necessarily the regions they will be evacuated to.

src/hotspot/share/gc/shenandoah/shenandoahGeneration.cpp

src/hotspot/share/gc/shenandoah/shenandoahGenerationalHeap.cpp

kdnilsen · 2025-10-06T23:17:01Z

src/hotspot/share/gc/shenandoah/shenandoahGeneration.cpp

-  // Add in the excess_old memory to hold unanticipated promotions, if any.  If there are more unanticipated
-  // promotions than fit in reserved memory, they will be deferred until a future GC pass.
-  size_t total_promotion_reserve = young_advance_promoted_reserve_used + excess_old;
-  old_generation->set_promoted_reserve(total_promotion_reserve);


Can you clarify why we no longer need this set_promoted_reserve() call? (just in a PR comment probably, not necessarily a code comment.)

In compute_evacuation_reserves we are setting the promotion reserve to the maximum possible to handle everything tenurable this cycle (this is still capped by the maximum evacuation reserve for old). I was reluctant to scale the promotion reserve by ShenandoahPromoEvacWaste for fear it would over commit the collector's reserves and lead to OOM errors during evacuation.

So in the new design, we have full awareness of all promotable objects, and we've already done our best to budget for those. So there's no such thing as "unanticipated promotions".

Separate question is whether we scale promotion reserve by ShenandoahPromoEvacWaste. So if old is larger than necessary to handle the anticipated mixed evacuations and promotions, the new code is essentially saying "use this extra space for mixed evacuations rather than for promotions". Since we're not expanding the promoted_reserve, promotions will not be allowed to touch it.

Am I understanding the intent correctly?

In compute_evacuation_budgets, if there are mixed collection candidates we set the initial promotion reserve to zero and the old evacuation reserve to the maximum. However, we then restrict old evacuation reserve to only empty regions. The difference between old available and old unaffiliated is given to the promotion reserve. Here again, I didn't want to scale the promotion reserve because it's basically the scraps of the old generation and I worry about over committing the old reserve. When there are no mixed collections, we use the entirety of old for promotions. Any old regions not needed for old evacuations or promotions are transferred to the young generation as they were before this change.

earthling-amzn added 30 commits July 31, 2025 17:19

Some cleanup and a test harness for adaptive tenuring

18ab073

Assert current behavior is expected

280d8d1

Make evac tracking a runtime feature, add logging for plab management

47a84e0

Instrumentation tweaks

43d25e3

Instrumentation tweaks

c3b4e89

Are some threads not getting plab promotions re-enabled?

be111f0

Re-enable plab promotions for all threads when plabs are retired

0429530

Remove unused ShenandoahThreadLocalData::_paced_time

7dd61a3

Log percentage of population which is above tenuring threshold

101cdf2

Tone down some log messages, draw the tenuring threshold line above t…

c6a3467

…he first cohort that would be tenured

Remove census-at-evac option

038aa46

Some of the code is left behind to continue collecting the census when adaptive tenuring is disabled.

Little clean up as I read

090ae87

Deduplicate some collection set logging

071bc88

Idle clean ups, logging improvements

7c2de24

Merge remote-tracking branch 'jdk/master' into adaptive-tenuring-thre…

f472787

…shold

Update unit test, fix slowdebug build issue

8d7cd39

Remove outdated comment

b13e8b1

Add more census updates, exhibit current behavior in test

86c429a

Idle clean ups

704753f

Merge branch 'adaptive-tenuring-threshold' into make-evac-tracking-ru…

48a330f

…ntime-option

Add a method to get the total bytes occupied by objects eligible for …

a3e5fb4

…tenure

Instrumentation to guage potential changes to promotion reserves

c4fb8cf

Fix linker error

ca6ca98

How is promotion potential higher than next cycle's tenurable objects…

89a6212

… size?

Oops, age table sizes are words

d3a63ec

Add test that simulates promotion above tenuring age

32f6b3b

Checkpoint, tests pass

642c1a0

Clean up tests

82c06f2

Merge tag 'jdk-26+12' into adaptive-tenuring-threshold

2c7275f

Added tag jdk-26+12 for changeset 02fe095

Revert unintended change

64c6839

earthling-amzn added 17 commits August 22, 2025 15:27

Use age census to size promotion reserve for current cycle

fb5e0fc

Adjust promotion reserve based only on old regions that have been add…

e1cdba7

…ed to collection set

Why do we keep giving away all our old reserve?

d306a51

Count promotion reserve in old consumed

25c488e

Fix windows build

b9e16d2

Keep promotion reserve (we already accounted for this when transferri…

268bddf

…ng excess old regions to young generation)

More instrumentation

2bd2e41

Merge remote-tracking branch 'earthling-jdk/adaptive-tenuring-thresho…

9f2be25

…ld' into make-evac-tracking-runtime-option

Merge remote-tracking branch 'jdk/master' into make-evac-tracking-run…

5845982

…time-option

Tweak comment

3037ca8

Merge remote-tracking branch 'jdk/master' into promotion-budget-impro…

dc3458a

…vements

Merge remote-tracking branch 'jdk/master' into promotion-budget-impro…

3841fc0

…vements

Merge fallout

38cda58

More accurate method names for cset fields, consider promotion reserv…

5809aa5

…es when computing excess old regions

Only use old generation in generational mode

35e0417

Fix wrong asserts

1f757d1

Little cleanup

fd9619d

openjdk bot added hotspot-gc [email protected] shenandoah [email protected] labels Oct 3, 2025

openjdk bot added the rfr Pull request is ready for review label Oct 3, 2025

earthling-amzn added 2 commits October 6, 2025 07:47

Fix windows build

8fca0b6

Fix windows build more

09926eb

kdnilsen reviewed Oct 6, 2025

View reviewed changes

Review feedback, bug fixes

b4d1cf9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

8314599: [GenShen] Couple adaptive tenuring and generation size budgeting #27632

8314599: [GenShen] Couple adaptive tenuring and generation size budgeting #27632

Uh oh!

earthling-amzn commented Oct 3, 2025 •

edited by openjdk bot

Loading

Uh oh!

bridgekeeper bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025 •

edited

Loading

Uh oh!

mlbridge bot commented Oct 3, 2025 •

edited

Loading

Uh oh!

kdnilsen left a comment

Uh oh!

kdnilsen Oct 6, 2025

Uh oh!

earthling-amzn Oct 7, 2025

Uh oh!

kdnilsen Oct 7, 2025

Uh oh!

earthling-amzn Oct 8, 2025

Uh oh!

Uh oh!

Uh oh!

kdnilsen Oct 6, 2025

Uh oh!

earthling-amzn Oct 7, 2025

Uh oh!

kdnilsen Oct 7, 2025

Uh oh!

earthling-amzn Oct 8, 2025

Uh oh!

Uh oh!

8314599: [GenShen] Couple adaptive tenuring and generation size budgeting #27632

Are you sure you want to change the base?

8314599: [GenShen] Couple adaptive tenuring and generation size budgeting #27632

Uh oh!

Conversation

earthling-amzn commented Oct 3, 2025 • edited by openjdk bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Progress

Issue

Reviewing

Uh oh!

bridgekeeper bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025

Uh oh!

openjdk bot commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mlbridge bot commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Webrevs

Uh oh!

kdnilsen left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

earthling-amzn commented Oct 3, 2025 •

edited by openjdk bot

Loading

openjdk bot commented Oct 3, 2025 •

edited

Loading

mlbridge bot commented Oct 3, 2025 •

edited

Loading