8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code #27731

benoitmaillard · 2025-10-09T14:48:37Z

This PR prevents the C2 compiler from hitting memory limits during compilation when using -XX:+StressLoopPeeling and -XX:+VerifyLoopOptimizations in certain edge cases. The fix addresses an issue where the ciEnv arena grows uncontrollably due to the high number of verification passes, a complex IR graph, and repeated field accesses leading to unnecessary memory allocations.

Analysis

This issue was initially detected with the fuzzer. The original test from the fuzzer was reduced
and added to this PR as a regression test.

The test contains a switch inside a loop, and stressing the loop peeling results in
a fairly complex graph. The split-if optimization is applied agressively, and we
run a verification pass at every progress made.

We end up with a relatively high number of verification passes, with each pass being
fairly expensive because of the size of the graph.
Each verification pass requires building a new IdealLoopTree. This is quite slow
(which is unfortunately hard to mitigate), and also causes inefficient memory usage
on the ciEnv arena.

The inefficient usages are caused by the ciInstanceKlass::get_field_by_offset method.
At every call, we have

One allocation on the ciEnv arena to store the returned ciField
The constructor of ciField results in a call to ciObjectFactory::get_symbol, which:
- Allocates a new ciSymbol on the ciEnv arena at every call (when not found in vmSymbols)
- Pushes the new symbol to the _symbols array

The ciEnv objects returned by ciInstanceKlass::get_field_by_offset are only used once, to
check if the BasicType of a static field is a reference type.

In ciObjectFactory, the _symbols array ends up containg a large number of duplicates for certain symbols
(up to several millions), which hints at the fact that ciObjectFactory::get_symbol should not be called
repeatedly as it is done here.

The stack trace of how we get to the ciInstanceKlass::get_field_by_offset is shown below:

ciInstanceKlass::get_field_by_offset ciInstanceKlass.cpp:412
TypeOopPtr::TypeOopPtr type.cpp:3484
TypeInstPtr::TypeInstPtr type.cpp:3953
TypeInstPtr::make type.cpp:3990
TypeInstPtr::add_offset type.cpp:4509
AddPNode::bottom_type addnode.cpp:696
MemNode::adr_type memnode.cpp:73
PhaseIdealLoop::get_late_ctrl_with_anti_dep loopnode.cpp:6477
PhaseIdealLoop::get_late_ctrl loopnode.cpp:6439
PhaseIdealLoop::build_loop_late_post_work loopnode.cpp:6827
PhaseIdealLoop::build_loop_late_post loopnode.cpp:6715
PhaseIdealLoop::build_loop_late loopnode.cpp:6660
PhaseIdealLoop::build_and_optimize loopnode.cpp:5093
PhaseIdealLoop::PhaseIdealLoop loopnode.hpp:1209
PhaseIdealLoop::verify loopnode.cpp:5336
...

Because the ciEnv arena is not fred up between verification passes, it quickly fills up and hits
the memory limit after about 30s of execution in this case.

Proposed fix

As explained in the previous section, the only point of the ciInstanceKlass::get_field_by_offset
call is to obtain the BasicType of the field. By inspecting carefully what this method does,
we notice that the field descriptor fd already contains the type information we need.
We do not actually need all the information embedded in the ciField object.

ciField* ciInstanceKlass::get_field_by_offset(int field_offset, bool is_static) {
  if (!is_static) {
    for (int i = 0, len = nof_nonstatic_fields(); i < len; i++) {
      ciField* field = _nonstatic_fields->at(i);
      int  field_off = field->offset_in_bytes();
      if (field_off == field_offset)
        return field;
    }
    return nullptr;
  }
  VM_ENTRY_MARK;
  InstanceKlass* k = get_instanceKlass();
  fieldDescriptor fd;
  if (!k->find_field_from_offset(field_offset, is_static, &fd)) {
    return nullptr;
  }
  ciField* field = new (CURRENT_THREAD_ENV->arena()) ciField(&fd);
  return field;
}

Hence we can simply create a more specialized version of ciInstanceKlass::get_field_type_by_offset
that directly returns the BasicType without creating the ciField. This happens to
avoid the three memory allocations mentioned before.

After this change, the memory usage of the ciEnv arena stays constant across verification
passes.

Testing

Added test obtained from the fuzzer (and reduced with c-reduce)
GitHub Actions
tier1-3, plus some internal testing

Thank you for reviewing!

Progress

Change must be properly reviewed (1 review required, with at least 1 Reviewer)
Change must not contain extraneous whitespace
Commit message must refer to an issue

Issue

JDK-8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code (Bug - P4)

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/27731/head:pull/27731
$ git checkout pull/27731

Update a local copy of the PR:
$ git checkout pull/27731
$ git pull https://git.openjdk.org/jdk.git pull/27731/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 27731

View PR using the GUI difftool:
$ git pr show -t 27731

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/27731.diff

Using Webrev

Link to Webrev Comment

bridgekeeper · 2025-10-09T14:51:14Z

👋 Welcome back bmaillard! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

openjdk · 2025-10-09T14:51:24Z

❗ This change is not yet ready to be integrated.
See the Progress checklist in the description for automated requirements.

openjdk · 2025-10-09T14:55:09Z

@benoitmaillard The following label will be automatically applied to this pull request:

hotspot-compiler

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command.

mlbridge · 2025-10-10T12:44:23Z

Webrevs

chhagedorn

Nice summary and solution! I have a few comments but otherwise, the fix looks good to me.

I guess it's a discussion for another time if we also want to improve the verification time somehow. But that should not block this PR.

src/hotspot/share/ci/ciInstanceKlass.cpp

test/hotspot/jtreg/compiler/loopopts/TestVerifyLoopOptimizationsHitsMemLimit.java

chhagedorn · 2025-10-10T15:04:36Z

test/hotspot/jtreg/compiler/loopopts/TestVerifyLoopOptimizationsHitsMemLimit.java

+ *      -XX:CompileCommand=compileonly,compiler.loopopts.TestVerifyLoopOptimizationsHitsMemLimit::test
+ *      -XX:-TieredCompilation -Xcomp -XX:CompileCommand=dontinline,*::*
+ *      -XX:+StressLoopPeeling -XX:PerMethodTrapLimit=0 -XX:+VerifyLoopOptimizations
+ *      -XX:StressSeed=1870557292


I suggest to remove the stress seed since it might not trigger anymore in later builds. Usually, we add a run with a fixed stress seed and one without but since this test requires to do just some verification work, I would suggest to not add two runs but only one without fixed seed.

Another question: How close are we to hit the default the memory limit with this test? With your fix it probably consumes not much memory anymore. I therefore suggest to add MemLimit as additional flag with a much smaller value to be sure that your fix works as expected (you might need to check how low we can choose the limit to not run into problems in higher tiers).

I was able to reduce the test further using a memory limit of 100M (approximately 10 times less than the default) and a shorter timeout with creduce. Compilation of the new test method with a fast debug build now takes an average of 1.22 s over 100 runs according to -XX:+CITime.
With the decrease compilation time I think it now reasonable to have two runs (one with the stress seed, one without). Let me know if you think otherwise!

test/hotspot/jtreg/compiler/loopopts/TestVerifyLoopOptimizationsHitsMemLimit.java

src/hotspot/share/ci/ciInstanceKlass.hpp

src/hotspot/share/opto/type.cpp

Co-authored-by: Christian Hagedorn <[email protected]>

Co-authored-by: Damon Fenacci <[email protected]>

Co-authored-by: Christian Hagedorn <[email protected]>

benoitmaillard · 2025-10-15T08:58:25Z

I have made the following (significant) changes that are ready for review:

Replaced the test method with a further reduced version that now takes a little more than one second compared to ~40s previously
Added a second run without a fixed stress seed (as the compilation is now fast enough)
Added a memory limit of 100M

benoitmaillard added 2 commits October 7, 2025 11:50

8366990: Avoid growing ciEnv arena in TypeOopPtr::TypeOopPtr

3801d77

8366990: Add reduced test from the fuzzer

a51c1c4

openjdk bot changed the title ~~8366990~~ 8366990: C2: Compilation stuck when verifying loop opts in Split-If code Oct 9, 2025

openjdk bot added the hotspot-compiler [email protected] label Oct 9, 2025

Minor comments and style changes

61d1d18

benoitmaillard changed the title ~~8366990: C2: Compilation stuck when verifying loop opts in Split-If code~~ 8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code Oct 10, 2025

benoitmaillard marked this pull request as ready for review October 10, 2025 12:38

openjdk bot added the rfr Pull request is ready for review label Oct 10, 2025

Add -XX:+UnlockDiagnosticVMOptions

7cdcd05

chhagedorn reviewed Oct 10, 2025

View reviewed changes

sendaoYan reviewed Oct 11, 2025

View reviewed changes

test/hotspot/jtreg/compiler/loopopts/TestVerifyLoopOptimizationsHitsMemLimit.java Outdated Show resolved Hide resolved

dafedafe reviewed Oct 13, 2025

View reviewed changes

src/hotspot/share/ci/ciInstanceKlass.hpp Show resolved Hide resolved

src/hotspot/share/opto/type.cpp Outdated Show resolved Hide resolved

benoitmaillard and others added 13 commits October 13, 2025 12:22

Update src/hotspot/share/ci/ciInstanceKlass.cpp

5605539

Co-authored-by: Christian Hagedorn <[email protected]>

Update src/hotspot/share/opto/type.cpp

b49d1ae

Co-authored-by: Damon Fenacci <[email protected]>

Update src/hotspot/share/ci/ciInstanceKlass.cpp

37ff941

Co-authored-by: Christian Hagedorn <[email protected]>

Move package after copyright

04582cc

Introduce ciInstanceKlass::get_non_static_field_by_offset

482976d

Add missing const

6c93a87

Add memlimit constraint to the test

4f4728f

Replace test body with reduced, faster version

98a36ab

Add check for expected NPE

1491922

Change name

a27bd07

Remove unnecessary CompileCommand=dontinline

9a00164

Reorder flags

f9d737f

Add run without fixed stress seed

1f13f87

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code #27731

8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code #27731

benoitmaillard commented Oct 9, 2025 •

edited by openjdk bot

Loading

Uh oh!

bridgekeeper bot commented Oct 9, 2025

Uh oh!

openjdk bot commented Oct 9, 2025

Uh oh!

openjdk bot commented Oct 9, 2025 •

edited

Loading

Uh oh!

mlbridge bot commented Oct 10, 2025 •

edited

Loading

Uh oh!

chhagedorn left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chhagedorn Oct 10, 2025 •

edited

Loading

Uh oh!

benoitmaillard Oct 15, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benoitmaillard commented Oct 15, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

6 participants

8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code #27731

Are you sure you want to change the base?

8366990: C2: Compilation hits the memory limit when verifying loop opts in Split-If code #27731

Conversation

benoitmaillard commented Oct 9, 2025 • edited by openjdk bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Analysis

Proposed fix

Testing

Progress

Issue

Reviewing

Uh oh!

bridgekeeper bot commented Oct 9, 2025

Uh oh!

openjdk bot commented Oct 9, 2025

Uh oh!

openjdk bot commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mlbridge bot commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Webrevs

Uh oh!

chhagedorn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chhagedorn Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benoitmaillard Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benoitmaillard commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

6 participants

benoitmaillard commented Oct 9, 2025 •

edited by openjdk bot

Loading

openjdk bot commented Oct 9, 2025 •

edited

Loading

mlbridge bot commented Oct 10, 2025 •

edited

Loading

chhagedorn Oct 10, 2025 •

edited

Loading

benoitmaillard commented Oct 15, 2025 •

edited

Loading