Allow stress test to trigger at byte-level granularity #198

k-sareen · 2020-12-23T09:42:16Z

This PR re-hauls the Stress GC trigger mechanism in MMTk so that we can trigger a GC at byte-level granularity as opposed to the page-level granularity we have right now. This mechanism will also be appropriated by the analysis tracer (such as sanity) in future PRs.

Note: I have only tested this for the openjdk binding. I have informally tested the mechanism on small test cases such as fop.

qinsoon · 2021-01-03T21:25:50Z

There seems to be an issue for running workflows from a fork. I will work on that today.

k-sareen · 2021-01-04T03:32:27Z

Using secrets in a pull request from a fork: https://docs.github.com/en/free-pro-team@latest/actions/reference/events-that-trigger-workflows#pull_request_target
https://github.blog/2020-08-03-github-actions-improvements-for-fork-and-pull-request-workflows/

Most likely what we need

qinsoon · 2021-01-04T05:31:45Z

Using secrets in a pull request from a fork: https://docs.github.com/en/free-pro-team@latest/actions/reference/events-that-trigger-workflows#pull_request_target
https://github.blog/2020-08-03-github-actions-improvements-for-fork-and-pull-request-workflows/

Most likely what we need

Thanks for the links. I have got a seemingly working fix here: #200. The only issue I noticed is that it cannot post perf results as comments (as GITHUB_TOKEN from a pull request does not have permission to post comments). We can still see the results as artefacts for the runs. Switching to use pull_request_target could solve this. I am not using it at the moment, mainly because it needs to be in the base branch and I cannot test it from a PR. I may later do another PR to use pull_request_target.

You can wait until #200 gets merged (should be tomorrow), and then merge with master. Hopefully things will work.

qinsoon

There are two issues for this PR.

There are race conditions and the implementation does not guarantee precise triggering per bytes. For example, we would like to see stress GC for every 8 bytes allocation, and we are allocating 8 bytes objects in two mutator threads. In this line

mmtk-core/src/util/alloc/bumpallocator.rs

Line 121 in 201d442

&& (base.allocation_count.load(Ordering::Relaxed) > base.options.stress_factor)

both threads may see allocation_count == 0 and pass the check. In this case, they both allocate from their local buffer and no stress GC is triggered with 16 bytes allocated. To fix this, you probably would need to restructure the code a bit and use atomic operations to check and update the allocation count.
The implementation is very specific about bump allocator. For example, large object allocator does not trigger stress GC in this implementation. In my understanding, we should have some general code in allocator.rs which works for all allocators (e.g. check and add allocation bytes), and have some allocator-specific code that each allocator is required to implement (e.g. how to allocate when we are in slow path but do not need to acquire a new block).

src/util/alloc/bumpallocator.rs

src/util/options.rs

src/plan/global.rs

k-sareen · 2021-01-06T03:33:06Z

Since we are calling all shots off on performance, we can address the first point by using sequential consistency. That'll force the two threads to synchronize in sense.

For the second, I'll refactor updating the allocation count semantics into alloc_slow_inline. On a similar note, is it expected for the PR to change other allocators such as large object etc?

qinsoon · 2021-01-06T04:51:18Z

For the second, I'll refactor updating the allocation count semantics into alloc_slow_inline. On a similar note, is it expected for the PR to change other allocators such as large object etc?

Yeah, I hope so. It is a good test to see if you have cleanly separated general code and allocator-specific code. Large object allocator does not have a fastpath, which means once you have separated the code, you should need minimal changes to large object allocator. If things are not going this way (i.e. you still need a lot of changes for large object allocator), please let me know and I will look more into your code.

k-sareen · 2021-01-08T00:23:17Z

I believe all your concerns should have been met. I have been unable to test the interaction of stress testing with large object space, but since the count is updated in alloc_slow_inline() now, it should work.

EDIT: Apologies for the messy git log. If possible, please squash the formatting commits during merge. I'll set up a reminder to run the ci-style script before every commit

qinsoon · 2021-01-08T01:03:13Z

I believe all your concerns should have been met. I have been unable to test the interaction of stress testing with large object space, but since the count is updated in alloc_slow_inline() now, it should work.

The race condition still exists. As long as reading and updating allocation_bytes is not atomic, there is a race condition. It would end up allocating more bytes than the defined stress factor before triggering a GC. To solve this, you would need to refactor to put reading and updating code together and make it atomic. You can refer to this: https://stackoverflow.com/questions/47753528/how-to-compare-and-increment-an-atomic-variable

However, I think this PR is good to merge, and that race issue can be resolved in a separate PR.

EDIT: Apologies for the messy git log. If possible, please squash the formatting commits during merge. I'll set up a reminder to run the ci-style script before every commit

Don't worry about that. We always do squash and merge.

javadamiri

LGTM

Allow stress test to trigger at byte-level granularity

ccd7124

qinsoon added the PR-testing Run binding tests for the pull request (deprecated: use PR-extended-testing instead) label Jan 2, 2021

Fix style and formatting

67ca98c

qinsoon added this to the v0.3 milestone Jan 5, 2021

Merge branch 'master' into byte-level-stress

201d442

qinsoon requested changes Jan 6, 2021

View reviewed changes

k-sareen added 3 commits January 7, 2021 11:04

Fix race condition in accessing allocation bytes

ddcf1ba

Refactor updating allocation bytes

08fb990

Fix bug where first allocation after a GC was double counted

4886e82

Fix style and formatting

be5e6fb

qinsoon approved these changes Jan 8, 2021

View reviewed changes

qinsoon added the PR-benchmarking label Jan 8, 2021

qinsoon requested a review from javadamiri January 12, 2021 00:16

javadamiri approved these changes Jan 12, 2021

View reviewed changes

qinsoon merged commit bb9fd6c into mmtk:master Jan 12, 2021

k-sareen mentioned this pull request Feb 23, 2021

Update the stress factor for CIs #253

Open

qinsoon mentioned this pull request Apr 1, 2021

Generalization of Sanity GC #187

Closed

qinsoon pushed a commit to qinsoon/mmtk-core that referenced this pull request Mar 28, 2023

Update OpenJDK to jdk-11.0.19+1-mmtk (mmtk#198)

7ea4d23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow stress test to trigger at byte-level granularity #198

Allow stress test to trigger at byte-level granularity #198

Uh oh!

k-sareen commented Dec 23, 2020

Uh oh!

qinsoon commented Jan 3, 2021

Uh oh!

k-sareen commented Jan 4, 2021

Uh oh!

qinsoon commented Jan 4, 2021

Uh oh!

qinsoon left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

k-sareen commented Jan 6, 2021

Uh oh!

qinsoon commented Jan 6, 2021

Uh oh!

k-sareen commented Jan 8, 2021 •

edited

Loading

Uh oh!

qinsoon commented Jan 8, 2021

Uh oh!

javadamiri left a comment

Uh oh!

Uh oh!

Allow stress test to trigger at byte-level granularity #198

Allow stress test to trigger at byte-level granularity #198

Uh oh!

Conversation

k-sareen commented Dec 23, 2020

Uh oh!

qinsoon commented Jan 3, 2021

Uh oh!

k-sareen commented Jan 4, 2021

Uh oh!

qinsoon commented Jan 4, 2021

Uh oh!

qinsoon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

k-sareen commented Jan 6, 2021

Uh oh!

qinsoon commented Jan 6, 2021

Uh oh!

k-sareen commented Jan 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qinsoon commented Jan 8, 2021

Uh oh!

javadamiri left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

k-sareen commented Jan 8, 2021 •

edited

Loading