Optimize class ExceptionStateSet to improve both runtime and memory usage. #171

ZhangChangICB · 2025-01-13T07:27:43Z

motivation && reason

Under the original ExceptionStateSet architecture, we observed that during the process of exception propagation, the STA would attempt to create an ExceptionStateSet for each specific Vertex and Tag, even if it was entirely unrelated to that point and could have been completely copied from the ExceptionStateSet of the previous stage. Like following:

if (states == nullptr)
  states = new ExceptionStateSet();
states->insert(state);

This meant that a large number of identical ExceptionStateSet instances were created, even though their contents were exactly the same, leading to unnecessary memory overhead.

Furthermore, under this architecture, every time a Tag Match was performed, the STA needed to repeatedly compare the elements within the ExceptionStateSet. Additionally, when calculating the Tag Hash, it had to repeatedly iterate over and compute the hash values of each Exception in the ExceptionStateSet. This resulted in significant additional runtime overhead.

action && improvement

We refactored the ExceptionStateSet structure and observed over a 5% runtime improvement through testing with the pprof tool on our test design, along with minor memory optimizations. The changes we made include the following:

Ensured that ExceptionStateSet instances with identical contents are treated as the same element.
With the above adjustment, Tag Match/Exception Match operations can now match pointers instead of comparing their actual contents.
Cached the hash of the ExceptionStateSet, eliminating the need to recalculate it every time a Tag is created.
Replaced the use of OwnState for memory management with a reference counting (refCount) approach.

pprof data && evidence

Under design with about 150k instances, 4 corners, dozens of exceptions && clocks.
Read design and find requireds.

Before this pull request:
before_this_pr.pdf
After this pull request:
after_this_pr.pdf
You can see the sample time percentage of function "findTag" improved from 12.3% to 5.9%.

CLAassistant · 2025-01-13T07:27:50Z

All committers have signed the CLA.

ZhangChangICB added 2 commits January 13, 2025 14:40

Optimize Structure ExceptionStateSet for Runtime Improvement.

fb1f99a

Remove unused class member var(ExceptionStateSet::network_).

6e6f29c

ZhangChangICB changed the title ~~Optimize structure ExceptionStateSet to improve both runtime and memory usage.~~ Optimize class ExceptionStateSet to improve both runtime and memory usage. Jan 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize class ExceptionStateSet to improve both runtime and memory usage. #171

Optimize class ExceptionStateSet to improve both runtime and memory usage. #171

Uh oh!

ZhangChangICB commented Jan 13, 2025 •

edited

Loading

Uh oh!

CLAassistant commented Jan 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Optimize class ExceptionStateSet to improve both runtime and memory usage. #171

Are you sure you want to change the base?

Optimize class ExceptionStateSet to improve both runtime and memory usage. #171

Uh oh!

Conversation

ZhangChangICB commented Jan 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

motivation && reason

action && improvement

pprof data && evidence

Uh oh!

CLAassistant commented Jan 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ZhangChangICB commented Jan 13, 2025 •

edited

Loading

CLAassistant commented Jan 13, 2025 •

edited

Loading