Only preempt simulator testbenches on explicit wait points #1231

whitequark · 2024-03-23T06:43:16Z

Before this commit, testbenches (generators added with add_testbench) were not only preemptible after any yield, but were guaranteed to be preempted by another testbench after every yield. This is evil: if you have any race condition between testbenches, which is common, this scheduling strategy will maximize the resulting nondeterminism by interleaving your testbench with every other one as much as possible. This behavior is an outcome of the way add_testbench is implemented, which is by yielding Settle() after every command.

One can observe that:

- yield value_like should never preempt;
- yield assignable.eq() in add_process() should not preempt, since it only sets a next signal state, or appends to write_queue of a memory state, and never wakes up processes;
- yield assignable.eq() in add_testbench() should only preempt if changing assignable wakes up an RTL process. (It could potentially also preempt if that wakes up another testbench, but this has no benefit and requires sim.set() from RFC 36 to be awaitable, which is not desirable.)

After this commit, PySimEngine._step() is implemented with two nested loops instead of one. The outer loop iterates through every testbench and runs it until an explicit wait point (Settle(), Delay(), or Tick()), terminating when no testbenches are runnable. The inner loop is the usual eval/commit loop, running whenever a testbench changes design state.
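The two-loop structure can be illustrated with a toy model. This is only a sketch under stated assumptions, not the code in this PR: `WAIT`, `set_sig`, `step`, `settle`, and both benches are hypothetical names, the "design" is a single XOR, and only one pass of the outer loop is modelled.

```python
# Toy model only: none of these names are Amaranth's real internals.
WAIT = "wait"  # stands in for an explicit wait point: Settle(), Delay(), or Tick()

def set_sig(name, value):
    """Hypothetical command meaning 'drive this signal to this value'."""
    return ("set", name, value)

def settle(state):
    """Inner loop: a stand-in for eval/commit. Here y follows a XOR b."""
    while True:
        new_y = state["a"] ^ state["b"]
        if state["y"] == new_y:
            return
        state["y"] = new_y

def step(testbenches, state):
    """Outer loop: run each testbench, in list order, up to its next wait point."""
    trace = []
    for name, bench in testbenches:
        while True:
            try:
                command = next(bench)
            except StopIteration:
                break
            if command == WAIT:
                break  # only an explicit wait point hands control to the next bench
            _, signal, value = command
            if state[signal] != value:
                state[signal] = value
                settle(state)  # design state changed, so settle before continuing
            trace.append((name, signal, value))
    return trace

def bench_a():
    yield set_sig("a", 1)
    yield set_sig("a", 0)
    yield WAIT
    yield set_sig("a", 1)  # not reached during the first step

def bench_b():
    yield set_sig("b", 1)
    yield WAIT

state = {"a": 0, "b": 0, "y": 0}
trace = step([("A", bench_a()), ("B", bench_b())], state)
```

In this model both assignments made by bench A complete before bench B runs at all, which is the property the description above asks for: no interleaving between wait points.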

PySimEngine._processes is a set, which doesn't have a deterministic iteration order. This does not matter for processes, where determinism is guaranteed by the eval/commit loop, but causes racy testbenches to pass or fail nondeterministically (in practice depending on the memory layout of the Python process). While it is best to not have races in the testbenches, this commit makes PySimEngine._testbenches a list, making the outcome of a race deterministic, and enabling a hacky workaround to make them work: reordering calls to add_testbench().
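A minimal sketch of why list order makes a race reproducible, using plain Python functions as stand-ins for testbenches. `run_in_order`, `writer_one`, and `writer_two` are hypothetical names, not part of the simulator.

```python
def run_in_order(benches):
    """Run each stand-in testbench to completion, in list order."""
    shared = {"value": None}
    for bench in benches:
        bench(shared)
    return shared["value"]

def writer_one(shared):
    shared["value"] = 1

def writer_two(shared):
    shared["value"] = 2

# The two writers race on the same value; with a list, the last one added wins.
first = run_in_order([writer_one, writer_two])
second = run_in_order([writer_two, writer_one])
```

Swapping the two entries flips the result, which corresponds to the workaround of reordering calls to add_testbench(). With a set, the iteration order of such objects would depend on their hashes, which by default derive from their addresses.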

A potential future improvement is a simulation mode that, instead, randomizes the scheduling of testbenches, exposing race conditions early.
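One way such a mode might look, sketched with the same kind of stand-in testbenches. Nothing here exists in the simulator: `run_shuffled`, `outcomes`, and the four benches are hypothetical, and the real scheduler would need to shuffle at every wait point rather than once per run.

```python
import random

def run_shuffled(benches, seed):
    """Run stand-in testbenches to completion in a seed-dependent order."""
    order = list(benches)
    random.Random(seed).shuffle(order)
    shared = {"value": 0}
    for bench in order:
        bench(shared)
    return shared["value"]

def outcomes(benches, seeds=range(32)):
    """Collect the distinct final values seen across many schedules."""
    return {run_shuffled(benches, seed) for seed in seeds}

def set_one(shared):  # races with set_two: the last writer wins
    shared["value"] = 1

def set_two(shared):
    shared["value"] = 2

def add_one(shared):  # commutes with add_two: order does not matter
    shared["value"] += 1

def add_two(shared):
    shared["value"] += 2

racy = outcomes([set_one, set_two])
race_free = outcomes([add_one, add_two])
```

A race-free pair yields a single outcome regardless of seed, while a racy pair is likely to yield more than one once enough seeds are tried. Recording the seed keeps any failure reproducible.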

Needs:

A test

codecov · 2024-03-23T07:11:51Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.80%. Comparing base (9ed83b6) to head (c9ab69e).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1231      +/-   ##
==========================================
+ Coverage   89.78%   89.80%   +0.02%     
==========================================
  Files          43       43              
  Lines        9923     9923              
  Branches     2395     2400       +5     
==========================================
+ Hits         8909     8911       +2     
+ Misses        822      820       -2     
  Partials      192      192


amaranth/sim/core.py

amaranth/sim/pysim.py

amaranth/sim/_pycoro.py


stafverhaegen-chipflow · 2024-03-25T12:04:02Z

A potential future improvement is a simulation mode that, instead, randomizes the scheduling of testbenches, exposing race conditions early.

To me this is a requirement for the future, not an option. One of the problems with classic RTL simulation was that it tried to impose determinism in situations where the design has a race, so the problem was not seen during simulation but the race then showed up on a real chip.
Therefore, a classical flow normally includes a post-layout verification step, in which simulations are run with back-annotated timing data to catch such situations before actually going to tape-out. I think we have to avoid that for an Amaranth-based ASIC flow.

whitequark · 2024-03-25T13:47:34Z

To me this is a requirement for the future, not an option. One of the problems with classic RTL simulation was that it tried to impose determinism in situations where the design has a race, so the problem was not seen during simulation but the race then showed up on a real chip.
Therefore, a classical flow normally includes a post-layout verification step, in which simulations are run with back-annotated timing data to catch such situations before actually going to tape-out. I think we have to avoid that for an Amaranth-based ASIC flow.

Are we talking about the same thing? The testbenches are very explicitly not a part of the design; they are a part of the test suite. There is nothing to back-annotate because testbenches are behavioral and can only be used to provide stimulus and simulate I/O.

stafverhaegen-chipflow · 2024-03-25T14:51:46Z

What I had in the back of my mind is the use of behavioral models for RTL blocks or full-custom blocks for speed reasons.

whitequark · 2024-03-25T14:52:52Z

What I had in the back of my mind is the use of behavioral models for RTL blocks or full-custom blocks for speed reasons.

These will not be using add_testbench but add_process, and add_process cannot race. (This is the entire reason for having add_process.)

stafverhaegen-chipflow · 2024-03-25T16:23:34Z

Is await sim.tick().sample(...) allowed in add_testbench behavioral models?
One may then also need non-propagated signals in these processes.

whitequark · 2024-03-25T16:25:31Z

Is await sim.tick().sample(...) allowed in add_testbench behavioral models?

Yes, await sim.tick() and everything you hang off it works exactly the same in add_process and add_testbench.

stafverhaegen-chipflow · 2024-03-25T16:26:42Z

Then I agree add_testbench is OK for these behavioral models.

whitequark added this to the 0.5 milestone Mar 23, 2024

whitequark marked this pull request as draft March 23, 2024 06:43

whitequark force-pushed the testbenchess branch from 9562d00 to b122484 Compare March 23, 2024 07:09

whitequark temporarily deployed to publish March 23, 2024 07:14 — with GitHub Actions Inactive

whitequark mentioned this pull request Mar 23, 2024

Allow visualizing delta cycles in VCD dumps #1232

Merged

whitequark force-pushed the testbenchess branch 3 times, most recently from 285473b to dc62cea Compare March 23, 2024 08:17

whitequark marked this pull request as ready for review March 23, 2024 08:18

whitequark force-pushed the testbenchess branch 2 times, most recently from 980bf44 to 520e934 Compare March 23, 2024 08:26

whitequark temporarily deployed to publish March 23, 2024 08:32 — with GitHub Actions Inactive

whitequark mentioned this pull request Mar 23, 2024

Amend RFC #36 with a concrete concurrency model amaranth-lang/rfcs#64

Merged

wanda-phi reviewed Mar 23, 2024

amaranth/sim/core.py (outdated, resolved)

wanda-phi reviewed Mar 23, 2024

amaranth/sim/pysim.py (resolved)

wanda-phi reviewed Mar 23, 2024

amaranth/sim/_pycoro.py (outdated, resolved)

whitequark force-pushed the testbenchess branch from 520e934 to f86c4cb Compare March 24, 2024 11:29

whitequark temporarily deployed to publish March 24, 2024 11:35 — with GitHub Actions Inactive

whitequark force-pushed the testbenchess branch from f86c4cb to c9ab69e Compare March 24, 2024 11:47

whitequark enabled auto-merge March 24, 2024 11:49

wanda-phi approved these changes Mar 24, 2024

whitequark added this pull request to the merge queue Mar 24, 2024

whitequark temporarily deployed to publish March 24, 2024 11:53 — with GitHub Actions Inactive

Merged via the queue into amaranth-lang:main with commit 0cb71f8 Mar 24, 2024

whitequark deleted the testbenchess branch March 24, 2024 12:00
