Open
Description
Currently our stress tests run for each commit in master, and due to the timeout of 3 hours, none of them can finish.
We need rework on the stress tests. Ideally:
- The stress tests should include the combination of [benchmark x binding x plan].
- We estimate each successful run would take more than 5h.
- Each run should be as simple as running one benchmark for a binding for a plan, e.g. fop-openjdk-semispace.
- When a run finishes, the next run should be started. The next run should pick up the latest commit, and run on that.
- When all the runs finish, we start another round.
Depends on whether it is possible to implement this with Github actions, we may change our design a bit. The principles are: 1. keep running the tests, 2. each test pick up the latest commit (not every commit), 3. if any test fails, we manually do a bisect to find which commit intorduces a bug.