Edge and impossible transitions to memory #7205
Conversation
distributed/tests/test_steal.py
Outdated
await ev.wait()
s = next(si for si in Scheduler._instances if id(si) == sched_id)
# Note that the task is async, so it runs in the same thread as the scheduler
s.reschedule("x-0", stimulus_id="steal")
As pointed out in #7200 (comment), this is not how stealing works, and using Scheduler.reschedule in this way is not intended use. If this is the only way to provoke this transition, we should remove Scheduler.reschedule (or rather make it private; users should raise the Reschedule exception instead).
I'd like us to add a more realistic reproducer showing that this can actually occur before adding this transition.
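For illustration, here is a minimal sketch of the user-facing pattern referred to above: raising the Reschedule exception from inside a task rather than calling Scheduler.reschedule directly. The cluster setup and the task body are made up for the example, not taken from this PR.

```python
# Minimal sketch: a task hands itself back to the scheduler by raising
# Reschedule; the worker reports it and the scheduler reschedules the task.
from dask.distributed import Client, Reschedule


def maybe_reschedule(x):
    if x is None:
        # Hypothetical condition under which we want the scheduler to try
        # this task again; raising Reschedule is the supported way to do it.
        raise Reschedule()
    return x * 2


if __name__ == "__main__":
    client = Client()  # assumes a throwaway local cluster for illustration
    print(client.submit(maybe_reschedule, 21).result())
```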
Unit Test Results
See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.
15 files ±0, 15 suites ±0, 6h 14m 13s ⏱️ +2m 57s
For more details on these failures, see this check.
Results for commit 8fb3dab. ± Comparison against base commit c137ac0.
♻️ This comment has been updated with latest results.
Commits 7dd2458 to d2944ef (compare)
Discussion on #7200: I'd like to remove the test altogether, merge the change untested, and open a follow-up for a proper investigation and coverage of all use cases of unexpected transitions to memory.
Follow-up issue:
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
All test failures are unrelated. Ready for review and merge.
distributed/scheduler.py
Outdated
("queued", "released"): transition_queued_released, | ||
("queued", "processing"): transition_queued_processing, | ||
("processing", "released"): transition_processing_released, | ||
("processing", "memory"): transition_processing_memory, | ||
("processing", "erred"): transition_processing_erred, | ||
("no-worker", "released"): transition_no_worker_released, | ||
("no-worker", "processing"): transition_no_worker_processing, | ||
("no-worker", "memory"): transition_no_worker_memory, | ||
("no-worker", "memory"): partial(impossible_transition, "memory"), |
What's the value of having a transition function for an impossible transition, versus updating distributed/scheduler.py, line 1853 in 5a14053:
raise RuntimeError(f"Impossible transition from {start} to {finish}")
to raise a more detailed error like you have here (including the story)?
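To make that alternative concrete, here is a small self-contained sketch (not the actual Scheduler code; the class and attribute names are invented) of a (start, finish) dispatch table whose catch-all error includes the task's recent transition story.

```python
# Toy stand-in for a scheduler-style transition table: unknown (start, finish)
# pairs raise a RuntimeError that includes the task's recent "story".
from typing import Callable, Dict, List, Tuple


class MiniStateMachine:
    def __init__(self) -> None:
        self.state: Dict[str, str] = {}
        self.story: Dict[str, List[Tuple[str, str]]] = {}
        self.transitions: Dict[Tuple[str, str], Callable[[str], None]] = {
            ("processing", "memory"): self._processing_to_memory,
        }

    def _processing_to_memory(self, key: str) -> None:
        self.state[key] = "memory"

    def transition(self, key: str, finish: str) -> None:
        start = self.state.get(key, "released")
        func = self.transitions.get((start, finish))
        if func is None:
            # Instead of a bare "impossible transition" message, include the
            # task's recent story so the unexpected request can be traced.
            raise RuntimeError(
                f"Impossible transition from {start} to {finish} for {key!r}; "
                f"story: {self.story.get(key, [])}"
            )
        func(key)
        self.story.setdefault(key, []).append((start, finish))


sm = MiniStateMachine()
sm.state["x-0"] = "processing"
sm.transition("x-0", "memory")    # allowed edge
sm.transition("x-0", "released")  # raises RuntimeError including the story
```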
The idea was to prevent the error from being silenced in the future if anybody adds a (released, memory) transition. But on second thought, I'll just overhaul the above.
Nice, thanks for figuring all this out
Only failing: