Fix infinite wait bug in test_pthread_proxying_cpp.cpp #18358

Merged · 1 commit merged into main from fix-proxying-test-deadlock on Dec 13, 2022

Conversation

tlively (Member) commented Dec 12, 2022

The tests in test_pthread_proxying_cpp proxy functions that increment local
variables. For the async proxying functions, the proxying is followed by a wait
on a condition variable so the test can assert that the proxied work was
completed. However, the incrementing of the local variables was not protected by
the lock used with the condition variable, so it was possible for the variable
to be incremented and the condition variable notified after checking the wait
condition but before waiting. Since the condition variable would not be notified
again, the test would wait on the condition variable forever.

Fix the bug by taking the lock before incrementing the variable in tests where
this could cause problems.

Fixes #18353.
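For illustration, the race described above can be reduced to the following standalone sketch (a mock-up with hypothetical names such as proxied_work, not the actual test code): a waiter blocks on a condition variable until a flag is set, while another thread sets the flag and notifies without holding the lock.

```cpp
#include <condition_variable>
#include <mutex>
#include <thread>

std::mutex mutex;
std::condition_variable cond;
int i = 0;

// Stands in for the async-proxied work: it sets the condition and notifies,
// but (as in the buggy test) without holding `mutex`.
void proxied_work() {
  i = 1;              // BUG: condition changed without holding the lock.
  cond.notify_one();  // Can fire after the waiter evaluated its predicate but
                      // before it actually blocked, so the wakeup is lost.
}

int main() {
  std::thread worker(proxied_work);
  std::unique_lock<std::mutex> lock(mutex);
  // If proxied_work runs entirely inside the window between this predicate
  // returning false and the thread blocking inside wait(), no further
  // notification ever arrives and wait() never returns.
  cond.wait(lock, [] { return i == 1; });
  worker.join();
}
```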

@tlively tlively requested a review from kripken December 12, 2022 23:21
@tlively tlively requested a review from sbc100 December 13, 2022 00:12
tlively added a commit that referenced this pull request Dec 13, 2022
It had previously been disabled because of flakes on wasm2js, but those flakes
are probably fixed by #18358.
```cpp
{
  std::unique_lock<std::mutex> lock(mutex);
  i = 3;
}
executor = std::this_thread::get_id();
cond.notify_one();
```
sbc100 (Collaborator)
You don't need to hold the lock when calling notify too?

tlively (Member, Author)
No, it's actually a pessimization to hold the lock while calling notify because then the other thread wakes up just to immediately block on acquiring the lock.

sbc100 (Collaborator)
TIL!

```cpp
{
  std::unique_lock<std::mutex> lock(mutex);
  i = 3;
}
```
sbc100 (Collaborator)
Despite reading your description, I'm having trouble seeing how putting a lock around this one assignment fixes the issue. Would making i an atomic also fix the issue?

sbc100 (Collaborator)
After reading https://stackoverflow.com/questions/17101922/do-i-have-to-acquire-lock-before-calling-condition-variable-notify-one I think I understand. I had never known this about cond vars.

tlively (Member, Author)
No, the lock is important because of the condition variable. The condition variable needs to atomically check the condition and block if the condition is not met. The mechanism for combining those into an atomic action is the lock, so if the condition can change without the lock being held, the condition variable doesn't work as intended.

kripken (Member) commented Dec 13, 2022

I thought the idea was that the outside holds the mutex so that the thread blocks until the right time, but reading it again, I think I misunderstood it before as the outside grabs the mutex after firing off the async event, so they actually race. So I'm also not sure about this change, and not sure why it was failing before...

tlively (Member, Author) commented Dec 13, 2022

Right, since the proxying is async, the proxied work races with the following code on the proxying thread. To resolve the race, we use explicit condition variables to wait for the proxied work to be completed. The bug was that we weren't using the condition variables properly. Condition variables only work if the condition is protected by the same lock passed to wait, and in this case it was not.

sbc100 (Collaborator) commented Dec 13, 2022

You can go ahead and land without waiting for the restarted sockets tests if you like.. that failure was clearly unrelated.

@tlively tlively merged commit c17542f into main Dec 13, 2022
@tlively tlively deleted the fix-proxying-test-deadlock branch December 13, 2022 00:57
tlively added a commit that referenced this pull request Dec 19, 2022
It had previously been disabled because of flakes on wasm2js, but those flakes
are probably fixed by #18358.
tlively added a commit that referenced this pull request Dec 19, 2022
It had previously been disabled because of flakes on wasm2js, but those flakes
are probably fixed by #18358.
Successfully merging this pull request may close these issues.

Flaky test: test_pthread_proxying_cpp
3 participants