gh-105699: Use a Linked List for the PyModuleDef Cache #106899
Conversation
I'm not sure the assumption that a linked list is good enough because the number of extensions is small is actually warranted, and I think we should definitely double-check the performance on systems with non-trivial numbers of modules. Not using a dict for the cache avoids some problems, but replacing it with a linear search is very likely to have a bad performance hit.
Since the problem being fixed only manifests itself with isolated subinterpreters, I would rather live with the race in 3.12 than rush in a big performance regression, even if it's just for imports of extension modules.
_PyInterpreterState_Main());
/* The lock is initialized directly with the general runtime state. */
assert(EXTENSIONS.mutex != NULL);
//assert(EXTENSIONS.head == NULL);
Leftover line I presume.
Sort of. I had expected the assert to work (and it would in a simpler world). I was going to circle back to it but, having slept on it, don't see the need. It can be dropped.
@@ -919,167 +919,134 @@ extensions_lock_release(void)
static void
_extensions_cache_init(void)
Why have this function at all, now that it's just a single assert?
Honestly, I figured I'd end up finding a better solution than a linked list that would revive the need for this function, so I kept it around. The point is a bit moot now though. 😄
goto finally;
}
strcpy(cached->filename, PyUnicode_AsUTF8(filename));
Use strncpy to avoid the deprecation in some compilers (macOS IIRC?)
}
res = 0;

/* Destroy the cache data. */
Shouldn't this DECREF `found->def`?
good catch
Understood. I was looking for a quick fix and mostly played a hunch that the performance impact would be minimal. I don't think I'm wrong but agree there isn't enough time to properly validate my hypothesis. Thanks for speaking up. I do plan on finding an alternative that doesn't rely on a Python object, to address the crasher.
This fixes a crasher due to a race condition, triggered infrequently when two isolated (own-GIL) subinterpreters simultaneously initialize their sys or builtins modules. The crash happened due to the combination of the "detached" thread state we were using and the "last holder" logic we use for the GIL. It turns out it's tricky to use the same thread state for different threads. Who could have guessed? <wink/>
We solve the problem by eliminating the one object we were still sharing between interpreters. We replace it with a linked list, using the "raw" allocator to avoid tying it to the main interpreter. We assume that there will be few enough legacy extension modules loaded that the O(n) operations won't cause too much overhead.
We also remove the accommodations for "detached" thread states, which were a dubious idea to start with.