gh-127750: Fix singledispatchmethod caching (v2) #128648

eendebakpt · 2025-01-08T20:25:25Z

Version based on an idea from @dg-pb in #127839. This version:

- Fixes the issue of collisions between different objects with equal __hash__/__eq__ (Regression in Django with singledispatchmethod on models #127750)
- Fixes the issue of keeping object instances alive
- Adds two regression tests
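To illustrate the first point, here is a minimal sketch of how a cache keyed by the object itself can collide when two distinct instances compare equal. The `Model` class and the cached strings are hypothetical stand-ins, not code from this PR; the only real API used is `weakref.WeakKeyDictionary`, which is what the removed code in `Lib/functools.py` used for `_method_cache`.

```python
import weakref


class Model:
    """Hypothetical stand-in for an ORM model that compares by primary key."""

    def __init__(self, pk):
        self.pk = pk

    def __eq__(self, other):
        return isinstance(other, Model) and self.pk == other.pk

    def __hash__(self):
        return hash(self.pk)


# A cache keyed by the instance, as on main before this PR.
cache = weakref.WeakKeyDictionary()

a = Model(1)
b = Model(1)  # a different object that is equal to `a`

cache[a] = "method bound to a"

# Key lookup uses __hash__/__eq__, so `b` finds the entry stored for `a`.
collided = cache.get(b)
assert a is not b
assert collided == "method bound to a"
```

Storing the cached method on the instance itself sidesteps this, because the lookup no longer depends on the instance's equality semantics.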

There is still a cache (stored on the object instances). Quick benchmark (Windows, non-PGO build):

bench singledispatchmethod: Mean +- std dev: [main] 798 ns +- 64 ns -> [prx] 495 ns +- 38 ns: 1.61x faster

Benchmark hidden because not significant (1): bench singledispatchmethod slots

Geometric mean: 1.26x faster

(note that the alternative to this PR is not to keep main, but to revert #107148)

Issue: Regression in Django with singledispatchmethod on models #127750

dg-pb · 2025-01-08T21:04:34Z

Lib/functools.py

-        import weakref # see comment in singledispatch function
-        self._method_cache = weakref.WeakKeyDictionary()
+    def __set_name__(self, obj, name):
+        self.attrname = name


Check cached_property.__set_name__; it has some more stuff in it that might be needed here as well.

Hmm. The additions there prevent something like this:

    @dataclass(frozen=True)
    class A:
        value: int

        @singledispatchmethod
        def dispatch(self, x):
            return id(self)

        renamed_dispatch = dispatch  # allowed? if so, how should it behave

The corresponding test for cached_property is in cpython/Lib/test/test_functools.py (line 3315 in 34e840f):

    def test_reuse_different_names(self):

But on current main renaming is allowed for the singledispatchmethod.
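For comparison, the `cached_property` restriction discussed here can be observed with a small sketch. The class `A` and the helper `build_class` are hypothetical names for illustration. Depending on the Python version, the error raised from `__set_name__` surfaces either directly as a `TypeError` or wrapped in a `RuntimeError`, so the sketch catches both.

```python
from functools import cached_property


def build_class():
    """Define a class that binds one cached_property under two names."""

    class A:
        @cached_property
        def value(self):
            return 42

        # Second binding of the same descriptor under a different name.
        renamed_value = value

    return A


try:
    build_class()
except (TypeError, RuntimeError) as exc:
    # cached_property.__set_name__ rejects the second, different name.
    error = exc
else:
    error = None

assert error is not None
```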

I am not sure what the desired behavior is here (and why).

If this implementation is desirable, maybe later someone who knows more about this can comment.

As far as I know, the only reason cached properties can't be renamed is because the cache is keyed by the attribute's name.

Allowing a rebind would disconnect the cached property from its cached value.

Actually, I think you might want to either ignore renames or do something along these lines (ignoring error handling):

    if self.attrname:
        cache[name] = cache.pop(self.attrname)
    self.attrname = name

As far as I know, each binding shares the same instance of the descriptor, so as long as the cache key is constant, it should work no matter how many times it's been renamed.
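A minimal sketch of the "ignore renames" option, under the assumption that the cache lives in the instance `__dict__` as in this PR. The descriptor name `PerInstanceCached` and the class `A` are hypothetical, and the sketch binds a plain function rather than performing single dispatch; it only shows that a constant cache key keeps every alias pointing at the same cached object.

```python
class PerInstanceCached:
    """Hypothetical non-data descriptor caching its bound callable per instance."""

    def __init__(self, func):
        self.func = func
        self.attrname = None

    def __set_name__(self, owner, name):
        # Ignore renames: the first name stays the constant cache key.
        if self.attrname is None:
            self.attrname = name

    def __get__(self, obj, cls=None):
        if obj is None:
            return self
        cache = obj.__dict__
        try:
            return cache[self.attrname]
        except KeyError:
            bound = self.func.__get__(obj, cls)
            cache[self.attrname] = bound
            return bound


class A:
    @PerInstanceCached
    def dispatch(self, x):
        return (id(self), x)

    renamed_dispatch = dispatch  # same descriptor instance, second name


a = A()
via_alias = a.renamed_dispatch  # goes through __get__, fills cache["dispatch"]
via_name = a.dispatch           # served from the instance __dict__

assert via_alias is via_name
assert a.dispatch(1) == (id(a), 1)
```

Note that the cached bound method refers back to the instance that stores it, which is the kind of reference loop raised later in the review.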

> Allowing a rebind would disconnect the cached property from its cached value.

This is kind of the same situation.

If renaming is allowed, then it would simply cache to the last attrname. The drawback is that there is a small risk of unused cached methods.

I think it might be most straightforward to copy and paste cached_property.__set_name__. It does seem a sensible restriction. It comes at the expense of flexibility, but personally, I have never run into that TypeError.

Also, it will be easier to address changes/improvements when 2 implementations that use the same caching approach are aligned.

serhiy-storchaka · 2025-02-06T21:51:48Z

Lib/functools.py

-        if self._method_cache is not None:
-            self._method_cache[obj] = _method
+        if cache is not None:
+            cache[self.attrname] = _method


Doesn't it create a reference loop? obj refers to cache, cache refers to _method, _method refers to a cell which refers to obj.

Yes. But once there are no more external references to the object obj, the garbage collector removes the objects. (The cache is on the object obj, not on the singledispatchmethod itself or the class.)

In the current main the caching is done on the singledispatchmethod which keeps the generated methods alive.

Yes, the current situation is worse, it creates strong references singledispatchmethod -> _method -> obj.

Relying on garbage collection is not good. This particular loop can be broken by using a weak reference to obj instead of obj. But a reference from a bound method to the object should be strong, otherwise some code will not work (there was a similar issue with TemporaryFile).

I am not sure how much this optimization saves. Are there other ways to achieve the same speed up, without creating reference loops?
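The loop described in this thread can be reproduced with a small sketch. The names `Obj` and `make_method` are hypothetical; the closure stands in for the `_method` function whose cell refers to obj. The behaviour shown assumes CPython, where objects without cycles are freed by reference counting and cycles are left to the cyclic garbage collector.

```python
import gc
import weakref


class Obj:
    pass


def make_method(obj):
    # The closure cell keeps a strong reference to obj.
    def _method(x):
        return (id(obj), x)

    return _method


was_enabled = gc.isenabled()
gc.disable()  # keep automatic collection from interfering with the demo
try:
    o = Obj()
    # obj -> __dict__ -> _method -> cell -> obj
    o.__dict__["dispatch"] = make_method(o)
    ref = weakref.ref(o)

    del o
    alive_before_gc = ref() is not None  # the cycle keeps the object alive

    gc.collect()
    alive_after_gc = ref() is not None   # the cyclic collector reclaims it
finally:
    if was_enabled:
        gc.enable()

assert alive_before_gc
assert not alive_after_gc
```

The object is eventually reclaimed, but only when the collector runs, which is the concern raised above about relying on garbage collection.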

eendebakpt · 2025-03-05T07:52:43Z

Closing in favor of #130008

eendebakpt and others added 6 commits January 8, 2025 21:14

add singledispatchmethod test for objects with equal hash and ==

b9a05dc

add singledispatchmethod test for dangling references

0714c06

use id for object cache in singledispatchmethod

ef03f2d

📜🤖 Added by blurb_it.

6c96b9d

rework using __dict__

f282ab8

remove comments

89a4a5c

eendebakpt requested a review from rhettinger as a code owner January 8, 2025 20:25

eendebakpt marked this pull request as draft January 8, 2025 20:25

bedevere-app bot added the awaiting review label Jan 8, 2025

bedevere-app bot mentioned this pull request Jan 8, 2025

Regression in Django with singledispatchmethod on models #127750

Closed

bedevere-app bot removed the awaiting review label Jan 8, 2025

eendebakpt added 3 commits January 8, 2025 21:36

avoid exception

ecc3e82

whitespace

efd4034

whitespace

88b5e90

eendebakpt marked this pull request as ready for review January 8, 2025 20:50

bedevere-app bot added the awaiting review label Jan 8, 2025

Merge branch 'main' into singledispatchmethod_caching_v2

1885278

dg-pb reviewed Jan 8, 2025

serhiy-storchaka reviewed Feb 6, 2025

eendebakpt closed this Mar 5, 2025

dg-pb mentioned this pull request Mar 5, 2025

gh-127750: Fix and optimize functools.singledispatchmethod() #130008

Merged
