[ty] Rewrite `Type::any_over_type` using a new generalised `TypeVisitor` trait #19094

AlexWaygood · 2025-07-02T15:05:37Z

Summary

Our Type::any_over_type() method has two problems:

There are several types where it does not recurse into all nested types!
It doesn't guard against recursion at all. This isn't much of a problem now, but becomes more of a problem when we introduce more recursive types, such as in [ty] Implement equivalence for protocols with method members #18659

This PR introduces a new TypeVisitor trait that aims to solve both of these issues in a generalized way:

Walking nested types is delegated down to walk_*_type functions that live close to the structs they walk. For example, walking all nested types in a NominalInstanceType is handled by a walk_nominal_instance_type function that lives next to the NominalInstanceType struct in instance.rs. This makes it less likely that we'll forget to update the function if we add a new field to NominalInstanceType in the future.
The trait makes it easy to write concrete implementations that guard against recursion in an efficient way.

Type::any_over_type is reimplemented as a standalone any_over_type function that uses a concrete implementation of the new TypeVisitor trait as its implementation.

If we like the direction this PR is going in, we might want to rename the existing TypeVisitor struct that @carljm added in #19003. We may also want to reimplement Type::normalize and Type::apply_type_mapping using a similar TypeTransformer trait, but I'll defer to @carljm on that point, since he has ongoing work in this area.

Test Plan

All existing tests pass.

…or` trait

github-actions · 2025-07-02T15:14:55Z

`mypy_primer` results

Changes were detected when running on open source projects

Expression (https://github.com/cognitedata/Expression)
-     memo fields = ~54MB
+     memo fields = ~49MB

operator (https://github.com/canonical/operator)
- TOTAL MEMORY USAGE: ~97MB
+ TOTAL MEMORY USAGE: ~106MB
-     memo fields = ~80MB
+     memo fields = ~88MB

hydra-zen (https://github.com/mit-ll-responsible-ai/hydra-zen)
- TOTAL MEMORY USAGE: ~80MB
+ TOTAL MEMORY USAGE: ~88MB

rich (https://github.com/Textualize/rich)
- TOTAL MEMORY USAGE: ~142MB
+ TOTAL MEMORY USAGE: ~129MB

discord.py (https://github.com/Rapptz/discord.py)
- TOTAL MEMORY USAGE: ~228MB
+ TOTAL MEMORY USAGE: ~251MB

pydantic (https://github.com/pydantic/pydantic)
- TOTAL MEMORY USAGE: ~156MB
+ TOTAL MEMORY USAGE: ~142MB

mkosi (https://github.com/systemd/mkosi)
-     memo fields = ~106MB
+     memo fields = ~97MB

vision (https://github.com/pytorch/vision)
- TOTAL MEMORY USAGE: ~368MB
+ TOTAL MEMORY USAGE: ~334MB

bandersnatch (https://github.com/pypa/bandersnatch)
-     memo fields = ~66MB
+     memo fields = ~72MB

paasta (https://github.com/yelp/paasta)
- TOTAL MEMORY USAGE: ~189MB
+ TOTAL MEMORY USAGE: ~207MB

psycopg (https://github.com/psycopg/psycopg)
- TOTAL MEMORY USAGE: ~228MB
+ TOTAL MEMORY USAGE: ~207MB

sphinx (https://github.com/sphinx-doc/sphinx)
- TOTAL MEMORY USAGE: ~304MB
+ TOTAL MEMORY USAGE: ~276MB

carljm

The code here looks nice. My main hesitation here is that I'm not totally clear that any_over_type is even a clearly defined operation for all types (see inline comment), and it's only currently used in deciding whether to issue a redundant-cast diagnostic. This seems like a lot of machinery to introduce for that small edge case, in the absence of a clear understanding (which I at least don't currently have) of the semantics of the generalized operation, and the future use cases for it.

I'm tempted to say that we should instead try to remove the need for any_over_type (as we already did for is_fully_static), and aim to avoid the need for these generalized recursive type walk tests entirely. To me, the ecosystem report on #19099 suggests that this is feasible. Removing the use of any_over_type entirely does not fail any existing tests or introduce many new diagnostics, and it seems like a fairly limited effort to address some Todos would remove most of the diagnostics it does introduce. (Plus, I don't think the current existence of todos and the desire to silence false positives arising from them should drive significant design decisions.)

I do think this PR is useful regardless, because I think a lot of it could be reused for a similar TypeTransformer trait, and that's something I do think we will need.

carljm · 2025-07-02T17:49:49Z

crates/ty_python_semantic/src/lib.rs


 type FxOrderSet<V> = ordermap::set::OrderSet<V, BuildHasherDefault<FxHasher>>;
 type FxIndexMap<K, V> = indexmap::IndexMap<K, V, BuildHasherDefault<FxHasher>>;
+type FxIndexSet<V> = indexmap::IndexSet<V, BuildHasherDefault<FxHasher>>;


I wonder if some of our existing uses of FxOrderSet should actually be FxIndexSet, if we don't need the stronger equality and order-maintenance guarantees provided by the ordermap wrapper crate? Not something for this PR, though.

I don't think an FxIndexSet is hashable, so I think we do need to use FxOrderSet for the fields on IntersectionType, for example. But yeah, it does look like a bunch of places are using an OrderSet right now when they probably only really need an IndexSet.

carljm · 2025-07-02T18:00:09Z

crates/ty_python_semantic/src/types.rs

+fn walk_pep_695_type_alias<'db, V: visitor::TypeVisitor<'db> + ?Sized>(
+    _db: &'db dyn Db,
+    _type_alias: PEP695TypeAliasType<'db>,
+    _visitor: &mut V,
+) {
+}
+


These are the sorts of cases that made me somewhat hesitant to implement a generic walk-types facility (or at least, unclear how it should be implemented). Is the RHS of a PEP695 type alias "part of" its type? It's certainly relevant to the meaning of the type. Is the answer the same for every possible type walk? I would probably expect any_over_type to descend into the RHS, but I'm not sure what a type transformer could be expected to do here, unless we implemented a "Synthesized" variant of PEP 695 type aliases. (Maybe we will need that in order to support recursive type aliases?)

It seems somewhat parallel to the case of class-defined (non-synthesized) protocols, where the semantics of the type includes details that require further queries and aren't stored directly in the type itself.

I think this is fine for this PR, just musing.

Follow-up from in-person discussion: this should be fixed to walk the RHS

carljm · 2025-07-02T19:09:31Z

Thank you for putting this together! I think my current feeling is we should go with #19099 instead.

carljm

After discussing in person, I'm convinced that we will likely have other future uses for this. I don't love that we do this potentially-expensive walk on two different types every time we see a cast, but it doesn't seem in practice like that's causing noticeable perf issues on any ecosystem project; we can deal with it later if it comes up.

So this looks good to me, modulo the PEP 695 type aliases fix.

carljm · 2025-07-03T16:41:46Z

Oh, and I do think we should rename my existing TypeVisitor to avoid confusion, either in this PR or as a follow-up. It could already be called TypeTransformer maybe?

AlexWaygood requested review from carljm, dcreager and sharkdp as code owners July 2, 2025 15:05

AlexWaygood force-pushed the alex/type-visitor branch from ec4e693 to 6034f2a Compare July 2, 2025 15:07

AlexWaygood added internal An internal refactor or improvement ty Multi-file analysis & type inference labels Jul 2, 2025

[ty] Rewrite Type::any_over_type using a new generalised `TypeVisit…

1aa2fe0

…or` trait

AlexWaygood force-pushed the alex/type-visitor branch from 6034f2a to 1aa2fe0 Compare July 2, 2025 15:11

AlexWaygood mentioned this pull request Jul 2, 2025

[ty] Implement equivalence for protocols with method members #18659

Merged

carljm reviewed Jul 2, 2025

View reviewed changes

AlexWaygood mentioned this pull request Jul 3, 2025

[ty] remove any_over_type #19099

Closed

carljm approved these changes Jul 3, 2025

View reviewed changes

rename other TypeVisitor, fix walking of PEP-695 aliases

737afeb

AlexWaygood force-pushed the alex/type-visitor branch from 9f3412b to 737afeb Compare July 3, 2025 18:15

AlexWaygood enabled auto-merge (squash) July 3, 2025 18:16

AlexWaygood merged commit 333191b into main Jul 3, 2025
35 checks passed

AlexWaygood deleted the alex/type-visitor branch July 3, 2025 18:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ty] Rewrite `Type::any_over_type` using a new generalised `TypeVisitor` trait #19094

[ty] Rewrite `Type::any_over_type` using a new generalised `TypeVisitor` trait #19094

Uh oh!

AlexWaygood commented Jul 2, 2025

Uh oh!

github-actions bot commented Jul 2, 2025 •

edited

Loading

Uh oh!

carljm left a comment •

edited

Loading

Uh oh!

carljm Jul 2, 2025

Uh oh!

AlexWaygood Jul 3, 2025

Uh oh!

carljm Jul 2, 2025

Uh oh!

carljm Jul 3, 2025

Uh oh!

carljm commented Jul 2, 2025

Uh oh!

carljm left a comment

Uh oh!

carljm commented Jul 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ty] Rewrite Type::any_over_type using a new generalised TypeVisitor trait #19094

[ty] Rewrite Type::any_over_type using a new generalised TypeVisitor trait #19094

Uh oh!

Conversation

AlexWaygood commented Jul 2, 2025

Summary

Test Plan

Uh oh!

github-actions bot commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

mypy_primer results

Uh oh!

carljm left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

carljm Jul 2, 2025

Choose a reason for hiding this comment

Uh oh!

AlexWaygood Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

carljm Jul 2, 2025

Choose a reason for hiding this comment

Uh oh!

carljm Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

carljm commented Jul 2, 2025

Uh oh!

carljm left a comment

Choose a reason for hiding this comment

Uh oh!

carljm commented Jul 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ty] Rewrite `Type::any_over_type` using a new generalised `TypeVisitor` trait #19094

[ty] Rewrite `Type::any_over_type` using a new generalised `TypeVisitor` trait #19094

github-actions bot commented Jul 2, 2025 •

edited

Loading

`mypy_primer` results

carljm left a comment •

edited

Loading