Fix/#13070 defer annotations when future is active #13395

Slyces · 2024-09-18T16:05:41Z

Summary

This PR tries to provide support for deferred resolution of type-hints in files with from __future__ import annotations. Fixes #13070.

Implementation

Currently, the method is_stub is already used in multiple places to resolve deferred annotations. As all current usage of is_stubs are dedicated to deferred type inference, I decided in this implementation to refactor is_stubs && has_future_annotations together in are_all_types_deferred.
We can also keep them separate and always test them together.

The main difficulty is how to know if that flag is active. I tried an implementation that checks this when building the semantic index. I do not know if it's the best choice (as SemanticIndex doesn't have any attribute similar to this), but the building of the semantic index already parses every statement in the file, which was ideal to find this flag.

Happy to get any feedback on a better spot for this if you can think of one.

Test Plan

There is currently one test addressing deferred annotations resolution, focused on builtins. I don't quite understand why builtins require deferred annotations, so I focused on test cases I encounter when coding in python.

In my experience, the two most common occurrences requiring deferred annotations (in regular code) are:

Referencing a symbol before it's defined
Referencing a class inside one of its own methods

Some testing led me to find that we don't currently support type resolution for methods, so I only tested the case of referencing a symbol before its definition.

I added 3 separate test case, all with the same base code:

def get_foo() -> Foo: ...
class Foo: ...
foo = get_foo()  # Resolved if and only if deferred annotations are active

Inside a source file (*.py, foo must not resolve (Unknown)
Inside a stub file (*.pyi), foo must resolve to Foo
Inside a source file with future.annotations, foo must resolve to Foo

Fixes astral-sh#13070

…tations`

codspeed-hq · 2024-09-18T16:11:36Z

CodSpeed Performance Report

Merging #13395 will degrade performances by 4.26%

_{Comparing Slyces:fix/#13070-defer-annotations-when-future-is-active (301d065) with main (d3530ab)}

Summary

❌ 1 regressions
✅ 31 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`main`	`Slyces:fix/#13070-defer-annotations-when-future-is-active`	Change
❌	`red_knot_check_file[incremental]`	2.9 ms	3 ms	-4.26%

github-actions · 2024-09-18T16:19:23Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Slyces · 2024-09-18T16:36:37Z

I just realised that the performance issue can probably be fixed by relying on the fact that from __future__ import annotations should always be the first import in a file - I'll try to get that to work.

carljm

This is fantastic, thank you!!

The semantic index builder is the right place to look for this, and the semantic index is the right place to put it. Any analysis that we can do without requiring a dependency on other files, we want to do in semantic indexing.

I suspect the performance regression on the incremental check is because tomllib (which we are using for our benchmark) does use from __future__ import annotations, and deferred types require an extra Salsa query and thus increase Salsa overhead on incremental check.

So I suspect that requiring from __future__ import annotations to be the first statement in a file won't help with the performance.

And I'm not sure if we even want to require it. A mis-placed from __future__ import annotations won't work at runtime (the module won't even import), but for our purposes I think we should assume the user intended for it to take effect, and we should still check the code assuming its use. Otherwise accidentally adding a line before your from __future__ import annotations could suddenly result in a ton of bogus diagnostics on your forward-reference annotations, which isn't really useful.

I think we should add a comment to this effect, where we check for from __future__ import annotations in the semantic index builder.

And I think we should emit a diagnostic ~~in type inference~~ if we run across a misplaced __future__ import. But this is pretty much a separate thing and could be its own PR. (EDIT: I actually think probably it shouldn't be done in type inference; misplaced __future__ imports are a syntax-level error that isn't really part of type-checking. So we'll want this diagnostic, but probably not in type inference, and definitely not in this PR.)

crates/red_knot_python_semantic/src/types/infer.rs

crates/red_knot_python_semantic/src/semantic_index/builder.rs

Slyces · 2024-09-18T17:27:47Z

So I suspect that requiring from future import annotations to be the first statement in a file won't help with the performance.

That's somewhat of a relief (if it's true) because the implementation I have so far is not ideal. I will still try to go through with it to make sure that it is not the source of the regression

…nnotations

carljm · 2024-09-18T18:02:56Z

Some testing led me to find that we don't currently support type resolution for methods

This is true, though the spot in the code you linked is for supporting attribute access of attributes on function objects. To support method calls, I think what we are missing is understanding attribute access on instances of classes (as opposed to class objects):

ruff/crates/red_knot_python_semantic/src/types.rs

Line 460 in c173ec5

// TODO MRO? get_own_instance_member, get_instance_member

carljm · 2024-09-18T18:04:16Z

I will still try to go through with it to make sure that it is not the source of the regression

If this is a lot of work, an easier way to check the performance impact would be to see how performance on the benchmark fares if you just consider from __future__ import annotations to always be true, without even looking for it in semantic index.

Slyces · 2024-09-18T18:09:59Z

While this is not related to the PR, I did waste some time on the difference between "expected Unknown" (e.g. resolution failed) vs. "Not implemented Unknown" (e.g. this case should yield a value for the current AST but we didn't code it yet).

Would it be worth considering a temporary value (until red-knot is feature complete or almost so) that would make this difference obvious? To be fair I'll also be more careful next time now that I know 🙂

carljm · 2024-09-18T18:24:25Z

Would it be worth considering a temporary value (until red-knot is feature complete or almost so) that would make this difference obvious?

That's an interesting idea! I think we could have Type::Unknown contain another enum indicating the source of the unknown (mypy does something similar, @AlexWaygood has mentioned this before). Then most code handling Unknowns wouldn't have to care (we wouldn't have to add a new branch in every place that handles types), but we could display them differently.

This seems like a net positive to me! Not sure I would get to it soon, but if you're interested in doing it, I'd certainly consider a PR.

AlexWaygood · 2024-09-18T18:27:24Z

That's an interesting idea! I think we could have Type::Unknown contain another enum indicating the source of the unknown (mypy does something similar, @AlexWaygood has mentioned this before). Then most code handling Unknowns wouldn't have to care (we wouldn't have to add a new branch in every place that handles types), but we could display them differently.

Yes, see #12986 for previous discussion! I think the main concern with that PR was how to handle the inner enum in unions and intersections.

MichaReiser

This is great. Thanks you

crates/red_knot_python_semantic/src/semantic_index/builder.rs

…nnotations

MichaReiser · 2024-09-19T08:11:07Z

It's a bit a problem that I can't acknowledge the codspeed regression because the URl doesn't work haha

Okay, there's a way. Navigate to the runs page and open the results from there https://codspeed.io/astral-sh/ruff/runs/66ebdbcf6ba711013f9a14a5

Slyces added 2 commits September 18, 2024 17:34

[red-knot] feat: resolve deferred type hints with __future__ annotations

7cefe18

Fixes astral-sh#13070

[red-knot] refactor: condense usages of is_stubs & `has_future_anno…

75bed69

…tations`

Slyces requested review from carljm, MichaReiser and AlexWaygood as code owners September 18, 2024 16:05

AlexWaygood added the red-knot Multi-file analysis & type inference label Sep 18, 2024

carljm approved these changes Sep 18, 2024

View reviewed changes

crates/red_knot_python_semantic/src/types/infer.rs Outdated Show resolved Hide resolved

carljm reviewed Sep 18, 2024

View reviewed changes

crates/red_knot_python_semantic/src/semantic_index/builder.rs Outdated Show resolved Hide resolved

fixup! [red-knot] feat: resolve deferred type hints with __future__ a…

e4e0331

…nnotations

MichaReiser approved these changes Sep 19, 2024

View reviewed changes

crates/red_knot_python_semantic/src/semantic_index/builder.rs Outdated Show resolved Hide resolved

crates/red_knot_python_semantic/src/semantic_index/builder.rs Show resolved Hide resolved

fixup! [red-knot] feat: resolve deferred type hints with __future__ a…

301d065

…nnotations

MichaReiser merged commit a8d9104 into astral-sh:main Sep 19, 2024
19 of 20 checks passed

Slyces deleted the fix/#13070-defer-annotations-when-future-is-active branch September 19, 2024 08:13

Slyces mentioned this pull request Sep 24, 2024

Feat/unknown kinds #13500

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/#13070 defer annotations when future is active #13395

Fix/#13070 defer annotations when future is active #13395

Slyces commented Sep 18, 2024 •

edited

Loading

codspeed-hq bot commented Sep 18, 2024 •

edited

Loading

github-actions bot commented Sep 18, 2024 •

edited

Loading

Slyces commented Sep 18, 2024

carljm left a comment •

edited

Loading

Slyces commented Sep 18, 2024

carljm commented Sep 18, 2024

carljm commented Sep 18, 2024

Slyces commented Sep 18, 2024

carljm commented Sep 18, 2024

AlexWaygood commented Sep 18, 2024 •

edited

Loading

MichaReiser left a comment

MichaReiser commented Sep 19, 2024

Fix/#13070 defer annotations when future is active #13395

Fix/#13070 defer annotations when future is active #13395

Conversation

Slyces commented Sep 18, 2024 • edited Loading

Summary

Implementation

Test Plan

codspeed-hq bot commented Sep 18, 2024 • edited Loading

CodSpeed Performance Report

Merging #13395 will degrade performances by 4.26%

Summary

Benchmarks breakdown

github-actions bot commented Sep 18, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

Slyces commented Sep 18, 2024

carljm left a comment • edited Loading

Choose a reason for hiding this comment

Slyces commented Sep 18, 2024

carljm commented Sep 18, 2024

carljm commented Sep 18, 2024

Slyces commented Sep 18, 2024

carljm commented Sep 18, 2024

AlexWaygood commented Sep 18, 2024 • edited Loading

MichaReiser left a comment

Choose a reason for hiding this comment

MichaReiser commented Sep 19, 2024

Slyces commented Sep 18, 2024 •

edited

Loading

codspeed-hq bot commented Sep 18, 2024 •

edited

Loading

github-actions bot commented Sep 18, 2024 •

edited

Loading

`ruff-ecosystem` results

carljm left a comment •

edited

Loading

AlexWaygood commented Sep 18, 2024 •

edited

Loading