[`flake8-comprehensions`] Fix `C420` to prepend whitespace when needed #18616

robsdedude · 2025-06-10T19:55:33Z

Summary

This PR fixes rule C420's fix. The fix replaces {...} with dict....(...). Therefore, if there is any identifier or such right before the fix, the fix will fuse that previous token with dict....

The example in the issue is

0 or{x: None for x in "x"}
# gets "fixed" to
0 ordict.fromkeys(iterable)

Related Issues

Fixes: #18599

github-actions · 2025-06-10T20:17:12Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

robsdedude · 2025-06-10T20:20:49Z

crates/ruff_python_parser/src/lexer.rs

 /// identifier is ASCII-only or not by mutably altering a reference to a
 /// boolean value passed in.
-fn is_identifier_continuation(c: char, identifier_is_ascii_only: &mut bool) -> bool {
+pub fn is_identifier_continuation(c: char, identifier_is_ascii_only: &mut bool) -> bool {


Not sure you're fine with this PR increasing the coupling of the crates. However, I didn't want to duplicate that logic. But maybe there's even a better general approach for fixing the issue that works on a higher logical level than chars.

This is a neat find, I didn't know about this function before.

I think I would have instead reached for something more like was done in #17648 (checking if two token ranges are adjacent). Would that work here too?

I'm not totally against using the lexer function if not, but I'm not sure we really want to make it pub like you said.

Could we use edits::pad_start instead or is_identifier. I'd prefer to not make this function public

pad_start sounds exactly like what we want... However, looking at its implementation I suspect (not tested yet) that it's implemented incorrectly. It only considers is_ascii_alphabetic. Python's identifiers allow much more than ASCII alphabetic characters, which is exactly why I reached for is_identifier_continuation. How do you suggest to proceed? My gut feeling is

use pad_start for this PR

open a separate issue to discuss fixing pad_start

What is_identifier are you referring to? There are multiple functions with that name in the repo 😇 If you mean ruff_python_stdlib::identifiers::is_identifier, I suppose we could, but it'd require looking at more than just the preceding character which might only be a valid identifier continuation, but not a start (e.g., a digit). Sounds more costly and error prone.

Well, maybe in this case it's correct because we only care about preceding keywords, which only are contain ASCII alphabetic chars.

Yeah, after looking at some examples in the code-base as well as Python's grammar for expressions, I'm fairly convinced, that pad, pad_start, and pad_end are fine as long as all edits that are padded are replacing a full expression as those cannot be surrounded by arbitrary identifiers as far as I can tell.

ntBre

Thanks! This looks good overall, just a couple of suggestions.

ntBre · 2025-06-18T18:40:51Z

crates/ruff_python_parser/src/lexer.rs

 /// identifier is ASCII-only or not by mutably altering a reference to a
 /// boolean value passed in.
-fn is_identifier_continuation(c: char, identifier_is_ascii_only: &mut bool) -> bool {
+pub fn is_identifier_continuation(c: char, identifier_is_ascii_only: &mut bool) -> bool {


This is a neat find, I didn't know about this function before.

I think I would have instead reached for something more like was done in #17648 (checking if two token ranges are adjacent). Would that work here too?

I'm not totally against using the lexer function if not, but I'm not sure we really want to make it pub like you said.

..._linter/src/rules/flake8_comprehensions/rules/unnecessary_dict_comprehension_for_iterable.rs

ntBre

Thanks! This looks really nice with pad_start. Thanks for double checking that it was working correctly too.

astral-sh#18616)  ## Summary This PR fixes rule C420's fix. The fix replaces `{...}` with `dict....(...)`. Therefore, if there is any identifier or such right before the fix, the fix will fuse that previous token with `dict...`. The example in the issue is ```python 0 or{x: None for x in "x"} # gets "fixed" to 0 ordict.fromkeys(iterable) ``` ## Related Issues Fixes: astral-sh#18599

[flake8_comprehensions] Fix C420 to prepend whitespace when needed

bdf0ffb

robsdedude force-pushed the fix/18599-C420-prepend-padding branch from 9dae719 to bdf0ffb Compare June 10, 2025 20:01

robsdedude commented Jun 10, 2025

View reviewed changes

robsdedude marked this pull request as ready for review June 10, 2025 20:20

robsdedude requested review from MichaReiser and dhruvmanila as code owners June 10, 2025 20:21

ntBre added bug Something isn't working fixes Related to suggested fixes for violations labels Jun 10, 2025

ntBre reviewed Jun 18, 2025

View reviewed changes

robsdedude added 2 commits June 28, 2025 01:00

Use existing edits::pad_start instead

a9c405d

Merge branch 'main' into fix/18599-C420-prepend-padding

63101df

robsdedude requested a review from ntBre June 27, 2025 23:11

ntBre approved these changes Jun 30, 2025

View reviewed changes

ntBre changed the title ~~[flake8_comprehensions] Fix C420 to prepend whitespace when needed~~ [flake8-comprehensions] Fix C420 to prepend whitespace when needed Jun 30, 2025

ntBre merged commit 34052a1 into astral-sh:main Jun 30, 2025
35 checks passed

robsdedude deleted the fix/18599-C420-prepend-padding branch June 30, 2025 19:00

BrewTestBot mentioned this pull request Jul 3, 2025

ruff 0.12.2 Homebrew/homebrew-core#228969

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[`flake8-comprehensions`] Fix `C420` to prepend whitespace when needed #18616

[`flake8-comprehensions`] Fix `C420` to prepend whitespace when needed #18616

Uh oh!

robsdedude commented Jun 10, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jun 10, 2025 •

edited

Loading

Uh oh!

robsdedude Jun 10, 2025

Uh oh!

ntBre Jun 18, 2025

Uh oh!

MichaReiser Jun 23, 2025

Uh oh!

robsdedude Jun 27, 2025

Uh oh!

robsdedude Jun 27, 2025 •

edited

Loading

Uh oh!

robsdedude Jun 27, 2025

Uh oh!

ntBre left a comment

Uh oh!

ntBre Jun 18, 2025

Uh oh!

Uh oh!

ntBre left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[flake8-comprehensions] Fix C420 to prepend whitespace when needed #18616

[flake8-comprehensions] Fix C420 to prepend whitespace when needed #18616

Uh oh!

Conversation

robsdedude commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Related Issues

Uh oh!

github-actions bot commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ruff-ecosystem results

Linter (stable)

Linter (preview)

Uh oh!

robsdedude Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

ntBre Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

MichaReiser Jun 23, 2025

Choose a reason for hiding this comment

Uh oh!

robsdedude Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

robsdedude Jun 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

robsdedude Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

ntBre left a comment

Choose a reason for hiding this comment

Uh oh!

ntBre Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ntBre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[`flake8-comprehensions`] Fix `C420` to prepend whitespace when needed #18616

[`flake8-comprehensions`] Fix `C420` to prepend whitespace when needed #18616

robsdedude commented Jun 10, 2025 •

edited

Loading

github-actions bot commented Jun 10, 2025 •

edited

Loading

`ruff-ecosystem` results

robsdedude Jun 27, 2025 •

edited

Loading