bpo-36256: Fix bug in parsermodule when parsing if statements #12477

pablogsal · 2019-03-21T00:22:43Z

This one was tricky! :)

In the parser module, when validating nodes before starting the parsing with to create a ST in "parser_newstobject" there is a problem that appears when two arcs in the same DFA state has transitions with labels with the same type. For example, the DFA for if_stmt has a state with
two labels with the same type: "elif" and "else" (type NAME). The algorithm tries one by one the arcs until the label that starts the arc transition has a label with the same type of the current child label we are
triying to accept. In this case, the arc for "elif" comes before the arc for "else"and passes this test (because the current child label is "else" and has the same type as "elif"). This lead to expecting a namedexpr_test (305) instead of a colon (11). The solution is to compare also the string representation (in case there is one) of the labels to see if the transition that we have is the correct one.

https://bugs.python.org/issue36256

In the parser module, when validating nodes before starting the parsing with to create a ST in "parser_newstobject" there is a problem that appears when two arcs in the same DFA state has transitions with labels with the same type. For example, the DFA for if_stmt has a state with two labels with the same type: "elif" and "else" (type NAME). The algorithm tries one by one the arcs until the label that starts the arc transition has a label with the same type of the current child label we are triying to accept. In this case, the arc for "elif" comes before the arc for "else"and passes this test (because the current child label is "else" and has the same type as "elif"). This lead to expecting a namedexpr_test (305) instead of a colon (11). The solution is to compare also the string representation (in case there is one) of the labels to see if the transition that we have is the correct one.

tyomitch · 2019-03-21T05:25:06Z

Note that the generation of error message at https://github.com/python/cpython/pull/12477/files#diff-73f51bbc1366ee12a4f041d90bbb902dR700 wouldn't handle mismatching NAMEs correctly.

Please see https://github.com/python/cpython/pull/10995/files#diff-73f51bbc1366ee12a4f041d90bbb902dR698 for the remaining part of the fix

tyomitch · 2019-03-21T06:20:57Z

An illustration of what I mean:

Python 3.8.0a2+ (remotes/pablogsal/36256:ca3e88dca0, Mar 21 2019, 08:15:48) 
[GCC 8.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import parser
>>> parser.sequence2st((257,(269,(295,(297,(1,'fi'),(305,(306,(310,(311,(312,(313,(316,(317,(318,(319,(320,(321,(322,(323,(324,(325,(1,'True'))))))))))))))))),(11,':'),(304,(4,''),(5,''),(269,(270,(271,(277,(1,'pass'))),(4,''))),(6,''))))),(4,''),(0,'')))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
parser.ParserError: Illegal terminal: expected NAME.

I.e., it complains on a NAME saying that it expected NAME instead: not very informative!

tyomitch · 2019-03-21T13:38:54Z

Very good, thanks!

Modules/parsermodule.c

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

Modules/parsermodule.c

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

pablogsal · 2019-03-21T23:34:43Z

Thank you, everyone, for your review and code suggestions!

miss-islington · 2019-03-21T23:36:03Z

Thanks @pablogsal for the PR 🌮🎉.. I'm working now to backport this PR to: 3.7.
🐍🍒⛏🤖

bedevere-bot · 2019-03-21T23:36:12Z

GH-12488 is a backport of this pull request to the 3.7 branch.

…GH-12477) bpo-36256: Fix bug in parsermodule when parsing if statements In the parser module, when validating nodes before starting the parsing with to create a ST in "parser_newstobject" there is a problem that appears when two arcs in the same DFA state has transitions with labels with the same type. For example, the DFA for if_stmt has a state with two labels with the same type: "elif" and "else" (type NAME). The algorithm tries one by one the arcs until the label that starts the arc transition has a label with the same type of the current child label we are trying to accept. In this case, the arc for "elif" comes before the arc for "else"and passes this test (because the current child label is "else" and has the same type as "elif"). This lead to expecting a namedexpr_test (305) instead of a colon (11). The solution is to compare also the string representation (in case there is one) of the labels to see if the transition that we have is the correct one. (cherry picked from commit 9a0000d) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>

pablogsal requested a review from brettcannon March 21, 2019 00:22

the-knights-who-say-ni added the CLA signed label Mar 21, 2019

bedevere-bot added the awaiting merge label Mar 21, 2019

Add News entry

ca3e88d

fixup! Add News entry

a902d96

brettcannon approved these changes Mar 21, 2019

View reviewed changes

Modules/parsermodule.c Outdated Show resolved Hide resolved

Modules/parsermodule.c Outdated Show resolved Hide resolved

Modules/parsermodule.c Outdated Show resolved Hide resolved

brettcannon and others added 3 commits March 21, 2019 21:24

Update Modules/parsermodule.c

c2aeb32

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

Update Modules/parsermodule.c

eb14255

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

Update Modules/parsermodule.c

a9b67d4

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

ZackerySpytz reviewed Mar 21, 2019

View reviewed changes

Modules/parsermodule.c Outdated Show resolved Hide resolved

Update Modules/parsermodule.c

db148c0

Co-Authored-By: pablogsal <Pablogsal@gmail.com>

pablogsal merged commit 9a0000d into python:master Mar 21, 2019

bedevere-bot removed the awaiting merge label Mar 21, 2019

pablogsal deleted the 36256 branch March 21, 2019 23:33

pablogsal added the needs backport to 3.7 label Mar 21, 2019

bedevere-bot removed the needs backport to 3.7 label Mar 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-36256: Fix bug in parsermodule when parsing if statements #12477

bpo-36256: Fix bug in parsermodule when parsing if statements #12477

Uh oh!

pablogsal commented Mar 21, 2019 •

edited by bedevere-bot

Loading

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pablogsal commented Mar 21, 2019

Uh oh!

miss-islington commented Mar 21, 2019

Uh oh!

bedevere-bot commented Mar 21, 2019

Uh oh!

Uh oh!

Uh oh!

bpo-36256: Fix bug in parsermodule when parsing if statements #12477

bpo-36256: Fix bug in parsermodule when parsing if statements #12477

Uh oh!

Conversation

pablogsal commented Mar 21, 2019 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

tyomitch commented Mar 21, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pablogsal commented Mar 21, 2019

Uh oh!

miss-islington commented Mar 21, 2019

Uh oh!

bedevere-bot commented Mar 21, 2019

Uh oh!

Uh oh!

pablogsal commented Mar 21, 2019 •

edited by bedevere-bot

Loading