GH-98831: Refactor generate_cases.py #99408

gvanrossum · 2022-11-12T05:49:24Z

Consider this after GH-99313.

Issue: Generate the interpreter #98831

Had to refactor the parser a bit for this.

Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in C files of the Objects/ directory.

Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in Objects/dictobject.c.

…thon#99280) Co-authored-by: Kumar Aditya <59607654+kumaraditya303@users.noreply.github.com>

… venvs (pythonGH-99206) Check to see if `base_executable` exists. If it does not, attempt to use known alternative names of the python binary to find an executable in the path specified by `home`. If no alternative is found, previous behavior is preserved. Signed-off-by: Vincent Fazio <vfazio@gmail.com> Signed-off-by: Vincent Fazio <vfazio@gmail.com>

…9299)

python#99271) Also mark those opcodes that have no stack effect as such. Co-authored-by: Brandt Bucher <brandtbucher@gmail.com>

This was a bin subtle, esp. BINARY_SUBSCR_DICT (which plays games with refcounts) and BINARY_SUBSCR_GETITEM (which ends with a goto).

gvanrossum · 2022-11-14T04:37:51Z

My expectation at this point is that converting the majority of instructions to non-legacy form (cache and stack effects) will be a big project, even if we skip opcodes needing arrays or conditional stack effects.

The good news is that it can be parallelized -- each family can be done independently from any others, so anyone who wants to can pick a family and convert it, as long as it doesn't require fixing the generator (it's probably best if I stay on top of that).

I don't know how much we need to do before we can start experimenting with "macro" opcodes (see faster-cpython/ideas#491). Possibly we can start with an example like the one Mark wrote up in the DSL definition.

markshannon · 2022-11-15T10:40:09Z

I won't attempt a review of this PR, but a few comments on how I'd like the tool(s) to look.

This is a compiler, so it should be built like one: Dumb IR(s) and multiple passes.
Any methods on IR classes should be simple query method. Put all processing in the passes.

A possible design:

Tokenizer: Tokenize the whole input file, so that locations are correct.
Parser: Parse instructions
Analyzer: Group and check families, check stack effects. Compute necessary labels and tables, instruction size, etc.
Code gen: Generate C code as stream of tokens and strings
Formatter: Format resulting C code, inserting #line directives and whitespace where needed.

Then, when we want to generate a trace interpreter, a register interpreter, compiler front-end, etc. we can re-use the tokenizer, parser, and formatter.

gvanrossum · 2022-11-15T16:10:22Z

Thanks, I'll try to use that as a north star. In the meantime I am hoping to keep things working the way they were as much as possible to avoid disrupting other work. I have half the analyzer in this refactor, what's missing is a design for the IR to target.

gvanrossum · 2022-11-16T04:25:49Z

I'm starting over from scratch now GH-99313 is merged.

gvanrossum and others added 27 commits November 9, 2022 18:38

Support simple cache effects

9f15c4b

Had to refactor the parser a bit for this.

More BINARY_OP instructions

6189043

Merge remote-tracking branch 'origin/main' into cache-effects

f5e1aed

Tweak dummy definitions in bytecodes.c after merge

a8d608d

pythongh-99300: Use Py_NewRef() in Objects/ directory (python#99332)

4ee85e7

Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in C files of the Objects/ directory.

pythongh-99300: Use Py_NewRef() in Objects/dictobject.c (python#99333)

873da31

Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in Objects/dictobject.c.

pythongh-90110: Update the C-analyzer Tool (pythongh-99307)

e0ab5b8

pythongh-99277: remove older version of get_write_buffer_limits (py…

882fdec

…thon#99280) Co-authored-by: Kumar Aditya <59607654+kumaraditya303@users.noreply.github.com>

pythonGH-99298: Don't perform jumps before error handling (pythonGH-9…

1aa0124

…9299)

pythonGH-98831: Remove all remaining DISPATCH() calls from bytecodes.c (

d094e42

python#99271) Also mark those opcodes that have no stack effect as such. Co-authored-by: Brandt Bucher <brandtbucher@gmail.com>

Remaining BINARY_OP family members

d3d907a

Uniformly skip 'unused' effects

0339a67

Remove superfluous asserts; fix one 'is not'

f3e7dd6

Make BINARY_OP result unused

e3ff6ac

Fix parser for family()

3db443a

Check family consistency

c58a85a

Add first family (binary_op)

756a41b

Add assert() to double-check cache struct size

48400ac

Merge commit '00ee6d506e' into cache-effects

433243a

Merge commit '694cdb24a6' into cache-effects

3d51484

Merge remote-tracking branch 'origin/main' into cache-effects

4d42a0a

Merge remote-tracking branch 'origin/main' into cache-effects

ff8e0ec

Make family() macro variadic

ea16382

Merge remote-tracking branch 'origin/main' into cache-effects

8eadf1c

Show correct lineno on error; get rid of eopen()

205f12e

Kill -q flag

bdba4d2

bedevere-bot mentioned this pull request Nov 12, 2022

Generate the interpreter #98831

Closed

bedevere-bot added the awaiting core review label Nov 12, 2022

Refactor generate_cases.py. Tweak output a teensy bit.

bf10431

gvanrossum force-pushed the cases-refactor branch from 8088808 to bf10431 Compare November 12, 2022 05:59

Further refactor

4cfcf77

gvanrossum added the skip news label Nov 12, 2022

gvanrossum added 6 commits November 12, 2022 12:10

Merge remote-tracking branch 'origin/main' into cases-refactor

a280167

Make {In,Out}putEffect unions instead of classes

382e248

Fix some mypy errors

b5c8a9f

Move check_overlaps to generate_cases.py

387fbe1

Fix cache effect when used

d682118

Demonstrate cache effect with BINARY_SUBSCR

14e75f1

This was a bin subtle, esp. BINARY_SUBSCR_DICT (which plays games with refcounts) and BINARY_SUBSCR_GETITEM (which ends with a goto).

gvanrossum mentioned this pull request Nov 13, 2022

GH-98831: Implement basic cache effects #99313

Merged

Fix test crashes by fiddling DECREF order in BINARY_SUBSCR_DICT

1a5356b

gvanrossum closed this Nov 16, 2022

gvanrossum deleted the cases-refactor branch November 16, 2022 04:25

This was referenced Nov 16, 2022

We need a design for the DSL-based code generator faster-cpython/ideas#497

Closed

GH-98831: Refactor and fix cases generator #99526

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

GH-98831: Refactor generate_cases.py #99408

GH-98831: Refactor generate_cases.py #99408

Uh oh!

gvanrossum commented Nov 12, 2022 •

edited by bedevere-bot

Loading

Uh oh!

gvanrossum commented Nov 14, 2022

Uh oh!

markshannon commented Nov 15, 2022 •

edited by gvanrossum

Loading

Uh oh!

gvanrossum commented Nov 15, 2022

Uh oh!

gvanrossum commented Nov 16, 2022

Uh oh!

Uh oh!

Uh oh!

GH-98831: Refactor generate_cases.py #99408

GH-98831: Refactor generate_cases.py #99408

Uh oh!

Conversation

gvanrossum commented Nov 12, 2022 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gvanrossum commented Nov 14, 2022

Uh oh!

markshannon commented Nov 15, 2022 • edited by gvanrossum Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gvanrossum commented Nov 15, 2022

Uh oh!

gvanrossum commented Nov 16, 2022

Uh oh!

Uh oh!

gvanrossum commented Nov 12, 2022 •

edited by bedevere-bot

Loading

markshannon commented Nov 15, 2022 •

edited by gvanrossum

Loading