GH-115506: Improve handling of constants in tier two #124809

brandtbucher · 2024-09-30T22:45:29Z

This adds a refs tuple to executor objects, which contains constants created during optimization of tier two traces. This isn't deduplicated at all, since not all of our "constants" are actually "constant", and arbitrary hashes/comparisons can open up a whole can of worms. If we want to go that route, we can probably re-use _PyCode_ConstantKey for known safe, immutable types, and compare everything else by identity.
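To make the trade-off concrete, here is a minimal sketch of the two policies mentioned above: plain appending (no deduplication) and deduplication by identity only. It is not the code in this PR. `RefPool`, `refpool_append`, and `refpool_intern_by_identity` are hypothetical names, and plain pointers stand in for `PyObject *` so the sketch compiles without `Python.h`.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for an executor's refs storage. A real version
   would hold strong references to PyObject *; plain pointers are used
   here so the sketch has no CPython dependency. */
#define REFS_CAPACITY 16

typedef struct {
    const void *items[REFS_CAPACITY];
    size_t count;
} RefPool;

/* Append without any deduplication.
   Returns the index of the new slot, or -1 if the pool is full. */
static int
refpool_append(RefPool *pool, const void *obj)
{
    assert(obj != NULL);
    if (pool->count == REFS_CAPACITY) {
        return -1;
    }
    pool->items[pool->count] = obj;
    return (int)pool->count++;
}

/* Deduplicate by identity: reuse a slot only when the very same object is
   already stored. No hashing or equality comparison is performed, so no
   user-defined code could run here. */
static int
refpool_intern_by_identity(RefPool *pool, const void *obj)
{
    for (size_t i = 0; i < pool->count; i++) {
        if (pool->items[i] == obj) {
            return (int)i;
        }
    }
    return refpool_append(pool, obj);
}
```

The linear scan is only reasonable for small pools. A key-based scheme for known safe, immutable types would replace the pointer comparison for those types and fall back to identity for everything else.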

This also updates some parts of the optimizer to improve the handling of known constants (such as adding peepholing for _POP_TOP_LOAD_CONST_INLINE_BORROW, _REPLACE_WITH_TRUE, and _COPY/_LOAD_FAST with known constant values).

Performance and memory are in the noise... perhaps a bit faster if you squint hard enough. But it's working: the stats show lots of instructions like _REPLACE_WITH_TRUE, _POP_TOP_LOAD_CONST_INLINE_BORROW, _LOAD_FAST, _COPY, and _BINARY_OP_ADD_INT being replaced with simpler instructions like _LOAD_CONST_INLINE_BORROW, _POP_TOP, and _LOAD_CONST_INLINE.
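As a rough illustration of the kind of rewrite described above, the toy pass below promotes loads of locals with known values to inline constant loads. The enum, structs, and `promote_constant_loads` are hypothetical and do not mirror the real uop or optimizer data structures. The sketch also assumes that locals are not reassigned within the trace, and that the borrowing variant is chosen only for immortal values.

```c
#include <assert.h>
#include <stddef.h>

/* Toy instruction set, loosely named after the uops discussed above. */
typedef enum {
    TOY_LOAD_FAST,
    TOY_LOAD_CONST_INLINE,
    TOY_LOAD_CONST_INLINE_BORROW,
    TOY_POP_TOP
} ToyOp;

typedef struct {
    ToyOp op;
    int oparg;            /* local index for TOY_LOAD_FAST */
    const void *operand;  /* inlined constant, if any */
} ToyInst;

/* What the optimizer knows about one local. value == NULL means unknown. */
typedef struct {
    const void *value;
    int immortal;
} KnownLocal;

/* Rewrite loads of locals with known values into inline constant loads.
   Assumption: no instruction in the trace stores to these locals.
   Returns the number of instructions rewritten. */
static int
promote_constant_loads(ToyInst *trace, size_t n,
                       const KnownLocal *locals, size_t nlocals)
{
    int rewritten = 0;
    for (size_t i = 0; i < n; i++) {
        ToyInst *inst = &trace[i];
        if (inst->op != TOY_LOAD_FAST) {
            continue;
        }
        assert(inst->oparg >= 0 && (size_t)inst->oparg < nlocals);
        const KnownLocal *known = &locals[inst->oparg];
        if (known->value == NULL) {
            continue;  /* value not known: leave the load alone */
        }
        /* Immortal values need no reference, so they can be borrowed. */
        inst->op = known->immortal ? TOY_LOAD_CONST_INLINE_BORROW
                                   : TOY_LOAD_CONST_INLINE;
        inst->operand = known->value;
        inst->oparg = 0;
        rewritten++;
    }
    return rewritten;
}
```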

My next step will be experimenting with no-refcount variants of _COPY, _LOAD_FAST, _STORE_FAST, and _POP_TOP with known immortal values.

Issue: Eliminate constant inputs and output constants for simple operations #115506

markshannon · 2024-10-01T16:52:06Z

I think this belongs in the (yet to be created) partial evaluation pass.

Any optimization that does evaluation, or transforms representation belongs in that pass.

In that pass, the _BINARY_OP_MULTIPLY_FLOAT case would look something like:

op(_BINARY_OP_MULTIPLY_FLOAT, (left, right -- res)) {
    if (is_const(left) && is_const(right)) {
        assert(PyFloat_CheckExact(get_const(left)));
        assert(PyFloat_CheckExact(get_const(right)));
        PyObject *temp = PyFloat_FromDouble(
            PyFloat_AS_DOUBLE(get_const(left)) *
            PyFloat_AS_DOUBLE(get_const(right)));
        res = virtual_const(temp, ctx);
    }
    else {
        materialize(left, ctx);
        materialize(right, ctx);
        emit(_BINARY_OP_MULTIPLY_FLOAT, ctx);
        res = concrete_tos(ctx);
    }
}
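One way to read the pseudocode above is as a tiny partial evaluator: constants stay "virtual" (no instruction emitted) until an operation needs a real value, at which point they are materialized. The self-contained sketch below models that idea with plain doubles instead of Python objects. All names (`Sym`, `PeCtx`, `pe_materialize`, `pe_multiply_float`) are hypothetical, and the toy ignores stack ordering, which a real pass would need to handle.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

typedef enum { PE_LOAD_CONST, PE_MULTIPLY_FLOAT } PeOp;

typedef struct {
    PeOp op;
    double operand;  /* constant for PE_LOAD_CONST, unused otherwise */
} PeInst;

/* Output buffer for the instructions the pass decides to emit. */
typedef struct {
    PeInst out[32];
    size_t n;
} PeCtx;

/* A symbolic value: either a virtual constant (nothing emitted yet) or a
   concrete value that already exists at runtime. */
typedef struct {
    bool is_const;
    double value;  /* meaningful only when is_const is true */
} Sym;

static void
pe_emit(PeCtx *ctx, PeOp op, double operand)
{
    assert(ctx->n < sizeof(ctx->out) / sizeof(ctx->out[0]));
    ctx->out[ctx->n++] = (PeInst){op, operand};
}

/* Turn a virtual constant into a real value by emitting a load.
   Concrete values are left untouched. */
static void
pe_materialize(PeCtx *ctx, Sym *sym)
{
    if (sym->is_const) {
        pe_emit(ctx, PE_LOAD_CONST, sym->value);
        sym->is_const = false;
    }
}

/* Fold when both inputs are constants; otherwise emit the multiply. */
static Sym
pe_multiply_float(PeCtx *ctx, Sym left, Sym right)
{
    if (left.is_const && right.is_const) {
        return (Sym){true, left.value * right.value};
    }
    pe_materialize(ctx, &left);
    pe_materialize(ctx, &right);
    pe_emit(ctx, PE_MULTIPLY_FLOAT, 0.0);
    return (Sym){false, 0.0};
}
```

In this model, multiplying two constants emits nothing, and the folded result is loaded only if a later operation with a non-constant input forces it to be materialized.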

Fidget-Spinner · 2024-10-02T14:24:21Z

I think we could extract the useful part (the API to maintain a constant pool in the executor) though, and make the PR just that.

brandtbucher · 2024-10-02T15:47:01Z

Well, I think the rest of the PR is useful too... 🙂

I'm not sure I understand the desire to have an entirely new pass when the same work could be easily done as part of this one. It seems like a lot of extra code (and computation), for little gain. I also suspect that we would be repeating a ton of the same work in the second pass.

Is there a paper or something that explains why two passes are better than one? My intuition would be to reduce the number of passes, rather than increasing it (for example, by doing abstract interpretation during trace projection).

Fidget-Spinner · 2024-10-02T16:06:40Z

> Is there a paper or something that explains why two passes are better than one? My intuition would be to reduce the number of passes, rather than increasing it (for example, by doing abstract interpretation during trace projection).

Actually, the opposite: the literature suggests that combining passes produces better code, not separating them 😉. I think we just want a clean separation of concerns, though.

brandtbucher · 2024-10-02T16:44:45Z

This can wait until the mythical second pass is implemented.

brandtbucher and others added 11 commits September 30, 2024 15:18

Promote immortal locals to constants

279e61d

Promote constant math and locals to constant loads

89d6d45

Promote constant copies to constant loads

8977c39

Deduplicate constants, and make some attribute loads constant

60e1e10

Constant merging isn't needed for anything already created

0dfd1fe

Improve handling of _POP_TOP_LOAD_CONST_INLINE_BORROW and _REPLACE_WITH_TRUE

0d2e040

Remove original or unused constants

8e6e406

fixup

cc185ce

Don't resurrect dead caches

adfacc1

Don't bother deduplicating (and factor out some repeated code)

d93c5a3

blurb add

cc0d30f

brandtbucher added performance Performance or resource usage interpreter-core (Objects, Python, Grammar, and Parser dirs) labels Sep 30, 2024

brandtbucher self-assigned this Sep 30, 2024

brandtbucher requested review from Fidget-Spinner and markshannon as code owners September 30, 2024 22:45

bedevere-app bot added the awaiting core review label Sep 30, 2024

bedevere-app bot mentioned this pull request Sep 30, 2024

Eliminate constant inputs and output constants for simple operations #115506

Open

2 tasks

brandtbucher force-pushed the tier-two-constants branch from eb54546 to cc0d30f Compare September 30, 2024 22:46

brandtbucher closed this Oct 2, 2024
