Implement Scalar Scan (as dummy Op) #174

ricardoV94 · 2023-01-05T11:55:51Z

Alternative to #283

Closes #83

The idea here is to implement a specialized kind of looping operation that can be treated as an Elemwise. I create a dummy scalar Op, whose Elemwise version is then converted to a scan by a rewrite.

As a test case I made use of the Scalar Scans in the gradient of gammaincc. Performance is now 20-30x better than before when running the new benchmark test, even though the two branches are now always computed. Further improvements could be achieved with a lazy switch, but that is not really the goal of this PR.

PS: It shouldn't be so hard to write a scan without accumulation

pytensor/tensor/rewriting/elemwise.py

ricardoV94 · 2023-01-09T18:46:40Z

pytensor/tensor/rewriting/elemwise.py

+    # Scan output size is given by the size of the input leading dimension, by default its n_steps + 1.
+    # If we only want to store the last elements we can shorten the leading dimension to 1
+    scan_node = ret[0].owner.inputs[0].owner
+    scan_inputs = scan_node.inputs
+    n_steps = scan_inputs[0]
+    n_non_seqs = scan_node.op.info.n_non_seqs
+    carried_inputs = scan_inputs[1 : len(scan_inputs) - n_non_seqs :]
+    constant_inputs = scan_inputs[len(scan_inputs) - n_non_seqs :]
+    new_carried_inputs = []
+    for carried_input in carried_inputs:
+        assert isinstance(carried_input.owner.op, IncSubtensor)
+        fill_value = carried_input.owner.inputs[1]
+        # TODO: Check for the global flag where this is controlled
+        new_carried_inputs.append(expand_empty(fill_value, 1))
+    ret = scan_node.op.make_node(n_steps, *new_carried_inputs, *constant_inputs).outputs


One shouldn't have to hack into scan internals to avoid saving the intermediate results... This is needed because of #178

Fix bug in gradient of Elemwise containing multi-output scalars

33d4d36

ricardoV94 force-pushed the scalar_scan branch from aea10aa to 7396b31 Compare January 5, 2023 12:08

ricardoV94 mentioned this pull request Jan 6, 2023

Apply scan memory save rewrite to while scans #178

Closed

ricardoV94 added scan gradients performance labels Jan 6, 2023

ricardoV94 commented Jan 6, 2023

View reviewed changes

pytensor/tensor/rewriting/elemwise.py Show resolved Hide resolved

WIP implement scalar Scan Op

aad7681

ricardoV94 force-pushed the scalar_scan branch from 7396b31 to aad7681 Compare January 9, 2023 18:36

ricardoV94 commented Jan 9, 2023

View reviewed changes

ricardoV94 mentioned this pull request Feb 15, 2023

Optimize while scans when only last state is needed #216

Merged

1 task

ricardoV94 mentioned this pull request Apr 25, 2023

Implement scalar loop for iterative gradients #283

Merged

ricardoV94 changed the title ~~Implement Scalar Scan~~ Implement Scalar Scan (as dummy Op) Apr 25, 2023

ricardoV94 added Op implementation request discussion labels Apr 25, 2023

ricardoV94 closed this May 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement Scalar Scan (as dummy Op) #174

Implement Scalar Scan (as dummy Op) #174

Uh oh!

ricardoV94 commented Jan 5, 2023 •

edited

Loading

Uh oh!

Uh oh!

ricardoV94 Jan 9, 2023 •

edited

Loading

Uh oh!

Uh oh!

Implement Scalar Scan (as dummy Op) #174

Implement Scalar Scan (as dummy Op) #174

Uh oh!

Conversation

ricardoV94 commented Jan 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

ricardoV94 Jan 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ricardoV94 commented Jan 5, 2023 •

edited

Loading

ricardoV94 Jan 9, 2023 •

edited

Loading