
Conversation

@AmosLewis (Collaborator) commented Mar 24, 2023

#1953

SUCCESS: test_slicecopy.py

test_slicecopy_masked_fill.py

t5_small_torchscript_test2.mlir

➜  t5small git:(main) ✗ torch-mlir-opt -pass-pipeline='builtin.module(torchscript-module-to-torch-backend-pipeline{backend-legal-ops=torch.aten.flatten.using_ints,torch.aten.native_layer_norm,torch.aten.linear})' ./t5_small_torchscript_test2.mlir
module attributes {torch.debug_module_name = "_lambda"} {
  func.func @forward(%arg0: !torch.vtensor<[1,15],si64>, %arg1: !torch.vtensor<[1,4],si64>) -> !torch.vtensor<[1,4],si64> {
    %int1 = torch.constant.int 1
    %int0 = torch.constant.int 0
    %false = torch.constant.bool false
    %int4 = torch.constant.int 4
    %none = torch.constant.none
    %int-1 = torch.constant.int -1
    %int-100 = torch.constant.int -100
    %int9223372036854775807 = torch.constant.int 9223372036854775807
    %cpu = torch.constant.device "cpu"
    %0 = torch.prim.ListConstruct %int1, %int4 : (!torch.int, !torch.int) -> !torch.list<int>
    %1 = torch.aten.zeros %0, %int4, %int0, %cpu, %false : !torch.list<int>, !torch.int, !torch.int, !torch.Device, !torch.bool -> !torch.vtensor<[1,4],si64>
    %2 = torch.aten.slice.Tensor %arg1, %int1, %int0, %int-1, %int1 : !torch.vtensor<[1,4],si64>, !torch.int, !torch.int, !torch.int, !torch.int -> !torch.vtensor<[1,3],si64>
    %3 = torch.aten.clone %2, %none : !torch.vtensor<[1,3],si64>, !torch.none -> !torch.vtensor<[1,3],si64>
    %4 = torch.aten.slice.Tensor %1, %int1, %int1, %int9223372036854775807, %int1 : !torch.vtensor<[1,4],si64>, !torch.int, !torch.int, !torch.int, !torch.int -> !torch.vtensor<[1,3],si64>
    %5 = torch.aten.arange.start_step %int1, %int4, %int1, %none, %none, %none, %none : !torch.int, !torch.int, !torch.int, !torch.none, !torch.none, !torch.none, !torch.none -> !torch.vtensor<[3],si64>
    %6 = torch.prim.ListConstruct %none, %5 : (!torch.none, !torch.vtensor<[3],si64>) -> !torch.list<optional<vtensor>>
    %7 = torch.aten._index_put_impl %1, %6, %3, %false, %false : !torch.vtensor<[1,4],si64>, !torch.list<optional<vtensor>>, !torch.vtensor<[1,3],si64>, !torch.bool, !torch.bool -> !torch.vtensor<[1,4],si64>
    %8 = torch.aten.slice.Tensor %7, %int1, %int0, %int1, %int1 : !torch.vtensor<[1,4],si64>, !torch.int, !torch.int, !torch.int, !torch.int -> !torch.vtensor<[1,1],si64>
    %9 = torch.aten.squeeze.dim %8, %int1 : !torch.vtensor<[1,1],si64>, !torch.int -> !torch.vtensor<[1],si64>
    %10 = torch.aten.eq.Scalar %7, %int-100 : !torch.vtensor<[1,4],si64>, !torch.int -> !torch.vtensor<[1,4],i1>
    %11 = torch.prim.ListConstruct  : () -> !torch.list<int>
    %12 = torch.prim.NumToTensor.Scalar %int0 : !torch.int -> !torch.vtensor<[],si64>
    %13 = torch.aten.broadcast_to %12, %11 : !torch.vtensor<[],si64>, !torch.list<int> -> !torch.vtensor<[],si64>
    %14 = torch.aten.where.self %10, %13, %7 : !torch.vtensor<[1,4],i1>, !torch.vtensor<[],si64>, !torch.vtensor<[1,4],si64> -> !torch.vtensor<[1,4],si64>
    return %14 : !torch.vtensor<[1,4],si64>
  }
}
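For readability, here is a rough eager-mode reading of the lowered forward() above (an editor's sketch inferred from the ops in the IR; it appears to mirror T5's decoder-input shifting, and the names are illustrative):

import torch

def forward(input_ids, labels):
    # %arg0 is unused in the IR, so input_ids is unused here as well.
    # aten.zeros: fresh [1, 4] int64 buffer.
    shifted = torch.zeros(1, 4, dtype=torch.int64)
    # aten.slice + aten.clone + aten._index_put_impl:
    # copy labels[:, :-1] into shifted[:, 1:].
    shifted[:, 1:] = labels[:, :-1].clone()
    # (The IR also slices and squeezes shifted[:, 0:1], but never uses it.)
    # aten.eq.Scalar + aten.where.self: replace -100 sentinels with 0.
    shifted = torch.where(shifted == -100, torch.zeros_like(shifted), shifted)
    return shifted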

@AmosLewis requested a review from gpetters94 on March 24, 2023 20:18
@AmosLewis force-pushed the int64_max branch 2 times, most recently from a241ec4 to 160cdeb on March 25, 2023 01:58
@AmosLewis (Collaborator, Author) commented Mar 25, 2023

Taken from @gpetters94's patch https://github.com/gpetters94/mlir-npcomp/tree/intmax
It looks like it fixes the int64_max issue but brings back the original slice-and-copy issue:
https://gist.github.com/AmosLewis/1826326e9f85480da9f13191cb4b86f7
%2 = torch.tensor_static_info_cast %1 : !torch.vtensor<[1,4],si64> to !torch.vtensor<*,si64>
The shape info disappears.

@AmosLewis (Collaborator, Author) commented

This patch already fixes the int64_max issue. The new shape issues are from the masked_fill_ op:
%144 = torch.aten.masked_fill_.Scalar %134, %143, %int0 : !torch.tensor, !torch.tensor, !torch.int -> !torch.tensor
which comes from Python assigning a value to a slice:
x_new[..., 0] = 0
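A minimal module that triggers this pattern might look like the following (an editor's sketch; the exact model code is not shown in this thread):

import torch

class SliceAssignModule(torch.nn.Module):
    def forward(self, x):
        x_new = x.new_zeros(1, 4)
        # In-place assignment to a slice; TorchScript lowers this to
        # aten::select + aten::fill_ (masked_fill_ in the full model).
        x_new[..., 0] = 0
        return x_new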

@AmosLewis (Collaborator, Author) commented Mar 27, 2023

I tried using only x_new[..., 0] = 0 as a model, but got:

ts_g.graph: 
graph(%self : __torch__.torch.fx.graph_module._lambda,
      %arg0_1 : Tensor,
      %arg1_1.1 : Tensor):
  %11 : bool = prim::Constant[value=0]() # <eval_with_key>.2:5:144
  %37 : Device = prim::Constant[value="cpu"]()
  %4 : int = prim::Constant[value=1]() # <eval_with_key>.2:5:50
  %5 : int = prim::Constant[value=4]() # <eval_with_key>.2:5:53
  %19 : int = prim::Constant[value=0]() # <eval_with_key>.2:8:49
  %6 : int[] = prim::ListConstruct(%4, %5)
  %new_zeros.1 : Tensor = aten::new_zeros(%arg1_1.1, %6, %5, %19, %37, %11) # <eval_with_key>.2:5:16
  %_tensor_constant0.1 : Tensor = prim::GetAttr[name="_tensor_constant0"](%self)
  %lift_fresh_copy.1 : Tensor = aten::lift_fresh_copy(%_tensor_constant0.1) # <eval_with_key>.2:7:22
  %select.1 : Tensor = aten::select(%new_zeros.1, %4, %19) # <eval_with_key>.2:8:13
  %fill_ : Tensor = aten::fill_(%select.1, %lift_fresh_copy.1) # <eval_with_key>.2:9:12
  return (%new_zeros.1)

Traceback (most recent call last):
  File "/home/chi/src/ubuntu20/shark/SHARK/tank/pytorch/t5small/test.py", line 101, in <module>
    module = torch_mlir.compile(
  File "/home/chi/src/ubuntu20/shark/torch-mlir/build/tools/torch-mlir/python_packages/torch_mlir/torch_mlir/__init__.py", line 358, in compile
    raise Exception(f"""
Exception: 
PyTorch TorchScript module -> torch-mlir Object Graph IR import failed with:
### Importer C++ Exception:
required keyword attribute 'split' is undefined
### Importer Diagnostics:

FIXED by upgrading my torch and torchvision versions.

@AmosLewis (Collaborator, Author) commented Mar 27, 2023

Success: test_slicecopy.py

@ramiro050 (Collaborator) left a comment

There are multiple edge cases here. All of them should be e2e tested to avoid off-by-one errors.
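The edge cases presumably include sentinel, negative, and out-of-range start/end values, e.g. (a hedged sketch; the exact set is not enumerated in the thread):

import torch

t = torch.arange(4)
INT64_MAX = 9223372036854775807

torch.ops.aten.slice(t, 0, 0, INT64_MAX, 1)  # end sentinel: slice to the end
torch.ops.aten.slice(t, 0, -3, -1, 1)        # negative start/end indices
torch.ops.aten.slice(t, 0, 2, 100, 1)        # end past the dimension size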

@AmosLewis (Collaborator, Author) commented

#2005

@AmosLewis (Collaborator, Author) commented Apr 18, 2023

Added a TODO for the general clamping approach: #2005 (comment)

@AmosLewis AmosLewis requested a review from ramiro050 April 18, 2023 18:01
@ramiro050 (Collaborator) commented

> I suggest we just add the fragile INT64_MAX handling and add a TODO there to fix it later. Otherwise, I will just be stuck here.

Sure, it does require quite a few ops. Can we keep the same structure as the other patch? In particular, we should have a helper function clampDimToValidRange that checks whether the value equals INT64_MAX or INT64_MIN and converts it to the dimension size or 0, respectively. The helper function can then be used on both start and end, since both values require the same clamping.

This will make it easier to add full clamping support in the future, since all that will then be needed is to improve clampDimToValidRange.
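A Python model of the suggested helper might look like this (the real implementation would live in the C++ lowering; this only illustrates the proposed semantics):

INT64_MAX = 9223372036854775807
INT64_MIN = -INT64_MAX - 1

def clamp_dim_to_valid_range(value, dim_size):
    # Map the INT64_MAX/INT64_MIN sentinels to dim_size/0; full clamping
    # support would also handle arbitrary out-of-range values.
    if value == INT64_MAX:
        return dim_size
    if value == INT64_MIN:
        return 0
    return value

# Used uniformly on both start and end of a slice:
# start = clamp_dim_to_valid_range(start, dim_size)
# end = clamp_dim_to_valid_range(end, dim_size)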

@ramiro050 (Collaborator) commented

Also, this PR should have e2e tests covering the new functionality.

@AmosLewis (Collaborator, Author) commented Apr 19, 2023

@gpetters94 Here is the new slice-copy e2e test for my case:

# Imports as in the torch-mlir e2e test suite (module paths may vary by version).
import torch
from torch_mlir_e2e_test.framework import TestUtils
from torch_mlir_e2e_test.registry import register_test_case
from torch_mlir_e2e_test.annotations import annotate_args, export

class SliceCopy2DStaticModule(torch.nn.Module):
    def __init__(self):
        super().__init__()

    @export
    @annotate_args([
        None,
        ([1, 4], torch.int64, True),
        ([1, 3], torch.int64, True),
    ])
    def forward(self, x, y):
        xslice = torch.ops.aten.slice(x, 1, 1, 4, 1)
        xslice.copy_(y)
        return x


@register_test_case(module_factory=lambda: SliceCopy2DStaticModule())
def SliceCopy2DStaticModule_basic(module, tu: TestUtils):
    module.forward(tu.randint(1, 4, high=4), tu.randint(1, 3, high=1))

# ==============================================================================
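For intuition, the aliasing behavior the test exercises can be checked in eager mode (an editor's sanity check, not part of the e2e suite):

import torch

x = torch.randint(0, 4, (1, 4))
y = torch.randint(0, 1, (1, 3))  # all zeros, matching tu.randint(..., high=1)
xslice = torch.ops.aten.slice(x, 1, 1, 4, 1)  # a view of x[:, 1:4]
xslice.copy_(y)  # writes through the view
assert torch.equal(x[:, 1:], y)  # x itself was mutated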

@AmosLewis (Collaborator, Author) commented

Fixed by a46b5c6

@AmosLewis closed this on Jun 27, 2023
@AmosLewis deleted the int64_max branch on January 19, 2024 19:15