
Add pytorch interface to ATen Dialect #30

Merged
merged 1 commit on Aug 21, 2020

Conversation

stephenneuendorffer
Contributor

This patch adds a pytorch interface to npcomp. This interface is modeled
after pytorch_xla and exposes the MLIR-based flow as a virtual device (similar
to a gpu device or an xla backend). Usage is something like:

dev = torch_mlir.mlir_device()
t0 = torch.randn((4,4), device=dev)
t1 = torch.randn((4,4), device=dev)
t2 = t0 + t1
t2_mlir = torch_mlir.get_mlir(t2)
t2_cpu = t2.to('cpu')

In this case t2_cpu contains the result of the computation, and t2_mlir
contains the mlir description of the computation. Note that this also
properly returns backward paths synthesized by pytorch. There are roughly
speaking five parts to this:

  1. A tensor type (implemented by Tensor.* and TensorImpl.*)
  2. The device modeling (ATenMLIRBridge, ATenMLIRDevice, ATenMLIRType)
  3. A temporary IR (implemented by ir.cpp)
  4. The driver that uses the IR to generate MLIR, run Passes and compile the
    result using mlir::ExecutionEngine (implemented by jit.cpp and
    MLIRGenerator.cpp)
  5. A runtime library implemented by lib/aten_ops.cpp
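The trace-then-lower flow these parts implement can be sketched in miniature. The following is illustrative Python only, under stated assumptions: the real tensor type, IR, and MLIR generation are C++ (Tensor.*, ir.cpp, MLIRGenerator.cpp), and every class and function name below is hypothetical, not part of this patch.

```python
class Node:
    """One operation recorded in the temporary IR."""
    def __init__(self, op, inputs):
        self.op = op          # e.g. "aten.add"
        self.inputs = inputs  # upstream Node objects (empty for leaves)

class TraceTensor:
    """Stands in for a tensor on the virtual MLIR device: arithmetic on it
    records IR nodes instead of computing values eagerly."""
    def __init__(self, node):
        self.node = node

    def __add__(self, other):
        return TraceTensor(Node("aten.add", [self.node, other.node]))

def emit_mlir(node, lines=None, names=None):
    """Walk the recorded graph and emit MLIR-like text (illustrative syntax)."""
    if lines is None:
        lines, names = [], {}
    if node in names:
        return names[node], lines
    if node.op == "leaf":
        # Leaves become block arguments.
        name = f"%arg{sum(1 for n in names.values() if n.startswith('%arg'))}"
    else:
        args = [emit_mlir(i, lines, names)[0] for i in node.inputs]
        name = f"%{len(lines)}"
        lines.append(f'{name} = "{node.op}"({", ".join(args)})')
    names[node] = name
    return name, lines

# Usage mirroring the snippet above:
t0 = TraceTensor(Node("leaf", []))
t1 = TraceTensor(Node("leaf", []))
t2 = t0 + t1
_, ir_lines = emit_mlir(t2.node)
print("\n".join(ir_lines))  # -> %0 = "aten.add"(%arg0, %arg1)
```

The point of the toy is the division of labor: the tensor type only records, the temporary IR only remembers, and a separate walk produces MLIR, which is the same split as parts 1, 3, and 4 above.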

Particular areas where feedback would be useful include:

  1. How do we merge the runtime library and jit infrastructure with what is
    being done for numpy?
  2. Suggestions about naming, or how to merge this into the existing directory
    hierarchy. Currently it seems like a bit of a blob with everything depending
    on everything else. There are also some function definitions that don't live
    in the file corresponding to their declaration.
  3. It's unclear to me how much of the 'IR' is necessary, or whether we should
    just create MLIR on the fly.

Some aspects of this are known to be less than optimal, in particular:

  1. Some files don't follow the LLVM naming convention, and pragma once is
    used everywhere (pending file renaming).
  2. It's unclear how much of the complexity inherited from pytorch_xla is
    necessary: is something simpler sufficient, should pytorch_xla be extended,
    should we keep the complexity and support backend devices the way
    pytorch_xla does, or will there be a different model for how backend
    devices are handled?
  3. More aspects of this (e.g. the IR) seem like they should be automatically
    generated.
  4. The tests should get moved around and live with the existing tests.

@silvasean (Contributor) left a comment

First round of comments.

Review comments (since resolved) on:
- torch_mlir/csrc/ATenMLIRType.h
- torch_mlir/csrc/jit.cpp
- torch_mlir/csrc/Tensor.cpp
- torch_mlir/csrc/MLIRGenerator.h
- torch_mlir/lib/aten_ops.cpp (3 comments)
- torch_mlir/csrc/InitPythonBindings.cpp
- torch_mlir/test/test_export_ResA.py
- torch_mlir/test/test_jit_add_views.py
stellaraccident added a commit that referenced this pull request Aug 18, 2020
* Adds/updates readmes with some notes about code organization and direction.
* Meant to prepare a space for upcoming integration of #30.
Review comments (since resolved) on:
- frontends/pytorch/csrc/aten_mlir_bridge.cpp (4 comments)
- frontends/pytorch/csrc/aten_mlir_type.cpp
- frontends/pytorch/test/test_export_batchnorm.py
- python/npcomp/frontends/pytorch/__init__.py
- frontends/pytorch/README.md
@stephenneuendorffer force-pushed the aten branch 2 times, most recently from 8f4e4ec to 30533c0 on August 21, 2020 07:04, with the final commit message:
This patch adds a pytorch interface to npcomp.  This interface is modeled
after pytorch_xla and exposes the MLIR-based flow as a virtual device (similar
to a gpu device or the xla backend).  Usage is intended to be something like:

  dev = torch_mlir.mlir_device()
  t0 = torch.randn((4,4), device=dev)
  t1 = torch.randn((4,4), device=dev)
  t2 = t0 + t1
  t2_mlir = torch_mlir.get_mlir(t2)
  t2_cpu = t2.to('cpu')

In this case t2_cpu would contain the result of the computation, and t2_mlir
contains the mlir description of the computation.  Note that this also
properly returns backward paths synthesized by pytorch.  There are several
parts of this:

1) A tensor type (implemented by tensor.* and tensor_impl.*)
2) The device modeling (aten_mlir_bridge.*, aten_mlir_device.*, aten_mlir_type*)
3) A temporary IR (implemented by ir.cpp)

There is also a reference lowering directly from the ATen dialect to C
function calls consisting of two parts:

1) The driver that uses the IR to generate MLIR, run Passes and compile the
result using mlir::ExecutionEngine (implemented by jit.cpp and
mlir_gen.cpp)
2) A runtime library implemented by lib/aten_ops.cpp.  Most of the operations
are implemented by callbacks into the torch C++ libraries.
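The shape of that reference lowering can be sketched with a toy interpreter. This is illustrative Python only, under stated assumptions: the real runtime is C++ (lib/aten_ops.cpp) delegating to the torch C++ libraries, and the `RUNTIME` table, `execute` function, and IR record format below are hypothetical stand-ins, not this patch's API.

```python
# "Runtime library": one callback per ATen-dialect op name. The real library
# implements these by calling back into the torch C++ libraries.
RUNTIME = {
    "aten.add": lambda a, b: [x + y for x, y in zip(a, b)],
    "aten.mul": lambda a, b: [x * y for x, y in zip(a, b)],
}

def execute(ir, inputs):
    """Interpret (result, op, operand_names) records by dispatching each op
    to its runtime callback, the way the compiled module would emit one C
    function call per lowered op."""
    env = dict(inputs)
    for result, op, operands in ir:
        env[result] = RUNTIME[op](*(env[name] for name in operands))
    return env

ir = [("%0", "aten.add", ("%arg0", "%arg1")),
      ("%1", "aten.mul", ("%0", "%arg1"))]
env = execute(ir, {"%arg0": [1, 2], "%arg1": [3, 4]})
print(env["%1"])  # -> [12, 24]
```

In the actual flow, mlir::ExecutionEngine JIT-compiles the lowered module instead of interpreting it, but the contract is the same: each op resolves to a symbol in the runtime library.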

Some aspects of this are known to be less than optimal, in particular:
1) There are some function definitions that don't live in the file
corresponding to their declaration.
2) More aspects of this (e.g. the IR) seem like they should be automatically
generated.
3) It's unclear to me how much of the 'IR' is actually necessary, or whether
MLIR could be created on the fly.

Note that this code is licensed in a way similar to pytorch, with the
intention that eventually (when npcomp reaches some maturity) it should be
pushed there.  (see frontends/pytorch/LICENSE)  The code is also structured
much closer to the pytorch coding style than the LLVM coding style.
@stephenneuendorffer stephenneuendorffer merged commit 31b3041 into llvm:master Aug 21, 2020
qedawkins pushed a commit to nod-ai/torch-mlir that referenced this pull request Oct 3, 2022
* Import initialized tensor as dense attribute

* Import all initialized tensors as dense constants

* Remove unintentional code

* Fix value attribute format in shape inference tests of reshape

* Readd rank check for reshape's shape inference

* Remove a redundant variable

Co-authored-by: Gheorghe-Teodor Bercea <gt.bercea@gmail.com>
@renxida mentioned this pull request Mar 1, 2024
3 participants