Canonicalizers for doubly strided ops such as npu.dma_cpy_nd #680

newling · 2024-08-15T18:23:36Z

Doubly strided ops have, for both source and target, and for all of offsets/sizes/strides, 2 fields:

a vector of dynamic Values
a vector of static integers

For example it might be

dynamic = {v0}
static = {16, kDynamic, 32, 64}

The size of dynamic is always exactly equal to the number of appearances of kDynamic in the static vector. This pass does the following: it checks if any of the Values in dynamic are actually MLIR constants, and if they are then it removes the Value from the dynamic vector and updates the corresponding index in static. So for example if v0 above is actually

%v0 = arith.constant 6 : index

then this canonicalization updates dynamic/static to

dynamic = {}
static = {16, 6, 32, 64}

MaheshRavishankar

There is a canonicalizer upstream here that does this, but it is a bit hard-coded for tensor.extract_slice like op that have a single offset , size and stride. It should be split up to handle each of the offset , size and stride separately. Then you could reuse that here. Maybe just use the same style here to be consistent

jtuyls

Nice, should help make IR more compact and readable! Would be great to do for all doubly strided ops in one go I think.

MaheshRavishankar · 2024-08-16T08:23:01Z

Actually, if your wait for this PR (llvm/llvm-project#104488) it might get even easier

newling · 2024-08-16T18:25:46Z

@MaheshRavishankar thanks for the pointer, I've changed to use the same style for now.
@jtuyls ok, will get this working for all the doubly strided ops

jtuyls

LGTM, just a few small nits

jtuyls · 2024-08-19T12:25:42Z

compiler/plugins/target/AMD-AIE/iree-amd-aie/IR/AMDAIEOps.cpp

@@ -93,6 +90,70 @@ TileOp CoreOp::getTileOp() {
 // AMDAIE_DmaCpyNdBaseOp
 //===----------------------------------------------------------------------===//

+namespace {
+// Simplified from upstream MLIR's foldDynamicIndexList:


Suggested change

// Simplified from upstream MLIR's foldDynamicIndexList:

/// Simplified from upstream MLIR's foldDynamicIndexList:

Nit, but IREE uses /// for function/class comments.

jtuyls · 2024-08-19T12:42:53Z

compiler/plugins/target/AMD-AIE/iree-amd-aie/IR/AMDAIEOps.cpp

+// Based on upstream MLIR's
+// OpWithOffsetSizesAndStridesConstantArgumentFolder


Suggested change

// Based on upstream MLIR's

// OpWithOffsetSizesAndStridesConstantArgumentFolder

/// Based on upstream MLIR's

/// OpWithOffsetSizesAndStridesConstantArgumentFolder

Nit, but IREE uses /// for function/class comments.

jtuyls · 2024-08-19T12:43:37Z

compiler/plugins/target/AMD-AIE/iree-amd-aie/IR/AMDAIEOps.cpp

+// Build a NpuDmaCpyNdOp with mixed static and dynamic entries and target
+// and source BD IDs.


Suggested change

// Build a NpuDmaCpyNdOp with mixed static and dynamic entries and target

// and source BD IDs.

/// Build a NpuDmaCpyNdOp with mixed static and dynamic entries and target

/// and source BD IDs.

newling · 2024-08-19T16:52:08Z

LGTM, just a few small nits

Ok I'll use /// by default in future. As this particular file uses both /// and // quite a lot I'm not going to update this PR.

newling requested review from MaheshRavishankar, nirvedhmeshram, yzhang93, Abhishek-Varma and jtuyls as code owners August 15, 2024 18:23

newling changed the title ~~Canonicalize npu.dma_cpy_nd~~ Canonicalizer for npu.dma_cpy_nd Aug 15, 2024

MaheshRavishankar reviewed Aug 16, 2024

View reviewed changes

jtuyls reviewed Aug 16, 2024

View reviewed changes

newling force-pushed the npu_dma_cpy_nd_canonicalizer branch from 3f0ca36 to 1de7f26 Compare August 16, 2024 18:23

newling changed the title ~~Canonicalizer for npu.dma_cpy_nd~~ Canonicalizers for doubly strided ops such as npu.dma_cpy_nd Aug 16, 2024

squashed commit

a0a4bad

newling force-pushed the npu_dma_cpy_nd_canonicalizer branch from 8e99191 to a0a4bad Compare August 16, 2024 20:29

newling requested a review from jtuyls August 16, 2024 21:34

jtuyls approved these changes Aug 19, 2024

View reviewed changes

newling merged commit aa112f7 into nod-ai:main Aug 19, 2024
2 checks passed

newling deleted the npu_dma_cpy_nd_canonicalizer branch August 29, 2024 18:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Canonicalizers for doubly strided ops such as npu.dma_cpy_nd #680

Canonicalizers for doubly strided ops such as npu.dma_cpy_nd #680

newling commented Aug 15, 2024 •

edited

Loading

MaheshRavishankar left a comment

jtuyls left a comment

MaheshRavishankar commented Aug 16, 2024

newling commented Aug 16, 2024

jtuyls left a comment

jtuyls Aug 19, 2024

jtuyls Aug 19, 2024

jtuyls Aug 19, 2024

newling commented Aug 19, 2024

	// Simplified from upstream MLIR's foldDynamicIndexList:
	/// Simplified from upstream MLIR's foldDynamicIndexList:

		// Based on upstream MLIR's
		// OpWithOffsetSizesAndStridesConstantArgumentFolder

		// Build a NpuDmaCpyNdOp with mixed static and dynamic entries and target
		// and source BD IDs.

Canonicalizers for doubly strided ops such as npu.dma_cpy_nd #680

Canonicalizers for doubly strided ops such as npu.dma_cpy_nd #680

Conversation

newling commented Aug 15, 2024 • edited Loading

MaheshRavishankar left a comment

Choose a reason for hiding this comment

jtuyls left a comment

Choose a reason for hiding this comment

MaheshRavishankar commented Aug 16, 2024

newling commented Aug 16, 2024

jtuyls left a comment

Choose a reason for hiding this comment

jtuyls Aug 19, 2024

Choose a reason for hiding this comment

jtuyls Aug 19, 2024

Choose a reason for hiding this comment

jtuyls Aug 19, 2024

Choose a reason for hiding this comment

newling commented Aug 19, 2024

newling commented Aug 15, 2024 •

edited

Loading