[Tosa] : Fix integer overflow for computing intmax+1 in tosa.cast to linalg. #112455

Open · wants to merge 3 commits into main
Conversation

sahas3 (Contributor) commented Oct 16, 2024

This PR fixes an integer overflow that occurred when computing (intmax + 1) for i64 during the tosa-to-linalg pass for tosa.cast.

Found this issue while debugging a numerical mismatch for the deeplabv3 model from torchvision, represented in the tosa dialect via the TorchToTosa pipeline in the torch-mlir repository. torch.aten.to.dtype is converted to a tosa.cast that casts f32 to i64. Strictly by the specification, tosa.cast doesn't support casting f32 to i64, so it would be possible to add a verifier that errors out on such tosa ops instead of producing incorrect code. However, I chose to fix the overflow so that the deeplabv3 model can still be represented with tosa ops in the above-mentioned pipeline. Open to suggestions if adding the verifier is more appropriate instead.

llvmbot (Collaborator) commented Oct 16, 2024

@llvm/pr-subscribers-mlir-linalg
@llvm/pr-subscribers-mlir-tosa

@llvm/pr-subscribers-mlir

Author: Sayan Saha (sahas3)

Changes



Full diff: https://github.com/llvm/llvm-project/pull/112455.diff

2 Files Affected:

  • (modified) mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp (+1-1)
  • (modified) mlir/test/Conversion/TosaToLinalg/tosa-to-linalg.mlir (+27)
diff --git a/mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp b/mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
index c88f4db27c304e..e6b3e4b677e4f2 100644
--- a/mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
+++ b/mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
@@ -563,7 +563,7 @@ static Value createLinalgBodyCalculationForElementwiseOp(
                    getElementTypeOrSelf(srcTy),
                    APInt::getSignedMaxValue(dstTy.getIntOrFloatBitWidth())
                            .getSExtValue() +
-                       1));
+                       1.0));
 
       auto intMax = rewriter.create<arith::ConstantOp>(
           loc, rewriter.getIntegerAttr(
diff --git a/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg.mlir b/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg.mlir
index f9d37f9427d4f4..7e2ec67d38d378 100644
--- a/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg.mlir
+++ b/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg.mlir
@@ -1929,3 +1929,30 @@ func.func @test_dynamic_fft2d(%arg0: tensor<?x?x?xf32>, %arg1: tensor<?x?x?xf32>
   %output_real, %output_imag = "tosa.fft2d"(%arg0, %arg1) {inverse = true} : (tensor<?x?x?xf32>, tensor<?x?x?xf32>) -> (tensor<?x?x?xf32>, tensor<?x?x?xf32>)
   return %output_real, %output_imag : tensor<?x?x?xf32>, tensor<?x?x?xf32>
 }
+
+
+// -----
+
+// CHECK: #[[$MAP0:.+]] = affine_map<(d0) -> (0)>
+// CHECK: #[[$MAP1:.+]] = affine_map<(d0) -> (d0)>
+
+// CHECK-LABEL:   func.func @test_cast_fp32_i64(
+// CHECK-SAME:                                  %[[ARG0:.*]]: tensor<1xf32>) -> tensor<1xi64> {
+// CHECK:           %[[VAL_0:.*]] = tensor.empty() : tensor<1xi64>
+// CHECK:           %[[RESULT:.*]] = linalg.generic {indexing_maps = [#[[$MAP0]], #[[$MAP1]]], iterator_types = ["parallel"]} ins(%[[ARG0]] : tensor<1xf32>) outs(%[[VAL_0]] : tensor<1xi64>) {
+// CHECK:           ^bb0(%[[VAL_2:.*]]: f32, %[[VAL_3:.*]]: i64):
+// CHECK:             %[[VAL_4:.*]] = math.roundeven %[[VAL_2]] : f32
+// CHECK:             %[[VAL_5:.*]] = arith.constant -9.22337203E+18 : f32
+// CHECK:             %[[VAL_6:.*]] = arith.constant 9.22337203E+18 : f32
+// CHECK:             %[[VAL_7:.*]] = arith.constant 9223372036854775807 : i64
+// CHECK:             %[[VAL_8:.*]] = arith.maximumf %[[VAL_4]], %[[VAL_5]] : f32
+// CHECK:             %[[VAL_9:.*]] = arith.fptosi %[[VAL_8]] : f32 to i64
+// CHECK:             %[[VAL_10:.*]] = arith.cmpf uge, %[[VAL_4]], %[[VAL_6]] : f32
+// CHECK:             %[[VAL_11:.*]] = arith.select %[[VAL_10]], %[[VAL_7]], %[[VAL_9]] : i64
+// CHECK:             linalg.yield %[[VAL_11]] : i64
+// CHECK:           } -> tensor<1xi64>
+// CHECK:           return %[[RESULT]] : tensor<1xi64>
+func.func @test_cast_fp32_i64(%arg0: tensor<1xf32>) -> (tensor<1xi64>) {
+  %0 = tosa.cast %arg0 : (tensor<1xf32>) -> tensor<1xi64>
+  return %0: tensor<1xi64>
+}

sahas3 (Contributor, Author) commented Oct 16, 2024

Without the fix, the IR after tosa-to-linalg for

func.func @test_cast_fp32_i64(%arg0: tensor<1xf32>) -> (tensor<1xi64>) {
  %0 = tosa.cast %arg0 : (tensor<1xf32>) -> tensor<1xi64>
  return %0: tensor<1xi64>
}

was

#map = affine_map<(d0) -> (0)>
#map1 = affine_map<(d0) -> (d0)>
module {
  func.func @test_cast_fp32_i64(%arg0: tensor<1xf32>) -> tensor<1xui64> {
    %0 = tensor.empty() : tensor<1xi64>
    %1 = linalg.generic {indexing_maps = [#map, #map1], iterator_types = ["parallel"]} ins(%arg0 : tensor<1xf32>) outs(%0 : tensor<1xi64>) {
    ^bb0(%in: f32, %out: i64):
      %3 = math.roundeven %in : f32
      %cst = arith.constant -9.22337203E+18 : f32
      %cst_0 = arith.constant -9.22337203E+18 : f32
      %c9223372036854775807_i64 = arith.constant 9223372036854775807 : i64
      %4 = arith.maximumf %3, %cst : f32
      %5 = arith.fptosi %4 : f32 to i64
      %6 = arith.cmpf uge, %3, %cst_0 : f32
      %7 = arith.select %6, %c9223372036854775807_i64, %5 : i64
      linalg.yield %7 : i64
    } -> tensor<1xi64>
    %2 = builtin.unrealized_conversion_cast %1 : tensor<1xi64> to tensor<1xui64>
    return %2 : tensor<1xui64>
  }
}

@CoTinker CoTinker requested a review from RoboTux October 16, 2024 01:30
GeorgeARM (Contributor) commented Oct 16, 2024

So it's possible to add a verifier to error out for such tosa ops instead of producing incorrect code. However, I chose to fix the overflow issue to still be able to represent the deeplabv3 model with tosa ops in the above-mentioned pipeline. Open to suggestions if adding the verifier is more appropriate instead.

@sahas3 IMHO it is better to fix it; my expectation is that at some point TOSA will get i64 support. Moreover, as you mentioned, it is better to get models legalizing than to explicitly restrict them.

Please also wait for @RoboTux to review the patch before merging.


github-actions bot commented Oct 16, 2024

✅ With the latest revision this PR passed the C/C++ code formatter.
