
Conversation

metascroy (Contributor)

dequantize_affine does not work correctly before iOS18, when the op gets translated to constexpr_affine_dequantize instead of constexpr_blockwise_shift_scale.

This is because axis should be None (the default value) instead of -1; with None, the axis gets inferred from the shape of the scales.

This is similar to the change in the ExecuTorch PR: pytorch/executorch#13896
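
Concretely, a hedged sketch of the fix at the call site (the helper name comes from this PR; the variable names and surrounding call are illustrative, not the actual source):

```python
# Illustrative only: pass axis=None so the helper infers the axis from
# the shape of `scale` when lowering to constexpr_affine_dequantize.
output = _utils._construct_constexpr_dequant_op(
    quantized_weights,  # hypothetical names for the op inputs
    zero_point,
    scale,
    axis=None,  # was -1, which is wrong for constexpr_affine_dequantize
)
```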

metascroy (Contributor, Author)

@YifanShenSZ can you have a look?

YifanShenSZ (Collaborator)

Hey Scott, first, to confirm your use case: do you want weight-only quantization or activation quantization?

Some background about Core ML ops that might be helpful (see the sketch after the list):

  1. The constexpr_ ops are for weight-only quantization; that's why they are "const expression"s: the quantized weights are known at compile time
  2. Among the constexpr ops, blockwise shift scale was new in iOS18, while affine dequantize was new in iOS16
  3. Dequantize is for activation quantization, and was new in iOS17
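
As an illustration of point 1, a minimal MIL sketch of the weight-only pattern, assuming the usual builder entry points (shapes and values are arbitrary):

```python
import numpy as np
import coremltools as ct
from coremltools.converters.mil import Builder as mb

# A 4x4 int8 weight is dequantized at compile time (constexpr) and fed
# into linear; only the quantized bytes are stored in the model.
@mb.program(input_specs=[mb.TensorSpec(shape=(1, 4))], opset_version=ct.target.iOS16)
def prog(x):
    w = mb.constexpr_affine_dequantize(
        quantized_data=np.zeros((4, 4), dtype=np.int8),
        zero_point=np.int8(0),
        scale=np.float32(0.1),
        axis=0,
    )
    return mb.linear(x=x, weight=w)
```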

metascroy (Contributor, Author)

@YifanShenSZ this is for weight-only quantization, which is represented as dequantize_affine (applied to the weights) followed by linear/embedding.

The existing registration works for iOS18 (blockwise shift scale), but has a bug in affine_dequantize (iOS16). The bug is that the axis is incorrect. This doesn't affect iOS18 because axis is not used in blockwise shift scale.

> Dequantize is for activation quantization, and was new in iOS17

I assume "quantize" here is also used for activation quantization?

YifanShenSZ (Collaborator)

> @YifanShenSZ this is for weight-only quantization, which is represented as dequantize_affine (applied to the weights) followed by linear/embedding.
>
> The existing registration works for iOS18 (blockwise shift scale), but has a bug in affine_dequantize (iOS16). The bug is that the axis is incorrect. This doesn't affect iOS18 because axis is not used in blockwise shift scale.

I see. So in this case, you will want to (see the sketch after this list):

  1. For iOS 18 and higher, translate to constexpr_blockwise_shift_scale
  2. For iOS 16 and 17, translate to constexpr_affine_dequantize
  3. For iOS 15 and earlier, error out
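
A minimal sketch of that dispatch (illustrative only; the target enum values compare as integers in coremltools):

```python
import coremltools as ct

def weight_dequant_op_for(minimum_deployment_target):
    """Illustrative mapping from deployment target to the constexpr op."""
    if minimum_deployment_target >= ct.target.iOS18:
        return "constexpr_blockwise_shift_scale"
    if minimum_deployment_target >= ct.target.iOS16:
        return "constexpr_affine_dequantize"
    raise ValueError("weight-only quantization requires iOS16 or later")
```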

> Dequantize is for activation quantization, and was new in iOS17

> I assume "quantize" here is also used for activation quantization?

Yeah, I mean quantize and dequantize, which are for activation quantization (i.e. they quantize / dequantize variables) and were introduced in iOS 17.
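
For contrast, a hedged MIL sketch of activation quantization with those iOS17 ops; the builder signatures here are assumed, so treat it as illustrative:

```python
import numpy as np
import coremltools as ct
from coremltools.converters.mil import Builder as mb

# quantize/dequantize operate on runtime variables (activations), unlike
# the constexpr_ ops, which act on compile-time weights.
@mb.program(input_specs=[mb.TensorSpec(shape=(2, 3))], opset_version=ct.target.iOS17)
def prog(x):
    q = mb.quantize(input=x, scale=np.float32(0.5), output_dtype="int8")
    return mb.dequantize(input=q, scale=np.float32(0.5))
```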

metascroy (Contributor, Author) commented Sep 3, 2025

> @YifanShenSZ this is for weight-only quantization, which is represented as dequantize_affine (applied to the weights) followed by linear/embedding.
> The existing registration works for iOS18 (blockwise shift scale), but has a bug in affine_dequantize (iOS16). The bug is that the axis is incorrect. This doesn't affect iOS18 because axis is not used in blockwise shift scale.

> I see. So in this case, you will want to:
>
>   1. For iOS 18 and higher, translate to constexpr_blockwise_shift_scale
>   2. For iOS 16 and 17, translate to constexpr_affine_dequantize
>   3. For iOS 15 and earlier, error out

> Dequantize is for activation quantization, and was new in iOS17

> I assume "quantize" here is also used for activation quantization?

> Yeah, I mean quantize and dequantize, which are for activation quantization (i.e. they quantize / dequantize variables) and were introduced in iOS 17.

Yes, but the function _utils._construct_constexpr_dequant_op already does this translation. I'm just fixing the axis argument passed to it (axis is only used by _utils._construct_constexpr_dequant_op before iOS18, where it translates the op to constexpr_affine_dequantize; it is not used for iOS18, where the op gets translated to constexpr_blockwise_shift_scale). Since CI only tests on iOS18, I fixed the axis argument and added a test for iOS16.
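
For reference, a paraphrase of that behavior (not the actual coremltools source; the version check and axis-inference helper are hypothetical stand-ins):

```python
# Paraphrase of the described behavior, not the real implementation.
def _construct_constexpr_dequant_op(quantized_weights, zero_point, scale, axis=None):
    if opset_is_at_least_ios18():  # hypothetical version check
        # iOS18+: axis is ignored; the block structure comes from scale's shape
        return mb.constexpr_blockwise_shift_scale(
            data=quantized_weights, scale=scale, offset=zero_point
        )
    # iOS16/17: axis matters; with axis=None it is inferred from scale's
    # shape (the bug was hard-coding axis=-1 at the call site)
    return mb.constexpr_affine_dequantize(
        quantized_data=quantized_weights,
        zero_point=zero_point,
        scale=scale,
        axis=axis if axis is not None else infer_axis_from(scale),  # hypothetical helper
    )
```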

YifanShenSZ (Collaborator) commented Sep 4, 2025

Got it, then yes this is correct. Let's hear what CI has to say

CI 🔴 https://gitlab.com/coremltools1/coremltools/-/commit/c47235b01c27d101c373ce06b03f1758d3457557/pipelines

```python
@pytest.mark.skipif(not _HAS_TORCHAO, reason=MSG_TORCHAO_NOT_FOUND)
@pytest.mark.parametrize(
    "compute_unit, has_zeros",
    itertools.product(compute_units, [True, False], [ct.target.IOS16, ct.target.IOS17]),
)
```

nit: lower case "i", i.e. iOS

YifanShenSZ (Collaborator) commented Sep 5, 2025


```python
@pytest.mark.skipif(not _HAS_TORCHAO, reason=MSG_TORCHAO_NOT_FOUND)
@pytest.mark.parametrize(
    "compute_unit, has_zeros",
```

nit: you might have missed minimum_deployment_target?
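
Putting both nits together, the parametrization would presumably become:

```python
@pytest.mark.skipif(not _HAS_TORCHAO, reason=MSG_TORCHAO_NOT_FOUND)
@pytest.mark.parametrize(
    "compute_unit, has_zeros, minimum_deployment_target",
    itertools.product(compute_units, [True, False], [ct.target.iOS16, ct.target.iOS17]),
)
```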

YifanShenSZ (Collaborator)

@metascroy please rebase on top of latest main + address comment, then let's try CI again

YifanShenSZ (Collaborator)

CI 🔴 https://gitlab.com/coremltools1/coremltools/-/commit/30b74a15f3f3d6fef6b7ffbbfb1b79b2fcb0bfb6/pipelines

Because we support torch up to 2.7.1 😮‍💨, please rebase on top of latest main to pick up #2591.

metascroy force-pushed the fix-torchao-dequant-ios16 branch from 9721717 to fb4eb52 on September 25, 2025 at 20:33
metascroy (Contributor, Author)

> CI 🔴 https://gitlab.com/coremltools1/coremltools/-/commit/30b74a15f3f3d6fef6b7ffbbfb1b79b2fcb0bfb6/pipelines
>
> Because we support torch up to 2.7.1 😮‍💨, please rebase on top of latest main to pick up #2591.

Rebased.
