
[ONNX] Compress quantize weights transformation#3662

Merged
andrey-churkin merged 12 commits into openvinotoolkit:develop from andrey-churkin:ac/compress_quantize_weights_transformation
Oct 6, 2025

Conversation


@andrey-churkin andrey-churkin commented Sep 19, 2025

Changes

Added the compress_quantize_weights_transformation() method that transforms the model by folding QuantizeLinear nodes with constant inputs into precomputed, quantized initializers.
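The folding amounts to evaluating the QuantizeLinear formula ahead of time on the constant weight tensor. A minimal NumPy sketch of that computation (the real transformation operates on ONNX initializers in the graph; `fold_quantize_linear` below is a hypothetical helper for illustration only):

```python
import numpy as np

def fold_quantize_linear(weights, scale, zero_point, qmin=-128, qmax=127):
    """Precompute the INT8 initializer that a QuantizeLinear node with
    constant inputs would produce: saturating round-half-to-even, per
    the ONNX QuantizeLinear operator semantics."""
    q = np.round(weights / scale) + zero_point  # np.round is half-to-even, matching ONNX
    return np.clip(q, qmin, qmax).astype(np.int8)

w = np.array([0.1, -0.5, 1.27, 200.0], dtype=np.float32)
q = fold_quantize_linear(w, scale=np.float32(0.01), zero_point=0)
print(q)  # the out-of-range 200.0 saturates to 127
```

After this precomputation, the float weight initializer and the QuantizeLinear node can be replaced by the INT8 tensor, leaving only the DequantizeLinear node in the graph.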

Reason for changes

  • Models after NNCF PTQ should be saved with INT8 weights for the ONNX backend.

Related tickets

Ref: 101733

Tests

  • tests/cross_fw/examples/test_examples.py[post_training_quantization_onnx_mobilenet_v2]
  • tests/onnx/test_passes.py

@andrey-churkin andrey-churkin requested a review from a team as a code owner September 19, 2025 08:09
@andrey-churkin andrey-churkin marked this pull request as draft September 19, 2025 08:09
@andrey-churkin andrey-churkin marked this pull request as ready for review September 23, 2025 07:34
@github-actions github-actions bot added the NNCF ONNX Pull requests that updates NNCF ONNX label Sep 23, 2025

andrey-churkin commented Sep 23, 2025

@github-actions github-actions bot added the NNCF PTQ Pull requests that updates NNCF PTQ label Sep 25, 2025

@daniil-lyakhov daniil-lyakhov left a comment


Minor

        copied_parameters = AdvancedQuantizationParameters()
    else:
        copied_parameters = deepcopy(advanced_quantization_parameters)
    copied_parameters.backend_params[BackendParameters.COMPRESS_WEIGHTS] = False
Collaborator


Why should we update this parameter here?

Contributor Author


We need to disable COMPRESS_WEIGHTS here to properly remove the Quantize-Dequantize pairs during the quantize_with_accuracy_control() pipeline. For reference, we do the same for the OpenVINO backend.

    copied_parameters.backend_params[BackendParameters.COMPRESS_WEIGHTS] = False
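To make the discussed pattern concrete, here is a runnable sketch. AdvancedQuantizationParameters and BackendParameters are real NNCF classes; the stubs and the `prepare_inner_params` helper below are simplified stand-ins for illustration, not the actual implementation:

```python
from copy import deepcopy
from dataclasses import dataclass, field

# Stand-in for NNCF's AdvancedQuantizationParameters; the real class
# carries many more fields.
@dataclass
class AdvancedQuantizationParameters:
    backend_params: dict = field(default_factory=dict)

# Stand-in key for BackendParameters.COMPRESS_WEIGHTS (actual value assumed).
COMPRESS_WEIGHTS = "compress_weights"

def prepare_inner_params(advanced_quantization_parameters):
    """Copy the user's parameters (or create defaults) and force weight
    compression off, so Quantize-Dequantize pairs stay removable inside
    the accuracy-control pipeline."""
    if advanced_quantization_parameters is None:
        copied_parameters = AdvancedQuantizationParameters()
    else:
        copied_parameters = deepcopy(advanced_quantization_parameters)
    copied_parameters.backend_params[COMPRESS_WEIGHTS] = False
    return copied_parameters

user_params = AdvancedQuantizationParameters(backend_params={COMPRESS_WEIGHTS: True})
inner_params = prepare_inner_params(user_params)
# The user's object is untouched; only the internal deep copy is modified.
print(user_params.backend_params[COMPRESS_WEIGHTS], inner_params.backend_params[COMPRESS_WEIGHTS])
```

The deepcopy matters here: mutating the user-supplied parameters object in place would leak an internal implementation detail back to the caller.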

    check_operation_count(quantized_model, {"QuantizeLinear": 2, "DequantizeLinear": 2})
    compress_quantize_weights_transformation(quantized_model)
Collaborator


This test covers the transformation, but I would suggest additionally testing nncf.quantize with COMPRESS_WEIGHTS: True to check that the API works as expected.

Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it is necessary here. We already have an end-to-end test (tests/cross_fw/examples/test_examples.py[post_training_quantization_onnx_mobilenet_v2]) where we compare the model's compression rate against a reference, so we would catch it there if COMPRESS_WEIGHTS: True didn't work as expected.

@andrey-churkin andrey-churkin merged commit ff287ac into openvinotoolkit:develop Oct 6, 2025
34 of 36 checks passed
andrey-churkin added a commit that referenced this pull request Oct 6, 2025
### Changes

Revert the temporary test changes that were introduced in PR
#3662