Add QDQ scale propagation pass #713

javier-intel · 2025-06-16T23:38:30Z

Description

Adding pass to propagate scale values with a magnitude above a certain threshold to avoid numerical overflows.

Motivation and Context

Improve precision on certain networks

preetha-intel · 2025-06-18T16:30:04Z

onnxruntime/core/providers/openvino/backend_manager.cc

+  } else if (session_context_.device_type.find("GPU") != std::string::npos) {
+    // Create a copy of the model
+    std::unique_ptr<onnxruntime::Model> model;
+    Status status = qdq_scales_fix::Transform(subgraph, logger, model);


Is this pass happening even for non quantized models?

…des and duplicate DQ nodes

* Use infer instead of start async/wait * Introduce OvExeceptionBoundary for exception handling * unbound infer request pool * Fix dynamically sized i/o * Rename onnx->ort + remove unused parameter shape functions * fix linux build issue + review dog comments * more linux build fixes + copilot feedback * disable ReduceSum_noop_axes_input_initializer_opset_18 * review feedback + last minute touch ups * slightly more scalable llm handling * Simplify dynamic shape checks * add missing staged changes * Remove references to IO_BUFFER_ENABLED * Minor tweaks to InferRequestPool * remove unused mem_info * Move ParameterShape and ParameterInfo out of ov_interface --------- Co-authored-by: MayureshV1 <47039074+MayureshV1@users.noreply.github.com>

* feat: Enable EpContext OVIR Encapsulation * fix: refactor EpCtx OVIR parsing logic to use ep.context_file_path * fix: Fix logic for parsing model_file_path * fix: enable EPCtx OVIR encapsulation compiled blob caching * fix: fix merge conflicts * fix: fix bugs

javier-intel requested a review from preetha-intel June 16, 2025 23:40

javier-intel added 2 commits June 17, 2025 08:58

Add pass to perform QDQ stripping and propagate scales

e917ca9

Fix disconnected outptu node

4cb9374

javier-intel force-pushed the jemartin/scale_propagation branch from 3d0ca12 to 4cb9374 Compare June 17, 2025 15:59

preetha-intel reviewed Jun 18, 2025

View reviewed changes

javier-intel added 2 commits June 23, 2025 13:59

Fixes to support session.disable_quant_qdq output, remove dangling no…

334d82f

…des and duplicate DQ nodes

Fix lack of scales updates and remove stray QDQ nodes in certain models

5df5419

javier-intel requested a review from MayureshV1 June 24, 2025 16:37

ericcraw and others added 3 commits June 25, 2025 07:36

Address issues with Linux CI

635456e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add QDQ scale propagation pass #713

Add QDQ scale propagation pass #713

javier-intel commented Jun 16, 2025

Uh oh!

preetha-intel Jun 18, 2025

Uh oh!

Uh oh!

Add QDQ scale propagation pass #713

Are you sure you want to change the base?

Add QDQ scale propagation pass #713

Conversation

javier-intel commented Jun 16, 2025

Description

Motivation and Context

Uh oh!

preetha-intel Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!