Changing the hashing methodology for cache folder creation of models. #481
base: main
Conversation
… QNN compilation not included yet. The cache folder mechanism has been modified to have a parent directory for a model based on the architecture retrieved from the model config. The hash calculation for the ONNX export now incorporates all model kwargs as well as export kwargs and parameters. The parameters used to create the hash are also dumped as a serialized JSON file in the ONNX folder; the same happens for the compile parameters inside the respective QPC folder. Signed-off-by: Dhiraj Kumar Sah <quic_dhirajku@quicinc.com>
review WIP.
QEfficient/base/modeling_qeff.py
Outdated
@@ -5,7 +5,7 @@
#
# ----------------------------------------------------------------------------

import hashlib
# import hashlib
Commented-out code. Make sure commented-out code is not present in ready-to-review PRs.
QEfficient/base/modeling_qeff.py
Outdated
self.model_params.update(kwargs)
self.model_params["config"] = self.model.config.to_diff_dict()
self.model_params["_transform_names"] = self._transform_names()
self.compile_params = {}
Initialize this only when compile is called. There is no point in creating this dictionary if the user never calls compile.
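A minimal sketch of the lazy initialization being suggested (the class and method names below are placeholders, not the actual QEfficient code):

import copy

class ExampleModel:
    def __init__(self, model, **kwargs):
        self.model = model
        self.model_params = copy.deepcopy(kwargs)
        # note: no self.compile_params created here

    def compile(self, **compiler_options):
        # created lazily, only when the user actually calls compile()
        self.compile_params = dict(compiler_options)
        return self.compile_params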
QEfficient/base/modeling_qeff.py
Outdated
self.model_params = {}
self.model_params.update(kwargs)
Better to do self.model_params = copy.deepcopy(kwargs). This lets other methods mutate kwargs freely; otherwise we would need to ensure that no other method mutates the kwargs.
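A standalone illustration of why the deep copy matters when kwargs holds nested objects (the values here are made up):

import copy

kwargs = {"generation_config": {"max_new_tokens": 32}}

shallow = {}
shallow.update(kwargs)        # still shares the nested dict with kwargs
deep = copy.deepcopy(kwargs)  # fully independent copy

kwargs["generation_config"]["max_new_tokens"] = 128
print(shallow["generation_config"]["max_new_tokens"])  # 128 -- mutated along with kwargs
print(deep["generation_config"]["max_new_tokens"])     # 32  -- unaffected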
QEfficient/base/modeling_qeff.py
Outdated
if export_kwargs is not None:
    self.model_params.update(export_kwargs)
if onnx_transform_kwargs is not None:
    self.model_params.update(onnx_transform_kwargs)
One-liners are better:
self.model_params.update(export_kwargs) if export_kwargs is not None else None
self.model_params.update(onnx_transform_kwargs) if onnx_transform_kwargs is not None else None
QEfficient/base/modeling_qeff.py
Outdated
self.model_params["output_names"] = output_names | ||
self.model_params["dynamic_axes"] = dynamic_axes |
Better to keep them one more level down, as
self.model_params["export_params"] = export_params
and add all export params to export_params, which is another dict. That makes the dumped JSON readable by the user.
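For example, a nested layout along these lines (the sample values are made up) groups the export-related keys in the dumped JSON:

import json

# Illustrative values; in the real method these come from the export arguments.
output_names = ["logits"]
dynamic_axes = {"input_ids": {0: "batch_size", 1: "seq_len"}}

model_params = {"config": {"architectures": ["LlamaForCausalLM"]}}  # existing top-level entries
model_params["export_params"] = {
    "output_names": output_names,
    "dynamic_axes": dynamic_axes,
}

print(json.dumps(model_params, indent=4))  # export keys grouped under "export_params"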
QEfficient/base/modeling_qeff.py
Outdated
model_params_json = export_dir / "model_params.json"
with open(model_params_json, "w") as fp:
    json.dump(
        {
            "model_params": [
                {k: make_serializable(self.model_params[k]) for k in sorted(self.model_params.keys())}
            ]
        },
        fp,
        indent=4,
    )
Dumping should happen after export. If the model errors out during export and we still dump the JSON, it does not make sense.
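A small sketch of the ordering being asked for, with the export call passed in as a callable so the example stays generic (the function and argument names are illustrative, not the actual method signature):

import json
from pathlib import Path

def export_then_dump(export_fn, export_dir: Path, model_params: dict) -> None:
    # Run the export first; if it raises, we never reach the dump below.
    export_fn()

    with open(export_dir / "model_params.json", "w") as fp:
        json.dump(model_params, fp, indent=4, default=str)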
self.pretrained_model_name_or_path = kwargs.get("pretrained_model_name_or_path", None)
# self.pretrained_model_name_or_path = kwargs.get("pretrained_model_name_or_path", None)
?
Signed-off-by: Dhiraj Kumar Sah <quic_dhirajku@quicinc.com>
Signed-off-by: Dhiraj Kumar Sah <quic_dhirajku@quicinc.com>
To ensure that the user cannot initialize the model directly through the modeling class's __init__, please use metaclass control to control the flow and print a warning or raise an error. For example:
class NoInitMeta(type):
    def __call__(cls, *args, **kwargs):
        raise RuntimeError("Use `from_pretrained` to create an instance.")

class MyModel(metaclass=NoInitMeta):
    def __init__(self, data):
        self.data = data

    @classmethod
    def from_pretrained(cls, path):
        instance = object.__new__(cls)
        instance.__init__(f"Loaded from {path}")
        return instance
You can read more about this here: https://stackoverflow.com/questions/100003/what-are-metaclasses-in-python.
Put that metaclass in the utils and use it for all the modeling classes.
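For instance, with the sketch above, direct construction is blocked while the classmethod path still works (the path string is just a placeholder):

model = MyModel.from_pretrained("/some/path")
print(model.data)  # Loaded from /some/path

try:
    MyModel("data")  # direct construction is rejected by the metaclass
except RuntimeError as err:
    print(err)  # Use `from_pretrained` to create an instance.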
@@ -357,6 +388,19 @@ def _compile(
logger.info(f"Running compiler: {' '.join(command)}")
try:
    subprocess.run(command, capture_output=True, check=True)

# Dumping compile paramters in a JSON file after successful QPC compilation
Remove all the code related to compile_param_json from here, including the dumping, and handle it inside the dump_qconfig decorator. Let's keep the base methods clean.
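A minimal sketch of the kind of decorator being suggested, usable for both the export-side and compile-side dumping; the decorator name, its arguments, and the attribute names are illustrative and do not reflect the actual dump_qconfig implementation:

import functools
import json
from pathlib import Path

def dump_params_json(params_attr: str, dir_attr: str, filename: str):
    # After the wrapped method succeeds, write self.<params_attr> to
    # <self.<dir_attr>>/<filename> as pretty-printed JSON.
    def decorator(method):
        @functools.wraps(method)
        def wrapper(self, *args, **kwargs):
            result = method(self, *args, **kwargs)  # run _export/_compile first
            params = getattr(self, params_attr, {})
            target_dir = Path(getattr(self, dir_attr))
            with open(target_dir / filename, "w") as fp:
                json.dump(params, fp, indent=4, default=str)
            return result

        return wrapper

    return decorator

It could then be applied as, e.g., @dump_params_json("compile_params", "qpc_path", "compile_params.json") on _compile (attribute names again assumed), keeping the method body free of serialization code.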
model_params_json = export_dir / "model_params.json"
with open(model_params_json, "w") as fp:
    json.dump(
        {
Same as with compile: create a decorator and handle all these param updates and dumping inside it.
export_params["output_names"] = output_names | ||
export_params["dynamic_axes"] = dynamic_axes | ||
|
||
self.model_params["export_params"] = export_params |
Handle this in the decorator. Let's keep our base methods clean.
Agree, let's write a decorator implementation to handle this.
Detaching the hash function for model cache path calculation. Changes for QNN compilation are not included yet.
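For illustration only, a minimal sketch of what a hash over the serialized model and export parameters could look like (the helper name, the use of SHA-256, and the truncation length are assumptions, not the PR's actual implementation):

import hashlib
import json

def compute_export_hash(model_kwargs: dict, export_kwargs: dict, export_params: dict) -> str:
    # Hypothetical helper: combine the parameter dictionaries, serialize them
    # deterministically, and hash the result to name the ONNX cache folder.
    combined = {
        "model_kwargs": model_kwargs,
        "export_kwargs": export_kwargs,
        "export_params": export_params,
    }
    # sort_keys + default=str gives a stable, serializable representation
    serialized = json.dumps(combined, sort_keys=True, default=str)
    return hashlib.sha256(serialized.encode("utf-8")).hexdigest()[:16]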