
Updated patch for NNCF integration to the latest transformers release #1328

Merged
1 commit merged into openvinotoolkit:develop on Oct 19, 2022

Conversation

ljaljushkin
Contributor

Changes

Updated patch for NNCF integration to the latest transformers release (v4.23.1)

Reason for changes

For smoother integration of JPQD #1319.

Related tickets

94449

Tests

3rd party sanity

@github-actions bot added the documentation (Improvements or additions to documentation) and NNCF PT (Pull requests that updates NNCF PyTorch) labels on Oct 18, 2022
@ljaljushkin requested a review from vshampor on October 18, 2022 20:17
Comment on lines +1329 to +1333
- default_label_names = find_labels(self.model.__class__)
+ model_class = self.model.__class__
+ if isinstance(self.model, NNCFNetwork):
+     model_class = self.model.get_nncf_wrapped_model().__class__
+ default_label_names = find_labels(model_class)
ljaljushkin (Contributor Author):

In the new version, the label names are derived from the model's signature, which is why the NNCFNetwork has to be unwrapped to the original model class first.
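
For context, a minimal sketch (a simplification, not the exact transformers implementation) of how label names are taken from the forward() signature, which is what breaks for a wrapper class with a generic forward():

```python
import inspect

from transformers import BertForSequenceClassification

def find_label_names(model_class):
    # Simplified version of what transformers.utils.find_labels does:
    # collect forward() parameters whose names contain "label".
    signature = inspect.signature(model_class.forward)
    return [name for name in signature.parameters if "label" in name]

print(find_label_names(BertForSequenceClassification))  # ['labels']
```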

model.zero_grad()
self.state.global_step += 1
self.state.epoch = epoch + (step + 1) / steps_in_epoch
-+ self.state.curr_loss = curr_loss.cpu().detach().item()
++ self.state.curr_loss = tr_loss_step.cpu().detach().item()
ljaljushkin (Contributor Author):

tr_loss and tr_loss_step are explicitly separated in the new trainer code, so the patch now reads tr_loss_step.
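
A minimal runnable sketch (the loop shape here is assumed from the description above, not copied from transformers) of why the patch records tr_loss_step:

```python
import torch

# tr_loss accumulates the running total, while tr_loss_step holds the loss of
# the current step only; the NNCF patch stores the per-step value on the state.
tr_loss = torch.tensor(0.0)
for tr_loss_step in (torch.tensor(0.50), torch.tensor(0.25)):  # stand-ins for training_step() results
    tr_loss += tr_loss_step
    curr_loss = tr_loss_step.cpu().detach().item()
print(tr_loss.item(), curr_loss)  # 0.75 0.25
```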

Comment on lines +1450 to +1455
+++ b/src/transformers/utils/__init__.py
@@ -154,6 +154,7 @@ from .import_utils import (


WEIGHTS_NAME = "pytorch_model.bin"
+NNCF_PT_STATE_NAME = "nncf_state.bin"
ljaljushkin (Contributor Author):

file_utils.py now only does backward-compatibility imports.

DISABLE_TELEMETRY = os.getenv("DISABLE_TELEMETRY", False) in ENV_VARS_TRUE_VALUES

WEIGHTS_NAME = "pytorch_model.bin"
+NNCF_PT_STATE_NAME = "nncf_state.bin"
ljaljushkin (Contributor Author):

That was moved to utils/__init__.py
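
A quick illustrative check (assuming transformers v4.23.x is installed) that the old import path still resolves through the backward-compatibility re-export, which is why the constant is now added to utils/__init__.py instead:

```python
# Both import paths point at the same constant; file_utils.py only re-exports
# what is defined in transformers/utils/__init__.py.
from transformers.file_utils import WEIGHTS_NAME as legacy_name
from transformers.utils import WEIGHTS_NAME as new_name

assert legacy_name == new_name == "pytorch_model.bin"
```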

@@ -1205,6 +1146,9 @@ index 000000000..edbe0f84d
+ "initializer": {
+ "range": {
+ "num_init_samples": 24
+ },
+ "batchnorm_adaptation": {
+ "num_bn_adaptation_samples": 0
ljaljushkin (Contributor Author):

BatchNorm adaptation is useless for transformers because they contain no BatchNorm layers. The only way to turn it off is to set the number of adaptation samples to zero, which at least saves some validation time.
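
For illustration, a minimal NNCF config sketch with that initializer section (the input shape and surrounding fields are placeholders, not taken from the PR):

```python
from nncf import NNCFConfig

nncf_config = NNCFConfig.from_dict({
    "input_info": {"sample_size": [1, 128]},  # placeholder input shape
    "compression": {
        "algorithm": "quantization",
        "initializer": {
            "range": {"num_init_samples": 24},
            # No BatchNorm layers in transformers, so skip BN adaptation entirely.
            "batchnorm_adaptation": {"num_bn_adaptation_samples": 0},
        },
    },
})
```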

# Model has labels -> use them.
- if model.config.label2id != PretrainedConfig(num_labels=num_labels).label2id:
-     if list(sorted(model.config.label2id.keys())) == list(sorted(label_list)):
+ if config.label2id != PretrainedConfig(num_labels=num_labels).label2id:
ljaljushkin (Contributor Author):

The config is only passed to the model later, so at this point the check has to use `config` rather than `model.config`.
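
A minimal sketch of that flow (the checkpoint name and label set here are hypothetical, just for illustration):

```python
from transformers import AutoConfig, AutoModelForTokenClassification, PretrainedConfig

label_list = ["O", "B-ENT", "I-ENT"]  # hypothetical label set
num_labels = len(label_list)

# The config exists before the model, so the label2id check runs on `config`.
config = AutoConfig.from_pretrained("distilbert-base-cased", num_labels=num_labels)
if config.label2id != PretrainedConfig(num_labels=num_labels).label2id:
    print("checkpoint defines its own label mapping")

# The config is passed to the model only afterwards.
model = AutoModelForTokenClassification.from_pretrained("distilbert-base-cased", config=config)
```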

- AutoConfig,
- AutoModelForTokenClassification,
- AutoTokenizer,
- DataCollatorForTokenClassification,
ljaljushkin (Contributor Author):

Just didn't reorder the imports, to keep consistency with the other samples.

@@ -204,7 +204,8 @@ def test_glue_distilbert_eval(self, temp_folder):

     @pytest.mark.dependency(depends=['install_trans'], name='lm_train')
     def test_lm_train(self, temp_folder):
-        com_line = "examples/pytorch/language-modeling/run_clm.py --model_name_or_path gpt2" \
+        # GPT2 is loaded via torch.frombuffer which is not available in torch==1.9.1 yet
+        com_line = "examples/pytorch/language-modeling/run_clm.py --model_name_or_path distilgpt2" \
ljaljushkin (Contributor Author):

GPT2 is loaded via safetensors functionality that uses torch.frombuffer, which only appeared in torch 1.10, while we support torch==1.9.1 only.
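
A small check illustrating the version gap (the fallback below is only illustrative, not the fix used in the PR; the sanity test switches to distilgpt2 instead):

```python
import torch

raw = bytearray(b"\x01\x02\x03\x04")
if hasattr(torch, "frombuffer"):           # available since torch 1.10
    t = torch.frombuffer(raw, dtype=torch.uint8)
else:                                      # torch 1.9.x has no frombuffer
    t = torch.tensor(list(raw), dtype=torch.uint8)
print(torch.__version__, t)
```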

@@ -21,7 +21,7 @@
from tests.common.helpers import PROJECT_ROOT
from tests.torch.helpers import Command

TRANSFORMERS_COMMIT = "bff1c71e84e392af9625c345f9ea71f7b6d75fb3"
TRANSFORMERS_COMMIT = "bd469c40659ce76c81f69c7726759d249b4aef49"
ljaljushkin (Contributor Author):

The new commit corresponds to the v4.23.1 tag.

@ljaljushkin marked this pull request as ready for review on October 19, 2022 07:54
@ljaljushkin (Contributor Author):

@vshampor 3rd party sanity tests are green (Build 23). The only degradation is the gpt2 model, which can't be loaded with PyTorch 1.9.1; the sanity tests use 'distilgpt2' instead, which doesn't have this issue with that PyTorch version.

@vshampor merged commit 66f2681 into openvinotoolkit:develop on Oct 19, 2022
ljaljushkin referenced this pull request in vuiseng9/nncf Oct 19, 2022