Pr nncf jpqd with bert example only #2

yujiepan-work · 2023-02-13T08:25:00Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

…nd distillation via NNCF

#2) * use nncf official branch for install since JPQD is merged * copy ac scripts from transformer repo * init commit for wav2vec2 * add onnx_config argument in OVTrainer for onnx export with unsupported model * enable customized teancher kd * add readme * delete debugging lines

…port_option

* test part 1 * clean "compute_distillation_loss' in OVTrainer * add test of OVTrainer for int8+kd / movement / movement+int8/ movement+int8+kd * add expectedFailuremark to test of OVModelForAudioClassification * revert unncessary codes about "OVModelForAudioClassification" * change to a shorter train for w2v2 in readme * revert compute_metrics change since it is not unnecessary * fix task_loss non-scalar bug for kd logging * make regex clearer in QA bert config

* MO IR pruning depends on scheduler stage * Readme tweaks for all example tasks * Minor tweak on tests * Align setup.py for openvino-dev and nncf versions needed for JPQD

* copy text-classification example from transformers * init draft for sst example * update sst2 accuracy & training time

… quantization config

…oint compression variations, misc. patches * use builtin onnx configs for wav2vec onnx export * move teacher model argument from OVTrainingArgs to model args * fix duplicate call of `epoch_step` * temporal workaround about compression metrics * test for all training cases * temporal workaround for eval only * cover train/eval tests * style fix

…training loss check (#6) * removing old tests in test_quantization since they are now in `test_training` * bugfix in checking compression metrics during training

HuggingFaceDocBuilderDev · 2023-02-13T08:30:36Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

… dist. reduce)

…uggingface#150) * Initial commit to enable OVTrainer with joint pruning, quantization and distillation via NNCF * Review OpenVINO Q&A readme and configs * Update README.md * Add post init value checker to OVTrainingArguments * Initial enabling of audio classification/wav2vec2 [tests not included] (#2) * use nncf official branch for install since JPQD is merged * copy ac scripts from transformer repo * init commit for wav2vec2 * add onnx_config argument in OVTrainer for onnx export with unsupported model * enable customized teancher kd * add readme * delete debugging lines * Update openvino-dev and nncf version in setup.py * refactor _enable_standard_onnx_export_option to _set_standard_onnx_export_option * add tests for (movement/quantization) with distillation (#3) * test part 1 * clean "compute_distillation_loss' in OVTrainer * add test of OVTrainer for int8+kd / movement / movement+int8/ movement+int8+kd * add expectedFailuremark to test of OVModelForAudioClassification * revert unncessary codes about "OVModelForAudioClassification" * change to a shorter train for w2v2 in readme * revert compute_metrics change since it is not unnecessary * fix task_loss non-scalar bug for kd logging * make regex clearer in QA bert config * Refactor compression-related logging * Refactor OpenVINO IR generation and patch tests * Miscellaneous refactoring * MO IR pruning depends on scheduler stage * Readme tweaks for all example tasks * Minor tweak on tests * Align setup.py for openvino-dev and nncf versions needed for JPQD * Fix lint with Black * Refactor OpenVINO IR generation using python api * Fix via isort * Handle IR generation error to avoid run termination * Update QA readme * Enable distillation on openvino's image classification example * Minor refactoring in openvino's audio classification example * Move openvino-dev dependency to be extra of NNCF * Configure IR model to accept dynamic-shaped input * Revert _enable_standard_onnx_export_option method in OVConfig * Update wav2vec2 configs for audio classification * Add BERT-base/glue-sst2 example with QAT / JPQD (#4) * copy text-classification example from transformers * init draft for sst example * update sst2 accuracy & training time * Revise wav2vec2 config and audio classification readme * Patch _enable_standard_onnx_export_option to only add the key pair to quantization config * Set logging level to INFO in openvino/trainer.py * Review readme of text and image classification * Revert IR generation with static input shape for joint compression * Add distillation and advance optimization section in optimization_ov.mdx * Patch tests * Revise formatting of optimization_ov.mdx * Limit #checkpoint saved for JPQD samples * Handle NNCF output to text log and only print errors to stdout * Replace hardcoded model.onnx filename with constant variable * Fix movement sparsity config in optimization_ov.mdx * Change _set_feature to _set_task to align with OVQuantizer * Revert onnx_config exposure in OVTrainer, expand test coverages for joint compression variations, misc. patches * use builtin onnx configs for wav2vec onnx export * move teacher model argument from OVTrainingArgs to model args * fix duplicate call of `epoch_step` * temporal workaround about compression metrics * test for all training cases * temporal workaround for eval only * cover train/eval tests * style fix * Move old ovtrainer tests to a new `test_training.py` file; bugfix in training loss check (#6) * removing old tests in test_quantization since they are now in `test_training` * bugfix in checking compression metrics during training * keep bert examples only and misc. fixes (#7) * temporarily keep bert examples only; remove w2v2 and swin * move nncf_compression_config out of OVTrainingArguments * type hint change for nncf_compression_config * documnet rename feature to task * revert existing QAT image classification example * delete useless codes in test quantization * revert existing test_ quantization * misc change in compute_metric * revert unnecessary changes * temporal workaround for logging distill & compression loss (not using dist. reduce) * revert set_task method * bugfix in compression metric in qa task * bugfix in importing tpu * simplify pruning ir codes * clean unncessary distillation weight attribute in trainer * Change nncf requirement to official 2.4 * Log nncf compression statistics at the beginning of each training epoch * Revise optimization_ov.mdx documentation * Consolidate during training optimization to QAT and JPQD * Add known limitation regarding OpenVINO IR with static input shape * fix data parallel crashes and add tests for DP/DDP (#8) * fix "not same device" error in data parallel * wrap teacher model with data parallel * add sst2 tests for dp/ddp with fixes * Add remark in optimization_ov.mdx on supported model architecture for structured pruning * Refactor JPQD IR generation where final IR is dynamic in input shape * Revise optimization_ov.mdx to remove static IR limitations * revert snippet for inference with Transformers pipeline * Remove commented codes in openvino/trainer.py * Add tests about new OV IR export - check dynamic graph and output equivalence to torch model (#9) * draft for new export with some todos * draft for tests * delete onnx export debugging when errors on saving * add back the debug info when ir export fails * bugfix in random setting zeros in movement masks * Add tests on OV IR reshape-ability * Remove unused imports in openvino/trainer.py * Refine inference pipeline with OVModel in optimization_ov.mdx * Revise openvino extras in setup.py --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> Co-authored-by: Yujie Pan <yujie.pan@intel.com>

vuiseng9 and others added 30 commits February 10, 2023 16:37

Initial commit to enable OVTrainer with joint pruning, quantization a…

98fa4d3

…nd distillation via NNCF

Review OpenVINO Q&A readme and configs

1ae0a23

Update README.md

7c131c3

Add post init value checker to OVTrainingArguments

85b21df

Update openvino-dev and nncf version in setup.py

2f14d8c

refactor _enable_standard_onnx_export_option to _set_standard_onnx_ex…

145547d

…port_option

Refactor compression-related logging

9ca14d9

Refactor OpenVINO IR generation and patch tests

8e10ee2

Miscellaneous refactoring

bb04ca7

* MO IR pruning depends on scheduler stage * Readme tweaks for all example tasks * Minor tweak on tests * Align setup.py for openvino-dev and nncf versions needed for JPQD

Fix lint with Black

557c7a0

Refactor OpenVINO IR generation using python api

689c69a

Fix via isort

e85bb0e

Handle IR generation error to avoid run termination

68c7da4

Update QA readme

44a594c

Enable distillation on openvino's image classification example

50977f6

Minor refactoring in openvino's audio classification example

addd8b1

Move openvino-dev dependency to be extra of NNCF

f1c5a53

Configure IR model to accept dynamic-shaped input

3073733

Revert _enable_standard_onnx_export_option method in OVConfig

c0cf8c7

Update wav2vec2 configs for audio classification

7c21a39

Add BERT-base/glue-sst2 example with QAT / JPQD (#4)

df486fc

* copy text-classification example from transformers * init draft for sst example * update sst2 accuracy & training time

Revise wav2vec2 config and audio classification readme

0db9c20

Patch _enable_standard_onnx_export_option to only add the key pair to…

08a3c84

… quantization config

Set logging level to INFO in openvino/trainer.py

cd03ea8

Review readme of text and image classification

1bf2d73

Revert IR generation with static input shape for joint compression

7724ce7

Add distillation and advance optimization section in optimization_ov.mdx

f2a0eb7

Patch tests

9d5697e

vuiseng9 and others added 11 commits February 10, 2023 16:37

Revise formatting of optimization_ov.mdx

6e5b723

Limit #checkpoint saved for JPQD samples

e883927

Handle NNCF output to text log and only print errors to stdout

7672aaa

Replace hardcoded model.onnx filename with constant variable

d0eed82

Fix movement sparsity config in optimization_ov.mdx

acece3a

Change _set_feature to _set_task to align with OVQuantizer

eb89594

Move old ovtrainer tests to a new test_training.py file; bugfix in …

c45edfd

…training loss check (#6) * removing old tests in test_quantization since they are now in `test_training` * bugfix in checking compression metrics during training

temporarily keep bert examples only; remove w2v2 and swin

12cefec

move nncf_compression_config out of OVTrainingArguments

37c57d2

type hint change for nncf_compression_config

859d20e

yujiepan-work added 12 commits February 13, 2023 16:33

documnet rename feature to task

c7ea70e

revert existing QAT image classification example

cac8c0c

delete useless codes in test quantization

6114f89

revert existing test_ quantization

95135e6

misc change in compute_metric

899b14e

revert unnecessary changes

6b1ce8a

temporal workaround for logging distill & compression loss (not using…

e71aebe

… dist. reduce)

revert set_task method

f7b2675

bugfix in compression metric in qa task

cc9d276

bugfix in importing tpu

ead90d0

simplify pruning ir codes

9579c82

clean unncessary distillation weight attribute in trainer

e2cbed4

yujiepan-work force-pushed the pr-nncf-jpqd-with-bert-example-only branch from fbb18af to e2cbed4 Compare February 13, 2023 15:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Pr nncf jpqd with bert example only #2

Pr nncf jpqd with bert example only #2

Uh oh!

yujiepan-work commented Feb 13, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Feb 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Pr nncf jpqd with bert example only #2

Are you sure you want to change the base?

Pr nncf jpqd with bert example only #2

Uh oh!

Conversation

yujiepan-work commented Feb 13, 2023

What does this PR do?

Before submitting

Uh oh!

HuggingFaceDocBuilderDev commented Feb 13, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants