[Compression] Add quantization tutorial #5454
Conversation
dependencies/recommended.txt (Outdated)
@@ -21,3 +21,4 @@ matplotlib
 git+https://github.com/microsoft/nn-Meter.git#egg=nn_meter
 sympy
 timm >= 0.5.4
+datasets == 2.10.1
Any particular reason to freeze the version?
Good suggestion, there is no need to freeze the version. I will modify it.
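For reference, a sketch of how the relaxed dependency line might look after the change; whether a lower bound is kept is an assumption, not taken from the final diff:

datasets >= 2.10.1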
docs/source/examples.rst (Outdated)
.. cardlinkitem::
   :header: Quantize Bert on Task MNLI
   :description: An end to end example for how to using NNI to quantize transformer and show the real speedup number
Can we actually show real speedup? 😂
no no no
Quantize BERT on Task GLUE
==========================

Here we show an effective transformer simulated quantization process that NNI team has tried, and users can use NNI to discover better process
Add a period at the end of the sentence: `.`
we use the BERT model and the trainer pipeline in the Transformers to do some experiments.
Capitalize: `We`
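As context for the quoted sentence, loading the BERT model with Transformers might look like the sketch below; the checkpoint name and label count are assumptions based on the MNLI example mentioned above, not the tutorial's actual code.

from transformers import BertForSequenceClassification

# MNLI is a 3-way classification task (entailment / neutral / contradiction),
# hence num_labels=3; 'bert-base-uncased' is an assumed checkpoint.
model = BertForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=3)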
# .. note::
#
#     Please set ``is_trace`` to ``False`` to fine-tune the BERT model and set ``is_trace`` to ``True``
#     When you need to create a traced trainer for model quantization.
Join with a comma and lowercase: `, when`
And fine-tuning the model can also use a traced trainer.
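A minimal sketch of how the ``is_trace`` switch could be wired up, assuming a Hugging Face ``Trainer``; the helper name and its arguments are illustrative, not the tutorial's actual code.

import nni
from transformers import Trainer, TrainingArguments

def prepare_trainer(model, train_dataset, eval_dataset, is_trace=False):
    # When is_trace is True, wrap the classes with nni.trace so NNI can
    # record the init arguments and re-create the trainer during
    # quantization; a plain Trainer is enough for fine-tuning, though a
    # traced one works for fine-tuning as well.
    args_cls = nni.trace(TrainingArguments) if is_trace else TrainingArguments
    trainer_cls = nni.trace(Trainer) if is_trace else Trainer
    training_args = args_cls(output_dir='./output', num_train_epochs=3)
    return trainer_cls(model=model, args=training_args,
                       train_dataset=train_dataset, eval_dataset=eval_dataset)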
config_list = [{
    'op_types': ['Linear'],
    'op_names_re': ['bert.encoder.layer.{}'.format(i) for i in range(12)],
    'target_names': ['weight', '_output_'],
Why not quantize the input?
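For illustration, a hedged variant of the config that also quantizes inputs by adding ``'_input_'`` to ``target_names``; the extra ``quant_dtype`` field is an assumed setting, not taken from the tutorial.

config_list = [{
    'op_types': ['Linear'],
    'op_names_re': ['bert.encoder.layer.{}'.format(i) for i in range(12)],
    # '_input_' added so the activations flowing into the Linear layers
    # are fake-quantized along with the weights and outputs.
    'target_names': ['_input_', 'weight', '_output_'],
    'quant_dtype': 'int8',
}]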
if __name__ == "__main__":
    fake_quantize()
    evaluate()
Remove the `if __name__ == "__main__":` guard, this is an IPython-style script.
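After the suggested change, the tail of the script would just call the steps at top level, matching the IPython/sphinx-gallery style (assuming ``fake_quantize`` and ``evaluate`` are defined earlier in the tutorial):

# Top-level calls, executed directly when the tutorial script runs.
fake_quantize()
evaluate()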
Will review this doc after merge.