
quantization aware training pass #3817

Merged

merged 92 commits into master from quant_aware_training_dqx on Jan 13, 2021
Conversation

daquexian (Contributor) commented:
A pass that automatically inserts the ops required for quantization aware training.
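For context on what the pass inserts: a fake-quantize op simulates integer quantization in floating point by quantizing, clamping, and immediately dequantizing, so the network trains against quantization error. A minimal sketch assuming 8-bit affine quantization; `fake_quantize` is an illustrative helper, not OneFlow's actual op:

```python
import numpy as np

def fake_quantize(x, scale, zero_point, quantization_bit=8):
    """Simulate affine integer quantization: quantize, clamp, dequantize."""
    qmin, qmax = 0, 2 ** quantization_bit - 1
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax)
    # Back to float; the result carries the rounding/clamping error.
    return (q - zero_point) * scale

x = np.array([-1.0, 0.0, 0.5, 1.0])
y = fake_quantize(x, scale=2.0 / 255, zero_point=128)
```

The pass's job, per the commit log below, is to place such ops after weights and activations, with scale and zero_point supplied by dedicated observer ops.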

@daquexian daquexian marked this pull request as draft November 13, 2020 07:46
@daquexian daquexian added the feature, system, and WIP (work in progress) labels Nov 13, 2020

namespace {

void VerifyQATList(const QATList& amp_list) {
A reviewer (Contributor) suggested renaming the parameter to match its type:
void VerifyQATList(const QATList& qat_list) {

@daquexian daquexian marked this pull request as ready for review December 21, 2020 03:03
Signed-off-by: daquexian <daquexian566@gmail.com>
@daquexian daquexian changed the title [WIP] quantization aware training pass quantization aware training pass Jan 13, 2021
Signed-off-by: daquexian <daquexian566@gmail.com>
@daquexian daquexian force-pushed the quant_aware_training_dqx branch from 047c9d6 to 3832433 on January 13, 2021 06:20
@daquexian daquexian requested review from oneflow-ci-bot and removed request for oneflow-ci-bot January 13, 2021 06:24
@daquexian daquexian removed the WIP (work in progress) label Jan 13, 2021
@oneflow-ci-bot oneflow-ci-bot removed their request for review January 13, 2021 06:27
Signed-off-by: daquexian <daquexian566@gmail.com>
@oneflow-ci-bot oneflow-ci-bot removed their request for review January 13, 2021 08:56
@oneflow-ci-bot oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot January 13, 2021 09:35
Signed-off-by: daquexian <daquexian566@gmail.com>
@oneflow-ci-bot oneflow-ci-bot removed their request for review January 13, 2021 14:54
@oneflow-ci-bot oneflow-ci-bot merged commit a48c6e4 into master Jan 13, 2021
@oneflow-ci-bot oneflow-ci-bot deleted the quant_aware_training_dqx branch January 13, 2021 14:54
liujuncheng pushed a commit that referenced this pull request Jun 3, 2021
* init qat pass

* fix bugs

* add calculate weight scale and zero_point op & unit tests

* clear batch axis of scale and zero_point

* add calculate activation scale and zero_point op & unit tests

* add fake quantization ops & unit tests

* add sbp signature to fake quantization op & improve code style

* improve unit test speed

* update pass

* add QatConfig

* format

* code clean

* make changes according to review comments

* rename quantize ops following PyTorch's naming scheme

* change the input zero_point of fake_quantize op to optional

* stop updating moving_min and moving_max after training iteration reaches the given point

* align with latest fake quant ops

Signed-off-by: daquexian <daquexian566@gmail.com>

* optimize CHECK

Signed-off-by: daquexian <daquexian566@gmail.com>

* add multiple devices tests && fix sbp infer error

* fix bugs on mobilenetv2

Signed-off-by: daquexian <daquexian566@gmail.com>

* format

Signed-off-by: daquexian <daquexian566@gmail.com>

* stop updating moving max and min in prediction mode

* improve ReduceMaxMinPerChannel CUDA kernel slightly

* align with cfg job_conf

Signed-off-by: daquexian <daquexian566@gmail.com>

* amp_list -> op_list

Signed-off-by: daquexian <daquexian566@gmail.com>

* support conv op with bias input

Signed-off-by: daquexian <daquexian566@gmail.com>

* change quantize_to_bit to quantization_bit

* change quantize to quantization

* format

Signed-off-by: daquexian <daquexian566@gmail.com>

* fix bias zero point shape, add tests

Signed-off-by: daquexian <daquexian566@gmail.com>

* set 'training' attr according to job desc

Signed-off-by: daquexian <daquexian566@gmail.com>

* refine tests

Signed-off-by: daquexian <daquexian566@gmail.com>

* polish

Signed-off-by: daquexian <daquexian566@gmail.com>

* reformat

Signed-off-by: daquexian <daquexian566@gmail.com>

* fix cpu test

Signed-off-by: daquexian <daquexian566@gmail.com>

Co-authored-by: Ldpe2G <liangdepeng@gmail.com>
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>
Former-commit-id: a48c6e4
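The "calculate weight/activation scale and zero_point" and "moving_min"/"moving_max" commits above follow the usual observer pattern for affine quantization: scale maps the observed float range onto the integer range, zero_point is the integer that represents float 0, and activation ranges are tracked with an exponential moving average that is frozen in prediction mode or after a given training iteration. A minimal sketch under those assumptions; function names and the momentum value are illustrative, not OneFlow's actual API:

```python
import numpy as np

def scale_zero_point(min_val, max_val, quantization_bit=8):
    """Affine quantization parameters from an observed float range."""
    qmin, qmax = 0, 2 ** quantization_bit - 1
    # The representable range must contain 0 so that zero quantizes exactly.
    min_val, max_val = min(min_val, 0.0), max(max_val, 0.0)
    scale = (max_val - min_val) / (qmax - qmin)
    zero_point = int(round(qmin - min_val / scale))
    return scale, zero_point

def update_moving_range(moving_min, moving_max, batch, momentum=0.95, training=True):
    """EMA observer for activation ranges; frozen outside training."""
    if not training:  # prediction mode, or past the configured stop iteration
        return moving_min, moving_max
    return (momentum * moving_min + (1 - momentum) * float(batch.min()),
            momentum * moving_max + (1 - momentum) * float(batch.max()))

s, zp = scale_zero_point(-1.0, 1.0)  # scale = 2/255, zero_point = 128
```

Weights are typically observed per batch with exact min/max, while activations use the moving-average observer, which matches the separate weight and activation observer ops in the commit log.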