Fp8 ptq kernel function #65441

YZW-explorer · 2024-06-25T06:54:17Z

PR Category

Inference

PR Types

New features

Description

quantize_linear and dequantize_linear now support fp8 ptq function

…ernel

paddle-bot · 2024-06-25T06:54:23Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle/phi/kernels/funcs/fake_quantize_functor.cu

…ernel

qingqing01 · 2024-07-02T05:51:48Z

python/paddle/nn/quant/format.py

@@ -78,18 +78,44 @@ def __init__(
        self._zero_point.set_value(zero_point)
        self._quant_axis = -1 if quant_axis is None else quant_axis
        self._bit_length = bit_length
+        self._group_size = group_size
+        if isinstance(self._bit_length, tuple):


看起来接口文档里bit_length也得增加下注释，这里使用方式变了

YZW-explorer added 2 commits June 25, 2024 06:49

fp8 ptq kernel function

a09fb25

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fp8ptqk…

2189e5f

…ernel

paddle-bot bot added the contributor External developers label Jun 25, 2024

ceci3 reviewed Jun 25, 2024

View reviewed changes

paddle/phi/kernels/funcs/fake_quantize_functor.cu Outdated Show resolved Hide resolved

paddle/phi/kernels/funcs/fake_quantize_functor.cu Outdated Show resolved Hide resolved

change bin_cnt into qmax

d7cc5fb

ceci3 previously approved these changes Jun 25, 2024

View reviewed changes

add qmin qmax in test/legacy_test/test_fake_dequantize_op.py

f0cf64d

YZW-explorer dismissed ceci3’s stale review via f0cf64d June 26, 2024 07:38

YZW-explorer added 10 commits June 26, 2024 07:53

code style

026b32b

qat

a96e12c

change quantization pass .py

76cff0c

solve conflict

0652268

QuantizeLinearOpTranscriber add qmin qmax

33e7e55

fix error in python/paddle/static/quantization/quantization_pass.py

9b903c6

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into fp8ptqk…

f6b7f45

…ernel

delet files

1e0d5e6

add fp8 op

e89eeaa

fp 8 ptq code style change

4ac4eaf

ceci3 approved these changes Jul 2, 2024

View reviewed changes

qingqing01 approved these changes Jul 2, 2024

View reviewed changes

ceci3 merged commit 6a21a78 into PaddlePaddle:develop Jul 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fp8 ptq kernel function #65441

Fp8 ptq kernel function #65441

Uh oh!

YZW-explorer commented Jun 25, 2024 •

edited

Loading

Uh oh!

paddle-bot bot commented Jun 25, 2024

Uh oh!

Uh oh!

Uh oh!

qingqing01 Jul 2, 2024

Uh oh!

Uh oh!

Fp8 ptq kernel function #65441

Fp8 ptq kernel function #65441

Uh oh!

Conversation

YZW-explorer commented Jun 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Uh oh!

paddle-bot bot commented Jun 25, 2024

Uh oh!

Uh oh!

Uh oh!

qingqing01 Jul 2, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

YZW-explorer commented Jun 25, 2024 •

edited

Loading