2024-05-30 Add FP8 PTQ #1877
base: develop
Conversation
Thanks for your contribution!
Please add usage documentation and a usage example.
@@ -21,6 +21,7 @@
- `EMDObserver`: collects the maximum absolute value and computes the quantization scale by minimizing the EMD error
- `HistObserver`: collects tensor values into a histogram and computes the quantization scale from a percentile
- `KLObserver`: computes the quantization scale by minimizing the Kullback-Leibler divergence between the float distribution and the quantized-float distribution
- `AbsmaxObserver`: collects the maximum absolute value along the target weight tensor's dimensions as the quantization scale; the quantized data type can be selected via `quant_bits`, with FP8 supported
Is `AbsmaxObserver` the only observer that supports FP8?
@@ -60,8 +61,8 @@ model = mobilenet_v1()
q_config = QuantConfig(activation=None, weight=None)

# define act_quanter and weight_quanter
act_quanter = MSEObserver()
weight_quanter = MSEObserver()
activation = AbsmaxObserver(quant_bits=(4, 3))  # quant_bits=(4, 3) and quant_bits=(5, 2) select the float8_e4m3 and float8_e5m2 quantization formats.
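In the diff above, `quant_bits=(4, 3)` denotes the (exponent, mantissa) bit split of the FP8 format. A minimal, self-contained sketch of the absmax-to-FP8 scaling idea, assuming simulated quantization where the tensor's largest magnitude is mapped onto the format's maximum representable value (448 for e4m3, 57344 for e5m2); the integer rounding below only approximates real FP8 rounding:

```python
import numpy as np

# Maximum representable magnitude of each FP8 format.
FP8_MAX = {(4, 3): 448.0, (5, 2): 57344.0}  # e4m3, e5m2

def absmax_fp8_scale(tensor, quant_bits=(4, 3)):
    """Absmax scale mapping the tensor's largest magnitude
    onto the FP8 format's maximum representable value."""
    amax = np.max(np.abs(tensor))
    return amax / FP8_MAX[quant_bits]

def fake_quant_fp8(tensor, quant_bits=(4, 3)):
    """Simulated quantize/dequantize: scale, clip, round, rescale.
    (Integer rounding stands in for true FP8 rounding.)"""
    fmax = FP8_MAX[quant_bits]
    scale = absmax_fp8_scale(tensor, quant_bits)
    q = np.clip(tensor / scale, -fmax, fmax)
    return np.round(q) * scale

x = np.array([0.1, -2.0, 4.48])
print(fake_quant_fp8(x, (4, 3)))
```

This is only a sketch of the scaling arithmetic, not the library's implementation.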
Don't modify the existing example directly; add a new example for FP8 quantization, along with accompanying documentation.
It would also help to post FP8 quantization experiment results for the different observers.
Update `UniformObserver` to support FP8 quantization.