[Inference] Update fakequant script #9054
Conversation
1. add a8w8(fp8) a8w8c8(int8) quant_type support
2. add llama3.1 and qwen2 ptq config
3. update quantization.md

…nto add_new_fakequant_type
Thanks for your contribution!
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

```
@@            Coverage Diff             @@
##           develop    #9054      +/-   ##
===========================================
+ Coverage    53.76%   53.82%   +0.06%
===========================================
  Files          652      652
  Lines       104507   104529      +22
===========================================
+ Hits         56190    56265      +75
+ Misses       48317    48264      -53
===========================================
```

☔ View full report in Codecov by Sentry.
LGTM
LGTM
* 1. add a8w8(fp8) a8w8c8(int8) quant_type support 2. add llama3.1 and qwen2 ptq config 3. update quantization.md
* fix load_quant_model bug
* fix load quant bug
* update ll/README.md
* remove useless code
* update quant observer config
* resolve wrong modify
* fix prepare_qconfig
* remove unused files
* update quantization.md
* reformat quantization.md and argument.py
* update prepare data method for ceval ptq
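For context on the `a8w8`/`a8w8c8` quant_types added by this PR: fake quantization ("fakequant") simulates low-precision inference by quantizing a tensor and immediately dequantizing it back to float, so quantization error can be measured without integer kernels. The snippet below is a minimal generic sketch of symmetric per-tensor int8 fake quantization; it is illustrative only and does not use the PaddleNLP script's actual API (`fake_quant` and its signature are hypothetical names, not from this repo).

```python
import numpy as np

def fake_quant(x, num_bits=8):
    # Symmetric per-tensor fake quantization (illustrative sketch):
    # map floats onto the signed integer grid, then dequantize back.
    qmax = 2 ** (num_bits - 1) - 1            # 127 for int8
    scale = np.max(np.abs(x)) / qmax          # per-tensor scale from abs-max
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return (q * scale).astype(x.dtype)        # dequantized values

x = np.array([0.5, -1.27, 0.003, 1.0], dtype=np.float32)
xq = fake_quant(x)  # close to x, but snapped to the int8 grid
```

In a real a8w8 setup this rounding is applied to both activations ("a8") and weights ("w8"), with scales chosen by an observer during PTQ calibration rather than from a single tensor as here.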
### PR types
Others

### PR changes
Docs

### Description
update quantization.md