[Inference] Fix weight_only_int4 bug #9073

lixcli · 2024-09-03T07:17:30Z

PR types

Bug fixes

PR changes

Others

Description

Fix weight_only_int4 bug


paddle-bot · 2024-09-03T07:17:35Z

Thanks for your contribution!

codecov · 2024-09-03T07:51:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.51%. Comparing base (4e7fb49) to head (5f49b75).
Report is 3 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9073      +/-   ##
===========================================
- Coverage    53.56%   53.51%   -0.05%     
===========================================
  Files          652      652              
  Lines       106397   105187    -1210     
===========================================
- Hits         56987    56291     -696     
+ Misses       49410    48896     -514


DesmonDay · 2024-09-04T08:44:14Z

weight_quant_method appears in many places in the codebase; please check all of them globally.

DesmonDay

LGTM

lixcli · 2024-09-04T09:47:45Z

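> weight_quant_method appears in many places in the codebase; please check all of them globally.

After a global check, I found that weight_quant_method was declared twice in argument.py. The duplicate weight_quant_method field has been removed in this PR.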
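A duplicate like this is easy to miss because Python does not complain when a class body annotates the same name twice; the later declaration silently wins. As a rough illustration of how such a global check could be automated, here is a minimal sketch that uses the standard `ast` module to list field names annotated more than once in a class. The `duplicate_fields` helper, the `QuantArgument` class, and its default values are all hypothetical and are not taken from the PaddleNLP source.

```python
import ast
from collections import Counter


def duplicate_fields(source: str) -> dict[str, list[str]]:
    """Map each class name to the field names annotated more than once in its body."""
    duplicates = {}
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.ClassDef):
            continue
        # Count annotated assignments such as `name: type = default`.
        counts = Counter(
            stmt.target.id
            for stmt in node.body
            if isinstance(stmt, ast.AnnAssign) and isinstance(stmt.target, ast.Name)
        )
        repeated = sorted(name for name, n in counts.items() if n > 1)
        if repeated:
            duplicates[node.name] = repeated
    return duplicates


# Hypothetical stand-in for an arguments file; NOT the real PaddleNLP code.
SAMPLE = '''
from dataclasses import dataclass

@dataclass
class QuantArgument:
    quant_type: str = "a8w8"
    weight_quant_method: str = "abs_max"
    shift: bool = False
    weight_quant_method: str = "abs_max_channel_wise"
'''

result = duplicate_fields(SAMPLE)
print(result)
```

To scan a real file, one could read its text and pass it to `duplicate_fields`. The sketch only looks at annotated assignments directly inside a class body, so it would not catch fields added dynamically or through inheritance.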

lixcli added 16 commits August 28, 2024 07:36

1. add a8w8(fp8) a8w8c8(int8) quant_type support

005f2ad

2. add llama3.1 and qwen2 ptq config 3. update quantization.md

fix load_quant_model bug

e56d9c4

fix load quant bug

e2b9a49

update ll/README.md

d21ace7

remove useless code

e89372c

update quant observer config

e7160d3

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

630b3d6

…nto add_new_fakequant_type

resolve wrong modify

7032bf2

fix prepare_qconfig

323c465

remove unuse files

df416ac

update quantization.md

db61a99

reformat quantization.md and argument.py

f114947

update prepare data method for ceval ptq

8d26cb1

fix wint4 bug

cebf8f0

Merge branch 'develop' of https://github.com/lixcli/PaddleNLP

bc67b75

fix wint4 config bug

5f49b75

yuanlehome approved these changes Sep 4, 2024


DesmonDay approved these changes Sep 4, 2024


DesmonDay merged commit 70da482 into PaddlePaddle:develop Sep 4, 2024
10 of 12 checks passed

ckl117 pushed a commit to ckl117/PaddleNLP that referenced this pull request Sep 9, 2024

[Inference] Fix weight_only_int4 bug (PaddlePaddle#9073)

7dc3245

Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024

[Inference] Fix weight_only_int4 bug (PaddlePaddle#9073)

549dcf8
