
[pir save] Modify llama model file export in PIR mode #8689

Merged

Merged 6 commits into PaddlePaddle:develop on Jul 2, 2024

Conversation

xiaoguoguo626807 (Contributor) commented Jul 1, 2024

PR types

Others

PR changes

Others

Description

pcard-67164
Modified code in several places to support exporting the llama-2-7b model in PIR mode.

  1. Under dynamic-to-static conversion, a dynamic shape prevents export, so the check on attn_weights.shape in paddlenlp/transformers/llama/modeling.py must be skipped during dynamic-to-static. In dynamic-graph mode this check can intercept errors, and skipping it under dynamic-to-static causes no problems.
  2. When pad_token_id = None, PIR does not allow passing value=None to full_like, so the existing logic here was incomplete. The generate function now checks whether pad_token_id is missing and, if so, falls back to eos_token_id.
  3. PIR has no print op, and the op-related methods differ, so branch handling is required.
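The fallback in item 2 can be sketched roughly as follows. This is a minimal pure-Python sketch, not the actual PaddleNLP code: the helper name `resolve_pad_token_id` is hypothetical (the diff below uses `set_pad_token_id`), and the handling of a list-valued `eos_token_id` is an assumption.

```python
def resolve_pad_token_id(pad_token_id, eos_token_id):
    """Hypothetical sketch of the pad_token_id fallback described above.

    Under PIR, full_like cannot take value=None, so a concrete
    pad_token_id must exist before generation pads sequences.
    """
    if pad_token_id is None and eos_token_id is not None:
        # Assumption: eos_token_id may be a list of ids; use the first.
        if isinstance(eos_token_id, (list, tuple)):
            return eos_token_id[0]
        return eos_token_id
    return pad_token_id
```

With this guard in place, the value passed on to full_like is never None as long as an eos token exists.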


paddle-bot bot commented Jul 1, 2024

Thanks for your contribution!


codecov bot commented Jul 1, 2024

Codecov Report

Attention: Patch coverage is 60.00000% with 4 lines in your changes missing coverage. Please review.

Project coverage is 55.62%. Comparing base (be5bb14) to head (e86f5bc).
Report is 231 commits behind head on develop.

Files with missing lines Patch % Lines
paddlenlp/generation/utils.py 50.00% 4 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #8689   +/-   ##
========================================
  Coverage    55.61%   55.62%           
========================================
  Files          620      620           
  Lines        96965    96991   +26     
========================================
+ Hits         53930    53949   +19     
- Misses       43035    43042    +7     


@@ -1038,6 +1042,7 @@ def greedy_search(
synced_gpus=False,
**model_kwargs
):
pad_token_id = self.set_pad_token_id(pad_token_id, eos_token_id)
Collaborator

generate has already called set_pad_token_id, so there is no need to set it again here.

Contributor Author
done

@@ -1143,6 +1148,7 @@ def sample(
synced_gpus=False,
**model_kwargs
):
pad_token_id = self.set_pad_token_id(pad_token_id, eos_token_id)
Collaborator

Same as above.

Contributor Author

done

Collaborator

@wawltor wawltor left a comment

LGTM

@wawltor wawltor merged commit d832282 into PaddlePaddle:develop Jul 2, 2024
8 of 11 checks passed