Skip to content

Conversation

xiaoxiaohehe001
Copy link
Contributor

PR types

Function optimization

PR changes

Others

Description

Support pre_caches input of llama

@paddle-bot
Copy link

paddle-bot bot commented Sep 1, 2023

Thanks for your contribution!

@CLAassistant
Copy link

CLAassistant commented Sep 1, 2023

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 3 committers have signed the CLA.

✅ xiaoxiaohehe001
❌ root
❌ wj-Mcat


root seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

Comment on lines +753 to +755
# "类型#裙*材质#针织*颜色#纯色*风格#复古*风格#文艺*风格#简约*图案#格子*图案#纯色*图案#复古*裙型#背带裙*裙长#连衣裙*裙领型",
"类型#上衣*材质#牛仔布*颜色#白色*风格#简约*图案#刺绣*衣样式#外套*衣款式#破洞",
"类型#上衣*风格#嘻哈*图案#卡通*图案#印花*图案#撞色*衣样式#卫衣*衣款式#连帽",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这种测试的 case 就不要提交上来嘛

@@ -399,13 +404,16 @@ def dybatch_preprocess(

inputs = {}
pad_token_id = tokenizer([tokenizer.pad_token], return_tensors="np")["input_ids"][0][-1]
pad_token_id = 0
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里的 hard-code 还是删掉吧,后面通过 tokenizer config 文件来调整比较好=

@@ -455,6 +458,7 @@ def forward(
q_out, k_out, v_out = qkv_transpose_split(
qkv_out, padding_offset, seq_lens, input_ids, self.num_heads // self.nranks, self.head_dim
)
# import pdb; pdb.set_trace()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

测试的代码要删掉。

Comment on lines 209 to 213
print("cum_offsets", cum_offsets)
print("padding_offset", padding_offset)

# if is_decoder:
# import pdb; pdb.set_trace()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

代码要删掉

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

看了下有很多其他类似的代码,也是需要删掉的。

Copy link

github-actions bot commented Nov 4, 2023

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动,被标记为stale。

@github-actions github-actions bot added the stale label Nov 4, 2023
@github-actions github-actions bot removed the stale label Jan 11, 2024
Copy link

github-actions bot commented Apr 1, 2024

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动,被标记为stale。

@github-actions github-actions bot added the stale label Apr 1, 2024
@github-actions github-actions bot removed the stale label Jul 11, 2024
Copy link

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动,被标记为stale。

@github-actions github-actions bot added the stale label Sep 23, 2024
@github-actions github-actions bot removed the stale label Oct 16, 2024
Copy link

This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动,被标记为stale。

@github-actions github-actions bot added the stale label Jan 23, 2025
@github-actions github-actions bot removed the stale label Feb 5, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants