
Conversation


@dc3671 dc3671 commented Jul 24, 2023

Problem

After the Llama 2 update in transformers ([Llama2] Add support for Llama 2), autoTP for Llama is broken.
[screenshot: error when running autoTP with Llama 2]

Reason

After digging into the transformers modification, I noticed that there is a new self.num_key_value_heads attribute in the attention module:
[screenshot: the new num_key_value_heads attribute in the transformers attention module]
https://github.com/huggingface/transformers/pull/24891/files#diff-06392bad3b9e97be9ade60d4ac46f73b6809388f4d507c2ba1384ab872711c51R253
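
For reference, a simplified sketch of what the linked transformers change introduces (paraphrased, not the verbatim transformers code): the Llama 2 attention module adds grouped-query attention, so the k/v projections are sized by num_key_value_heads rather than num_heads, and that attribute must also be divided when the module is tensor-parallelized.

```python
import torch.nn as nn

class LlamaAttentionSketch(nn.Module):
    """Paraphrased sketch of transformers' LlamaAttention after the Llama 2 change."""

    def __init__(self, config):
        super().__init__()
        self.hidden_size = config.hidden_size
        self.num_heads = config.num_attention_heads
        self.head_dim = self.hidden_size // self.num_heads
        # New attribute introduced for Llama 2 grouped-query attention
        self.num_key_value_heads = config.num_key_value_heads
        self.num_key_value_groups = self.num_heads // self.num_key_value_heads

        self.q_proj = nn.Linear(self.hidden_size, self.num_heads * self.head_dim, bias=False)
        # k/v projections now depend on num_key_value_heads, not num_heads
        self.k_proj = nn.Linear(self.hidden_size, self.num_key_value_heads * self.head_dim, bias=False)
        self.v_proj = nn.Linear(self.hidden_size, self.num_key_value_heads * self.head_dim, bias=False)
        self.o_proj = nn.Linear(self.num_heads * self.head_dim, self.hidden_size, bias=False)
```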

Solution

In DeepSpeed, this part is controlled by a hardcoded name list in update_mp_params of autoTP's replace_wo_policy (see this PR's modification). I added one more key to that list, which fixes the problem.
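
To illustrate the mechanism, here is a simplified sketch of the idea (not the verbatim DeepSpeed code; the real list in deepspeed/module_inject/replace_module.py is longer and may differ across versions): update_mp_params walks a fixed list of attribute names on each replaced module and divides each one by the tensor-parallel size, so num_key_value_heads just needs to appear in that list.

```python
# Simplified sketch of autoTP's update_mp_params in replace_wo_policy
# (illustrative only; attribute names shown are representative).
def update_mp_params(child, mp_size):
    # Hardcoded attribute names that must be divided by the tensor-parallel size
    for param in [
            "n_heads", "num_heads", "num_attention_heads",
            "all_head_size", "embed_dim", "hidden_size",
            "num_key_value_heads",  # new key added by this PR for Llama 2 GQA
    ]:
        if hasattr(child, param):
            param_val = getattr(child, param)
            assert param_val % mp_size == 0, \
                f"{param} ({param_val}) must be divisible by mp_size ({mp_size})"
            setattr(child, param, param_val // mp_size)
```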

@tjruwase @jeffra please kindly review, thanks~

@mrwyattii mrwyattii added this pull request to the merge queue Jul 24, 2023
Merged via the queue into deepspeedai:master with commit f3943cf Jul 24, 2023
@dc3671 dc3671 deleted the fix-llama2-autotp branch August 8, 2023 06:37
blzheng pushed a commit to blzheng/DeepSpeedSYCLSupport that referenced this pull request Aug 15, 2023
delock pushed a commit to delock/DeepSpeedSYCLSupport that referenced this pull request Aug 16, 2023
* add lm_head tensor parallel

* fix conflict

* add embed_out tp

* add llama2 autoTP support in replace_module (deepspeedai#4022)

---------

Co-authored-by: Lai, Yejing <yejing.lai@intel.com>
Co-authored-by: Dino Chen <zhenhuan.chen@intel.com>