[LLM] Add tensor parallel for chatglmv2 #9014
Conversation
Thanks for your contribution!
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files:

@@            Coverage Diff             @@
##           develop    #9014      +/-   ##
===========================================
+ Coverage    53.34%   53.79%    +0.45%
===========================================
  Files          650      652        +2
  Lines       105414   104710      -704
===========================================
+ Hits         56230    56329       +99
+ Misses       49184    48381      -803

☔ View full report in Codecov by Sentry.
LGTM
__all__ = [
    "ChatGLMv2Model",
    "ChatGLMv2PretrainedModel",
    "ChatGLMv2ForCausalLM",
]
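Since the PR's goal is tensor parallelism, the pretrained model class presumably declares how each weight is split across ranks. A minimal sketch of the mapping pattern PaddleNLP models commonly use for this (split_or_merge_func is an existing PaddleNLP helper, but the ChatGLMv2 weight names below are illustrative assumptions, not this PR's diff):

    from functools import partial

    from paddlenlp.transformers.conversion_utils import split_or_merge_func

    @classmethod
    def _get_tensor_parallel_mappings(cls, config, is_split=True):
        # Bind a split/merge function to the current tensor-parallel degree and rank.
        fn = split_or_merge_func(
            is_split=is_split,
            tensor_parallel_degree=config.tensor_parallel_degree,
            tensor_parallel_rank=config.tensor_parallel_rank,
            num_attention_heads=config.num_attention_heads,
        )
        actions = {}
        for i in range(config.num_hidden_layers):
            # Column-parallel weights are split along the output dimension
            # (weight names here are assumptions for illustration).
            actions[f"encoder.layers.{i}.mlp.dense_h_to_4h.weight"] = partial(fn, is_column=True)
            # Row-parallel weights are split along the input dimension.
            actions[f"encoder.layers.{i}.mlp.dense_4h_to_h.weight"] = partial(fn, is_column=False)
        return actions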
def seed_guard_context(name=None):
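For context, seed_guard_context is the kind of helper used to keep dropout RNG state coherent under tensor parallelism. A minimal sketch of how such a guard can be written on top of Paddle's fleet RNG state tracker (the body below is an assumption, not necessarily the PR's exact implementation):

    from contextlib import contextmanager

    from paddle.distributed.fleet.meta_parallel import get_rng_state_tracker

    @contextmanager
    def seed_guard_context(name=None):
        # If a named RNG state (e.g. "local_seed") has been registered with the
        # tracker, run the block under that state so dropout masks are reproducible
        # and correctly decorrelated across tensor-parallel ranks.
        tracker = get_rng_state_tracker()
        if name is not None and name in tracker.states_:
            with tracker.rng_state(name):
                yield
        else:
            # No tracked state for this name: fall back to the default generator.
            yield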
Does this issue come up often?
* fix_chatglmv2_8k
* fix_chatglmv2_8k
PR types
Bug fixes
PR changes
Models
Description
fix chatglmv2 8k
In the 8k setting (zero_padding=true, max_sequence_length=8192, src_length=4096):
With sharding_parallel_degree=8, sharding=stage3, recompute=True: OOM at step 9.
With sharding_parallel_degree=8, sharding=stage3, recompute=False: OOM at step 1.
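For reference, the failing 8k setup maps onto a run configuration along these lines (a sketch only; the key names mirror the parameters listed above and are assumptions about the actual config schema):

    # Sketch of the reported 8k configuration; values come from this description.
    run_config = {
        "zero_padding": True,
        "max_sequence_length": 8192,
        "src_length": 4096,
        "sharding_parallel_degree": 8,
        "sharding": "stage3",
        "recompute": True,  # OOM reported at step 9; with False, OOM at step 1
    }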
Run 1: train_runtime: 293.3676, train_samples_per_second: 0.409, train_steps_per_second: 0.1023, train_loss: 9.233072916666666, progress_or_epoch: 0.0647
Effective_Tokens_per_second: 3322.318257026338, memory: 430G

Run 2: train_runtime: 141.7329, train_samples_per_second: 0.8467, train_steps_per_second: 0.2117, train_loss: 9.233072916666666, progress_or_epoch: 0.0647
Effective_Tokens_per_second: 6876.741628090584, memory: 320G

The second run roughly doubles effective throughput (6876.7 vs. 3322.3 tokens/s) at identical loss, while memory drops from 430G to 320G.