Fit sharding optimization for auto parallel llama #8021
Conversation
Thanks for your contribution!
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files

@@            Coverage Diff            @@
##           develop    #8021    +/-   ##
===========================================
- Coverage    56.55%   56.54%    -0.01%
===========================================
  Files          592      592
  Lines        91036    91067       +31
===========================================
+ Hits         51484    51493        +9
- Misses       39552    39574       +22

☔ View full report in Codecov by Sentry.
LGTM
When you have time, please also update the Chinese documentation here: https://github.com/PaddlePaddle/PaddleNLP/blob/develop/docs/trainer.md?plain=1#L540-L553 . Approving for now.
LGTM
PR types
Bug fixes
PR changes
Models
Description
Adapts the static semi-auto-parallel Llama model to the sharding-optimization switches, including:

- data_parallel_config: adds DP-related optimization switches. Currently the enable_allreduce_avg_in_gradinent_scale option is supported, which uses the allreduce_avg communication op for DP gradient synchronization. The framework-side implementation is in PaddlePaddle/Paddle#61622.
- sharding_parallel_config: adapts the enable_stage1_tensor_fusion, enable_stage1_overlap, and enable_stage2_overlap options. enable_stage1_tensor_fusion enables sharding communication fusion, while enable_stage1_overlap and enable_stage2_overlap enable sharding communication overlap. Under static semi-auto parallel, both the fusion and overlap strategies currently take effect only in stage 2; stage 1 has no corresponding implementation (see the configuration sketch below).
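For reference, here is a minimal sketch of how such space-separated option strings might be turned into boolean switches. The option names come from this PR; the parse_parallel_config helper and the surrounding variable handling are hypothetical illustrations, not PaddleNLP's actual trainer code.

```python
# Hypothetical sketch only: shows how space-separated option strings such as
# data_parallel_config / sharding_parallel_config could be parsed into
# boolean switches before building the distributed strategy.

def parse_parallel_config(config: str) -> set:
    """Split a space-separated config string into a set of enabled options."""
    return set(config.split())

# Example values using the options described in this PR.
data_parallel_config = "enable_allreduce_avg_in_gradinent_scale"
sharding_parallel_config = "enable_stage1_tensor_fusion enable_stage2_overlap"

dp_opts = parse_parallel_config(data_parallel_config)
sharding_opts = parse_parallel_config(sharding_parallel_config)

# DP gradient synchronization uses the allreduce_avg op when this is enabled.
use_allreduce_avg = "enable_allreduce_avg_in_gradinent_scale" in dp_opts

# Sharding communication fusion / overlap switches; per the description above,
# under static semi-auto parallel these currently take effect only in stage 2.
enable_tensor_fusion = "enable_stage1_tensor_fusion" in sharding_opts
enable_comm_overlap = (
    "enable_stage1_overlap" in sharding_opts
    or "enable_stage2_overlap" in sharding_opts
)

print(use_allreduce_avg, enable_tensor_fusion, enable_comm_overlap)
```

In practice these options are typically supplied as space-separated strings through the trainer's launch arguments (for example, --sharding_parallel_config "enable_stage1_tensor_fusion enable_stage1_overlap"), as described in the trainer documentation linked above.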