
GhostScreaming
Contributor

PR types

Others

PR changes

Others

Description

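Code for bitwise alignment of Sequence Parallel (SP). With LLaMA reduced to 1 layer and mp_degree = 2, at the first step the dynamic-graph manual-parallel SP and the dynamic-graph semi-automatic-parallel SP align bit for bit (the forward output of each layer and the backward params.grad). Based on PR 7609.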
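
As an illustration of what a bitwise alignment check means here, the following is a minimal sketch, not the code in this PR. It assumes the per-layer forward outputs and the parameter gradients of both runs have already been dumped into plain Python dictionaries of flat float lists; the helper names `float32_bits` and `bitwise_mismatches` and the tensor names are hypothetical. It compares the raw float32 bytes rather than using a tolerance, so even a one-bit difference is reported.

```python
import struct


def float32_bits(values):
    """Pack a flat sequence of floats into raw little-endian float32 bytes."""
    return struct.pack(f"<{len(values)}f", *values)


def bitwise_mismatches(manual, semi_auto):
    """Return the sorted names whose values differ in at least one bit.

    Both arguments map a name (a layer output or a parameter gradient) to a
    flat list of floats. A name missing on either side, or a length
    mismatch, also counts as a mismatch.
    """
    mismatched = []
    for name in sorted(set(manual) | set(semi_auto)):
        a, b = manual.get(name), semi_auto.get(name)
        if a is None or b is None or len(a) != len(b):
            mismatched.append(name)
        elif float32_bits(a) != float32_bits(b):
            mismatched.append(name)
    return mismatched


# Hypothetical dumps from the two SP implementations after the first step.
manual = {"layer0.out": [0.1, 0.2], "layer0.weight.grad": [1.0, -2.0]}
semi_auto = {"layer0.out": [0.1, 0.2], "layer0.weight.grad": [1.0, -2.000001]}

print(bitwise_mismatches(manual, semi_auto))
```

An empty result means the two runs agree bit for bit on every dumped tensor. Comparing bytes is stricter than an approximate check: for example, `0.0` and `-0.0` compare equal numerically but are reported as a mismatch.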


paddle-bot bot commented Dec 14, 2023

Thanks for your contribution!


This Pull Request is stale because it has been open for 60 days with no activity.

@github-actions github-actions bot added the stale label Feb 13, 2024

CLAassistant commented Oct 14, 2024

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
0 out of 3 committers have signed the CLA.

❌ wuhuachaocoding
❌ GhostScreaming
❌ liuzhenhai93
You have signed the CLA already but the status is still pending? Let us recheck it.

@github-actions github-actions bot removed the stale label Oct 15, 2024

This Pull Request is stale because it has been open for 60 days with no activity.

@github-actions github-actions bot added the stale label Jan 23, 2025
@github-actions github-actions bot removed the stale label Feb 5, 2025

This Pull Request is stale because it has been open for 60 days with no activity.

@github-actions github-actions bot added the stale label Apr 12, 2025
4 participants