Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support cpm on xpu #76

Merged
merged 9 commits into from
May 25, 2023
Merged

support cpm on xpu #76

merged 9 commits into from
May 25, 2023

Conversation

ScoThunder
Copy link
Contributor

No description provided.

training/benchmarks/cpm/pytorch/config/_base.py Outdated Show resolved Hide resolved
@@ -0,0 +1,19 @@
from config_common import *

dist_backend = "xccl"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

1*1的配置如果没有跑实验,运行情况一栏没有写相关数据,可以不提供这个配置,

@@ -0,0 +1,21 @@
from config_common import *
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上

@@ -0,0 +1,19 @@
from config_common import *
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上,如果不是实际运行的1*4配置,这边可以不用给出。主要是担心,没有验证的配置容易不符合预期

return model


def remap_attn_parameters(model_dict):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个函数是不是不需要啦

dist_backend = "xccl"

use_env = True
target_embedding_average = 0.92
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

target_embedding_average和gradient_accumulation_steps,和_base.py中的配置一致,可以删除。

warmup = 0.2
learning_rate = 0.0005

beta_1: float = 0.9
Copy link
Contributor

@yuzhou03 yuzhou03 May 18, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

beta_1, beta_2, eps值没有改变的,可以删除。

@upvenly upvenly merged commit d583c5c into FlagOpen:main May 25, 2023
@ScoThunder ScoThunder deleted the cpm branch May 30, 2023 07:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants