
Conversation

@zhangboSJTU
Contributor

PR types

Bug fixes

PR changes

OPs

Description

Fix the compile error in the XPU2 KP build.

@paddle-bot

paddle-bot bot commented May 6, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@zhangboSJTU zhangboSJTU requested a review from AnnaTrainingG May 6, 2023 04:57
const uint32_t numel,
int read_lens) {
using Type = std::tuple_element_t<Index, ArgsT>;
#ifdef PADDLE_WITH_XPU_KP
Contributor

Is the vec_size at line 89 related only to out? Before your change, the code depended on both in and out; I'm not sure whether this could hide a performance problem.

Contributor Author
@zhangboSJTU zhangboSJTU May 8, 2023

For reference: elementwise also takes only the out vec_size.

Contributor

elementwise can do that because its dims are identical, whereas for broadcast the input and output dims may differ…

Contributor Author

vec_size was originally min(in, out, 4) and is now min(out, 4), so the new value is >= the previous one and should not cause a performance regression. Is there some other consideration that requires adding the inputs back?
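
To make the comparison concrete, here is a minimal, self-contained C++ sketch of the two policies being debated (helper names are invented for illustration; this is not Paddle's actual code). Because a minimum taken over a subset of buffers can only be greater than or equal to the minimum over all of them, the outputs-only vec_size is never smaller than the old one:

#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical helper: widest vector width (in elements) a buffer supports,
// derived from its address alignment, capped at 4.
int VecWidthOf(const void* ptr, int elem_bytes) {
  auto addr = reinterpret_cast<std::uintptr_t>(ptr);
  for (int w = 4; w > 1; w /= 2) {
    if (addr % (static_cast<std::uintptr_t>(w) * elem_bytes) == 0) return w;
  }
  return 1;
}

// Old policy: minimum over inputs and outputs.
int VecSizeInsAndOuts(const std::vector<const void*>& ins,
                      const std::vector<const void*>& outs, int elem_bytes) {
  int v = 4;
  for (const void* p : ins) v = std::min(v, VecWidthOf(p, elem_bytes));
  for (const void* p : outs) v = std::min(v, VecWidthOf(p, elem_bytes));
  return v;
}

// New policy: minimum over outputs only; never smaller than the old value.
int VecSizeOutsOnly(const std::vector<const void*>& outs, int elem_bytes) {
  int v = 4;
  for (const void* p : outs) v = std::min(v, VecWidthOf(p, elem_bytes));
  return v;
}

This is also why the reviewer asks about hidden costs: an output-derived width is always valid for the stores, but input-side accesses might not benefit from it.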

__simd__ ArgsT args[VecSize];
__simd__ ConditionalT<OutT, NumOuts> result[VecSize];

#ifdef PADDLE_WITH_XPU_KP
Contributor

Why does KP need to be special-cased here?

Contributor

XPU KP's broadcast behavior is the same as GPU's, isn't it?

Contributor Author

Earlier, Mingshu added a specialized optimization to the GPU broadcast path (removing the redundant fast_divmod computation inside it). To keep that optimization effective, the GPU path has to be split out here.
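
For context on the optimization being preserved: fast_divmod replaces hardware integer division with a precomputed multiply-and-shift, and the GPU specialization avoids recomputing that state for every element when decomposing linear indices into broadcast coordinates. A minimal sketch of the technique follows (the standard magic-number method, not Paddle's exact implementation; it assumes divisors below 2^31, which tensor dims satisfy):

#include <cstdint>

// For divisor d, pick s = ceil(log2(d)) and m = ceil(2^(32+s) / d); then
// floor(n / d) == ((((n * (m - 2^32)) >> 32) + n) >> s) for all 32-bit n.
struct FastDivModSketch {
  uint32_t divisor;
  uint32_t shift;       // s = ceil(log2(divisor))
  uint32_t multiplier;  // m - 2^32; fits in 32 bits for divisor < 2^31

  explicit FastDivModSketch(uint32_t d) : divisor(d), shift(0) {
    while ((1ull << shift) < d) ++shift;
    uint64_t m = ((1ull << (32 + shift)) + d - 1) / d;  // ceil(2^(32+s) / d)
    multiplier = static_cast<uint32_t>(m - (1ull << 32));
  }

  uint32_t Div(uint32_t n) const {
    uint64_t hi = (static_cast<uint64_t>(n) * multiplier) >> 32;
    return static_cast<uint32_t>((hi + n) >> shift);
  }

  void Divmod(uint32_t n, uint32_t* q, uint32_t* r) const {
    *q = Div(n);
    *r = n - *q * divisor;  // remainder via one multiply, no second division
  }
};

Each broadcast coordinate then costs a multiply and two shifts instead of a hardware division, which is why hoisting the repeated computation out of the inner loop is worth a GPU-only code path.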

} else {
  BcUnroller<BroadcastDataLoader, IsBoundary, LoadType, VecSize, Arity>::step(
-     ins, args, configs, use_broadcast, block_offset, num, numel);
+     ins, args, configs, use_broadcast, block_offset, num, numel, read_lens);
Contributor

read_lens is for XPU KP. This code is already inside the else branch, so why does read_lens still need to be added here?

Contributor Author

As in the previous comment: the GPU part is specialized, but KP and GPU share the same non-specialized function, so the parameter lists have to stay consistent.
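
A schematic, self-contained sketch of that constraint (identifiers are invented, not Paddle's actual code): the GPU-only specialization sits behind the macro, while the generic fallback is instantiated on both GPU and XPU KP and therefore must accept read_lens on both backends, even though GPU always passes read_lens == VecSize.

#include <algorithm>

// Shared non-specialized path: one signature for GPU and KP.
template <int VecSize>
void GenericLoad(const float* src, float* dst, int num, int read_lens) {
  // On GPU read_lens == VecSize; on XPU KP it is a runtime value.
  for (int i = 0; i < std::min(num, read_lens); ++i) dst[i] = src[i];
}

template <int VecSize>
void Load(const float* src, float* dst, int num, int read_lens,
          bool use_specialized_path) {
#ifndef PADDLE_WITH_XPU_KP
  if (use_specialized_path) {
    // GPU-only specialization: compile-time width, precomputed index math.
    for (int i = 0; i < VecSize; ++i) dst[i] = src[i];
    return;
  }
#endif
  GenericLoad<VecSize>(src, dst, num, read_lens);
}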

}
#pragma unroll
for (int idx = 0; idx < read_lens; ++idx) {
std::get<Index>(dst[idx]) = in_temp[0];
Contributor

This is wrong! read_lens is the vectorized read width, which means read_lens elements should end up stored in dst, but this loop only ever writes in_temp[0]. Wrong!!!

Contributor Author

There was no IsBoundary condition here, and I was unsure of the relationship between read_lens and Nx, so I didn't know exactly how the data was being read. It's clear now, and this has been fixed.
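
For completeness, a hedged sketch of the corrected store (the committed code may differ in detail): each of the read_lens vectorized elements is scattered into its own dst slot instead of broadcasting in_temp[0].

#include <cstddef>
#include <tuple>

template <size_t Index, typename ArgsT, typename T>
void ScatterVectorized(ArgsT* dst, const T* in_temp, int read_lens) {
  for (int idx = 0; idx < read_lens; ++idx) {
    std::get<Index>(dst[idx]) = in_temp[idx];  // was in_temp[0] before the fix
  }
}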

@zhangboSJTU zhangboSJTU requested a review from AnnaTrainingG May 9, 2023 02:42
Contributor
@ZzSean ZzSean left a comment

LGTM for CI-OP-Benchmark

@AnnaTrainingG AnnaTrainingG changed the title fix kp compile bug Fix xpu2 kp compile error May 9, 2023
@AnnaTrainingG AnnaTrainingG merged commit 8d340ee into PaddlePaddle:develop May 9, 2023
zhangboSJTU added a commit to zhangboSJTU/Paddle that referenced this pull request May 9, 2023
XiaoguangHu01 pushed a commit that referenced this pull request May 10, 2023
…to Release/2.5 (#53623)

* Support different dtypes of inputs for broadcast for dropout optimization  (#52093)

* change judgement for DropoutGradGPUKernelDriver

* add UnrollerWithoutVecSize and after this Loaddata to be refined

* pass unittest

* use same unroller with XPU

* BroadcastWithInt64Index

* BroadcastDataLoader template partial specialization

* fix compile errs in ROCms

* PR comment

* dropout_nd_optimization (#51479)

* with printf

* add DropOutNdForwardKernel

* PR comment

* Dropout optimize & clean broadcast inT and ElementwiseType (#52969)

* change judgement for DropoutGradGPUKernelDriver

* add UnrollerWithoutVecSize and after this Loaddata to be refined

* pass unittest

* use same unroller with XPU

* BroadcastWithInt64Index

* BroadcastDataLoader template partial specialization

* fix compile errs in ROCms

* clean ElementwiseT and InT for BroadcastKernel

* default axis and clean inT

* remove redundant fast divmod computation

* optimize drop_nd & drop_nd_grad

* optimize BroadcastDataLoader bf16 fp16

* rm InT etc. after merge develop

* delete constexpr for windows ci

* fix conflict

* fix conflic with develop

* fix conflic

* new clean

* clean

* Fix xpu2 kp compile error (#53548)

* fix conflict

* conflict
