Add elementwise maximum/minimum ops #4069
Conversation
} // namespace

namespace user_op {
The macro definitions and op registrations below don't need to live inside the user_op namespace; putting them directly in the oneflow namespace is enough.
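For illustration, a minimal sketch of the suggested placement (the op name and argument list are placeholders patterned on this PR, not the exact registration):

namespace oneflow {

// The registration sits directly in the oneflow namespace; the
// REGISTER_USER_OP macro pulls in the user_op machinery on its own.
REGISTER_USER_OP("elementwise_maximum")
    .Input("x")
    .Input("y")
    .Output("z");

}  // namespace oneflow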
.InputBind("dz", ctx->FwOp().output_grad("z", 0)) \ | ||
.InputBind("x", ctx->FwOp().input("x", 0)) \ | ||
.InputBind("y", ctx->FwOp().input("y", 0)) \ | ||
.Output("dx") \ |
.Output("dx")
.Output("dy")
这两个 output 是不是需要分别根据 x.need_grad 和 y.need_grad 来设置的
if (x.need_grad()) {
  builder.Output("dx");
}
if (y.need_grad()) {
  builder.Output("dy");
}
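Combining the two points above, a hedged sketch of what the grad-op definition could look like (assuming the user_op::BackwardOpBuilder API used elsewhere in this diff; grad_op_name and the op type name are placeholders):

ctx->DefineOp(grad_op_name, [&ctx](user_op::BackwardOpBuilder& builder) {
  builder.OpTypeName("elementwise_maximum_backward")
      .InputBind("dz", ctx->FwOp().output_grad("z", 0))
      .InputBind("x", ctx->FwOp().input("x", 0))
      .InputBind("y", ctx->FwOp().input("y", 0));
  // Emit only the gradient outputs that are actually required.
  if (ctx->FwOp().NeedGenGradTensor4OpInput("x", 0)) { builder.Output("dx"); }
  if (ctx->FwOp().NeedGenGradTensor4OpInput("y", 0)) { builder.Output("dy"); }
  return builder.Build();
});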
}
};
} // namespace
template<typename FunctorT, typename T>
inline cudaError_t XimumGrad(FunctorT functor, int64_t n, T* dx, T* dy const T* x, const T* y, const T* dz,
cudaStream_t stream) {
using FactoryT = SimpleFactory<FunctorT>;
return GenericLauncher<FactoryT, R, A, B, C>::Launch(FactoryT(functor), n, dx, dy, x, y, dz, stream);
}
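A hedged usage sketch of the launcher above (elem_cnt, the pointers, and the functor are placeholders; the functor could be one like the MaximumBackwardFunctor sketched further down this thread, and OF_CUDA_CHECK is OneFlow's CUDA error-check macro):

OF_CUDA_CHECK(XimumGrad(MaximumBackwardFunctor<float>(), elem_cnt, dx, dy, x, y, dz,
                        ctx->device_ctx()->cuda_stream()));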
template<template<typename> class BackwardFunctor, typename T> | ||
__global__ void ElementwiseBackwardGradGpu(int64_t elem_cnt, const T* dz, const T* x, const T* y, | ||
T* dx, T* dy) { | ||
BackwardFunctor<T>::Backward(elem_cnt, dz, x, y, dx, dy); |
CUDA_1D_LOOP(i, elem_cnt) {
  BackwardFunctor<T>::Backward(dz, x, y, dx, dy);
}
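For context, a minimal sketch of what the per-element backward computes (the functor name and the tie-breaking choice at x == y are illustrative, not necessarily what the PR merged): dz flows to whichever input won the elementwise comparison.

template<typename T>
struct MaximumBackwardFunctor {
  // Route dz to x where x >= y, otherwise to y; dx/dy may be null when the
  // corresponding input does not need a gradient.
  __device__ void operator()(T dz, T x, T y, T* dx, T* dy) const {
    if (dx != nullptr) { *dx = (x >= y) ? dz : static_cast<T>(0); }
    if (dy != nullptr) { *dy = (x >= y) ? static_cast<T>(0) : dz; }
  }
};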
* start up of ADD grad for maximum and minimum
* refine batch axis
* add GPU version
* add minimum backward
* add static shape unit test
* add dynamic test
* add sbp and batchaxis infer
* refine files hierarchy
* elementwise maximum and minimum finished
* refine on checking dx/dy if exists
* refine (use template functors)
* refine test case

Co-authored-by: MARD1NO <359521840@qq.com>
Co-authored-by: ZZK <42901638+MARD1NO@users.noreply.github.com>
Co-authored-by: Zailiang <zailiangyu@gmail.com>
Co-authored-by: oneflow-ci-bot <69100618+oneflow-ci-bot@users.noreply.github.com>

Former-commit-id: 823603c
Overview

To support algorithm development at Zhejiang University of Technology, this PR adds the backward passes for the math.maximum and math.minimum operators. OneFlow's existing versions of these two interfaces support broadcasting, which makes their backward passes complex to implement. As a compromise, this PR adds two new ops, elementwise_maximum and elementwise_minimum, each with forward and backward implemented. The Python frontend then inspects the shapes of x and y and dispatches to either the elementwise or the broadcast op. In other words, once this PR is merged, OneFlow supports the backward pass for the elementwise maximum and minimum, but not yet for the broadcast variants.

Feature Checklist
Note: the feature checkboxes are all optional; if one is left unchecked, just state the reason. For example: an op assembled from Python interfaces needs no SetBatchAxisInferFn registration; likewise, an op with no inputs needs no SetInputArgModifyFn. Checkboxes that come with the template may be left blank but must not be deleted; extra checkbox items may be added as needed.
* Op
* Kernel
  * CPU x:float32 y:float32
  * CPU x:float32 y:float32
  * GPU x:float32 y:float32
  * GPU x:float32 y:float32
* Python Wrapper
Tests
GPU effective bandwidth: not measured, because the kernels are built on the CUDA elementwise template.
PR Checklist
* Type label (bug, enhancement, purge, feature, documentation)
* Module label (op, system, eager, build, xla, python, ci, test, tooling, onnx)