add masked fill op #3515
Conversation
oneflow/user/ops/masked_fill_op.cpp
Outdated
REGISTER_USER_OP("masked_fill")
    .Input("x")
    .Input("mask")
    .Attr("value", UserOpAttrType::kAtFloat)
masked_fill supports multiple data types, and a single float attr cannot represent all of them exactly; see scalar_mul for reference.
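A sketch of the scalar_mul-style registration the reviewer is pointing to, with separate typed operands so both integer and floating fill values stay exact. The attr names (`has_int_operand`, `int_operand`, `float_operand`) are taken from scalar_mul in this era of the codebase and should be treated as assumptions, not the final API:

```cpp
// Hypothetical sketch mirroring scalar_mul: two typed attrs plus flags,
// so the fill value is exact for both integer and floating dtypes.
REGISTER_USER_OP("masked_fill")
    .Input("x")
    .Input("mask")
    .Output("out")
    .Attr("has_int_operand", UserOpAttrType::kAtBool)
    .Attr("has_float_operand", UserOpAttrType::kAtBool)
    .Attr("int_operand", UserOpAttrType::kAtInt64)
    .Attr("float_operand", UserOpAttrType::kAtDouble);
```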
…dev_add_op_masked_fill
namespace {

__global__ void NaiveHalfFillGpu(const int64_t elem_cnt, const float16 x, float16* y) {
NewKernelUtil<device_type>::Fill already supports fp16. Shouldn't it be enough to handle the conversion of the operand to float16, rather than writing a new kernel?
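A minimal sketch of that suggestion, reusing the existing Fill with a cast instead of a dedicated half kernel. The attr name `value` comes from the quoted registration above; the surrounding kernel context (`ctx`, `out`, `T`) is assumed:

```cpp
// Hypothetical: read the scalar attr, cast it to the tensor's element type
// (including float16), and reuse the existing Fill utility.
const float value = ctx->Attr<float>("value");
NewKernelUtil<device_type>::Fill(ctx->device_ctx(), out->shape().elem_cnt(),
                                 static_cast<T>(value), out->mut_dptr<T>());
```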
REGISTER_HALF_CONSTANT_LIKE_KERNEL

}  // namespace oneflow
Note the blank line here.
namespace {

__global__ void HalfAddByScalarPtrGpu(const int64_t n, const half* x, const half* y, half* z) {
This should just be adding a float16 specialization to the existing XxxByScalarPtr, right?
namespace {

template<typename CondT>
__global__ void NaiveHalfWhere(const int64_t elem_cnt, const CondT* cond, const half* lhs,
Likewise, adding float16 support directly to where should be enough here.
__global__ void HalfAddByScalarPtrGpu(const int64_t n, const half* x, const half* y, half* z) {
  const half y_value = y[0];
  CUDA_1D_KERNEL_LOOP(i, n) { z[i] = x[i] + y_value; }
The float16/half type cannot be added with a plain +; use __hadd instead.
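A sketch of the quoted kernel with that fix applied: device-side half arithmetic goes through CUDA's __hadd intrinsic (from cuda_fp16.h) rather than operator+, which is not available for half on all architectures and compilation modes. CUDA_1D_KERNEL_LOOP is OneFlow's grid-stride loop macro and is assumed to be in scope:

```cuda
#include <cuda_fp16.h>

// Sketch of the corrected kernel: half-precision addition via __hadd.
__global__ void HalfAddByScalarPtrGpu(const int64_t n, const half* x, const half* y, half* z) {
  const half y_value = y[0];
  CUDA_1D_KERNEL_LOOP(i, n) { z[i] = __hadd(x[i], y_value); }
}
```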
change pair to macro Co-authored-by: Juncheng <liujuncheng1022@gmail.com>
Add masked_fill op
reference https://pytorch.org/docs/stable/tensors.html?highlight=masked_fill#torch.Tensor.masked_fill