【AMP OP&Test】instance_norm fp16 and bf16 support. #52241

qizhaoaoe · 2023-03-28T11:01:18Z

PR types

Others

PR changes

OPs

Describe

Add fp16 and bf16 dtype support to instance_norm.
Add the related op tests.


paddle-bot · 2023-03-28T11:01:25Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.


ZzSean · 2023-04-03T06:31:45Z

paddle/phi/kernels/gpu/instance_norm_grad_kernel.cu

                                     const T *x,
-                                     const BatchNormParamType<T> *variance,
+                                     const BatchNormParamType<AccT> *variance,


Take a look at the definition of BatchNormParamType. There is no need to use AccT here; just use T directly. BatchNormParamType<T> with T == fp16 or bf16 is already float.
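The point of this comment can be sketched as a small lookup table. This is only an illustration in Python of the type mapping the reviewer describes, not the C++ definition: the table name, the helper, and the `float32` entry are assumptions made for the example.

```python
# Hypothetical model of the parameter-type mapping described in the review.
# Only the fp16/bf16 -> float rows come from the comment; the float32 row
# is an assumption added so that AccT (float) can be looked up as well.
BATCH_NORM_PARAM_TYPE = {
    "float16": "float32",
    "bfloat16": "float32",
    "float32": "float32",
}


def batch_norm_param_type(dtype):
    """Return the parameter dtype used for a given input dtype."""
    return BATCH_NORM_PARAM_TYPE[dtype]


# With T = float16 the accumulation type AccT is float32, and both
# spellings resolve to the same parameter type, so AccT is redundant here.
t = "float16"
acc_t = "float32"
same = batch_norm_param_type(t) == batch_norm_param_type(acc_t)
```

Under these assumptions `same` is true for both low-precision types, which is why the signature can keep `BatchNormParamType<T>`.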

ZzSean · 2023-04-03T06:37:08Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+        self.bias = np.random.random([self.shape[1]]).astype(np.float32)
+
+    def set_err_thre(self):
+        self.atol = 1e-3


Does the fp32 error threshold really need to be this large?

I tried 1e-4 and it does not pass, so shall we keep 1e-3?
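For context on what such a threshold is compared against, here is a minimal pure-Python sketch of an instance-norm reference. It assumes an input laid out as nested lists `[N][C][L]` and `eps = 1e-5` (the value used in the test setup); the function name is hypothetical and this is not the reference function from the test file.

```python
import math


def instance_norm_reference(x, eps=1e-5):
    """Normalize each (sample, channel) slice to zero mean and unit variance.

    x is a nested list shaped [N][C][L]; scale and bias are omitted.
    """
    out = []
    for sample in x:
        out_sample = []
        for channel in sample:
            mean = sum(channel) / len(channel)
            var = sum((v - mean) ** 2 for v in channel) / len(channel)
            inv_std = 1.0 / math.sqrt(var + eps)
            out_sample.append([(v - mean) * inv_std for v in channel])
        out.append(out_sample)
    return out


# Two channels that differ only by a factor of 10 normalize to nearly
# the same values; the small gap comes from eps.
x = [[[1.0, 2.0, 3.0, 4.0], [10.0, 20.0, 30.0, 40.0]]]
y = instance_norm_reference(x)
```

A kernel output would then be checked element-wise against such a reference with the chosen `atol`.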

ZzSean · 2023-04-03T06:37:42Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+        self.max_relative_error = 1e-2
+        self.inputs = {
+            'X': convert_float_to_uint16(self.value),
+            'Scale': convert_float_to_uint16(self.scale),


For bf16, the scale and bias here shouldn't need to be converted either, right?

ZzSean · 2023-04-03T11:12:09Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+        return x_norm, mean.reshape(N * C), std.reshape(N * C)
+
+    def test_check_output(self):
+        place = core.CUDAPlace(0)


This needs to check whether the build is a GPU build. Alternatively, call check_output directly here and override it for bf16 below.
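One way to express the guard requested here is a skip decorator. The sketch below assumes `paddle.is_compiled_with_cuda()` as the build check and treats a missing Paddle installation as "no CUDA"; the class name is hypothetical and the test body is a placeholder rather than the PR's actual check.

```python
import unittest

try:
    import paddle

    HAS_CUDA = paddle.is_compiled_with_cuda()
except ImportError:
    # Assumption for this sketch: no Paddle means no GPU build.
    HAS_CUDA = False


@unittest.skipIf(not HAS_CUDA, "requires a CUDA build of Paddle")
class TestInstanceNormFP16Sketch(unittest.TestCase):
    def test_check_output(self):
        # A real op test would construct the CUDA place here and compare
        # the kernel output against the reference on that place.
        self.assertTrue(HAS_CUDA)
```

On a CPU-only build the whole class is reported as skipped instead of failing when the CUDA place is constructed.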

ZzSean · 2023-04-04T03:55:50Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+
+    def set_err_thre(self):
+        self.atol = 0.03125
+        self.max_relative_error = 8e-3


Does it fail to pass even with the default values?

ZzSean · 2023-04-04T03:56:04Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+        self.init_dtype()
+        self.init_shape()
+        self.init_value()
+        self.atol = 1e-2


The default for bf16 is 1e-2, so there is no need to set it.

ZzSean · 2023-04-04T03:57:13Z

paddle/phi/kernels/gpu/instance_norm_kernel.cu

@@ -22,6 +22,11 @@
 #include "paddle/phi/kernels/funcs/norm_utils.h"
 #include "paddle/phi/kernels/gpu/instance_norm_utils.h"

+#include "paddle/phi/common/bfloat16.h"
+#include "paddle/phi/common/data_type.h"


Remove the unused headers; data_type.h should already include bfloat16 and float16. Also confirm whether device_context.h is needed.

Removed the includes of data_type.h and device_context.h.

ZzSean · 2023-04-04T03:57:28Z

paddle/phi/kernels/gpu/instance_norm_grad_kernel.cu

@@ -22,8 +22,13 @@
 #include "paddle/phi/kernels/funcs/norm_utils.h"
 #include "paddle/phi/kernels/gpu/instance_norm_utils.h"

+#include "paddle/phi/common/bfloat16.h"


Remove the unused headers.

ZzSean · 2023-04-04T08:55:14Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+
+    def init_value(self):
+        np.random.seed(0)
+        self.value = np.random.random(self.shape).astype(self.dtype)


For bf16, this should be initialized as fp32 and then converted.
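The "initialize as fp32, then convert" pattern can be sketched with the standard library alone. The two helpers below are hypothetical stand-ins, not the test suite's own conversion function, and they assume conversion by truncation (keeping the upper 16 bits of the float32 encoding); a real helper may round differently.

```python
import struct


def float32_to_bf16_bits(value):
    """Keep the upper 16 bits of the float32 encoding (truncation)."""
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    return bits >> 16


def bf16_bits_to_float32(bits):
    """Expand 16 stored bits back to float32 by zero-filling the low bits."""
    (value,) = struct.unpack("<f", struct.pack("<I", bits << 16))
    return value


# Generate the data in fp32 first, then convert it for the bf16 input.
values_fp32 = [0.5, 1.0, 3.14159, 0.1]
stored = [float32_to_bf16_bits(v) for v in values_fp32]
restored = [bf16_bits_to_float32(b) for b in stored]
```

Under the truncation assumption, each restored value is within a relative error of 2**-7 (about 7.8e-3) of its fp32 source, which is the precision loss the bf16 tolerances have to absorb.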

ZzSean · 2023-04-04T08:56:06Z

python/paddle/fluid/tests/unittests/test_instance_norm_op_v2.py

+        self.python_api = instance_norm_warpper
+        self.eps = 1e-5
+        self.data_format = "NCHW"
+        self.init_dtype()


The bf16 unit test does not override init_dtype, so dtype is currently still float32.
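The inheritance issue pointed out here can be reproduced in a few lines. All class names are hypothetical, and storing bf16 data as `"uint16"` is an assumption based on the uint16 conversion used elsewhere in the tests.

```python
class InstanceNormTestBase:
    """Stand-in for the fp32 test: setUp delegates to init_dtype."""

    def setUp(self):
        self.init_dtype()

    def init_dtype(self):
        self.dtype = "float32"


class BF16TestWithoutOverride(InstanceNormTestBase):
    # No init_dtype here, so the base class version runs.
    pass


class BF16TestWithOverride(InstanceNormTestBase):
    def init_dtype(self):
        # Assumption: bf16 tensors are represented as uint16 in the test.
        self.dtype = "uint16"


missing = BF16TestWithoutOverride()
missing.setUp()
fixed = BF16TestWithOverride()
fixed.setUp()
```

After `setUp`, `missing.dtype` is still `"float32"` while `fixed.dtype` is `"uint16"`, so only the second class actually exercises the bf16 path.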

ZzSean

LGTM

luotao1

LGTM

JiabinYang

LGTM

qizhaoaoe added 2 commits March 28, 2023 15:35

add fp16 and bf16 support for instance_norm

da9fbbd

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

2b7111c

… instance_norm_amp

qizhaoaoe added 3 commits March 29, 2023 10:40

fix /= operator which not support bf16

4b414d8

fix instance_norm_grad kernel and unittests.

e491361

fix fp32 unittests.

134fbcc

qizhaoaoe force-pushed the instance_norm_amp branch from 4027894 to 134fbcc Compare March 30, 2023 09:40

qizhaoaoe added 5 commits March 31, 2023 15:29

fix instance_norm_kernel and unittests.

ecd7ae1

fix instance_norm_grad_kernel and unittest threshold.

0006187

add fp16/bf16 for instance_norm_grad_grad op.

e0af6d2

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

f62fd45

… instance_norm_amp

add bf16 dtype check.

a48f6a0

ZzSean reviewed Apr 3, 2023


fix conflicts.

9f5a3a9

ZzSean reviewed Apr 3, 2023


qizhaoaoe added 3 commits April 3, 2023 20:18

fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.

db1703d

fix type in instance_norm_kernel.

0e25977

fix bf16 outputs in unittests and refine codes.

40ccc84

ZzSean reviewed Apr 4, 2023


qizhaoaoe added 4 commits April 4, 2023 12:16

fix dx computation.

89947c1

delete unuseful params and head including.

fdb4f4a

add fp16/bf16 for static graph.

248e9c3

fix device condiction for instance_norm op.

b012cd6

ZzSean reviewed Apr 4, 2023


qizhaoaoe added 2 commits April 4, 2023 17:55

fix instance_norm_grad_grad and bf16 op tests.

6d9dd8d

fix op_test to support grad of bf16 can be compared with fp32.

1a674b5

qizhaoaoe and others added 4 commits April 6, 2023 21:19

Merge branch 'develop' into instance_norm_amp

ae01635

remove updates.

a4d9453

add self-defined grad.

c62e9b6

Merge remote-tracking branch 'upstream/develop' into instance_norm_amp

c497cac

ZzSean approved these changes Apr 10, 2023


luotao1 approved these changes Apr 10, 2023


JiabinYang approved these changes Apr 10, 2023


ZzSean merged commit 7c98abd into PaddlePaddle:develop Apr 10, 2023
