- 
                Notifications
    You must be signed in to change notification settings 
- Fork 5.9k
[dtype] add fp16 support for dist_kernel #56184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
| 你的PR提交成功,感谢你对开源项目的贡献! | 
| Ready for review~ @zhangting2020 | 
| 请问之前的实现引起推理精度下降的问题,在这个PR上有做过验证吗? #53525 | 
| private: | ||
| T p_order_; | ||
| Ty p_order; | ||
| }; | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
变量的命名请参考google-cpp-styleguide
| phi::DistGradKernel, | ||
| float, | ||
| double, | ||
| phi::dtype::float16) {} | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
bfloat16也记得注册一下,另外添加对应单测
        
          
                python/paddle/tensor/linalg.py
              
                Outdated
          
        
      | x (Tensor): 1-D to 6-D Tensor, its data type is float32 or float64. | ||
| y (Tensor): 1-D to 6-D Tensor, its data type is float32 or float64. | ||
| x (Tensor): 1-D to 6-D Tensor, its data type is float16, float32 or float64. | ||
| y (Tensor): 1-D to 6-D Tensor, its data type is float16, float32 or float64. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
注册bfloat16类型后,文档记得更新
| ) | ||
| check_variable_and_dtype( | ||
| y, 'dtype', ['float16', 'float32', 'float64'], 'dist' | ||
| ) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
同上,bfloat16类型的支持
| 
 比较尴尬,手上没有Turing及以前架构的卡,重新编一个Ampare的可行吗? | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
记得同步更新下中文文档
PR types
Function optimization
PR changes
OPs
Description
refer to #50915