[AMDGPU] Mark `amdgcn_tanh` as canonicalized #150059

shiltian · 2025-07-22T16:38:27Z

Co-authored-by: Mekhanoshin, Stanislav Stanislav.Mekhanoshin@amd.com

shiltian · 2025-07-22T16:38:48Z

[AMDGPU] Mark amdgcn_tanh as canonicalized #150059 👈 (View in Graphite)
[AMDGPU] Add support for v_cvt_pk_bf16_f32 on gfx1250 #150053
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

llvmbot · 2025-07-22T16:39:00Z

@llvm/pr-subscribers-backend-amdgpu

Author: Shilei Tian (shiltian)

Changes

Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>

Full diff: https://github.com/llvm/llvm-project/pull/150059.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIISelLowering.cpp (+1)
(added) llvm/test/CodeGen/AMDGPU/fcanonicalize-elimination.bf16.ll (+44)

diff --git a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
index bc0fd8d4e814b..d65c3ae76566b 100644
--- a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
@@ -13633,6 +13633,7 @@ bool SITargetLowering::isCanonicalized(SelectionDAG &DAG, SDValue Op,
     case Intrinsic::amdgcn_rcp_legacy:
     case Intrinsic::amdgcn_rsq_legacy:
     case Intrinsic::amdgcn_trig_preop:
+    case Intrinsic::amdgcn_tanh:
     case Intrinsic::amdgcn_log:
     case Intrinsic::amdgcn_exp2:
     case Intrinsic::amdgcn_sqrt:
diff --git a/llvm/test/CodeGen/AMDGPU/fcanonicalize-elimination.bf16.ll b/llvm/test/CodeGen/AMDGPU/fcanonicalize-elimination.bf16.ll
new file mode 100644
index 0000000000000..85e7038b38563
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/fcanonicalize-elimination.bf16.ll
@@ -0,0 +1,44 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx1250 %s -o - | FileCheck -check-prefix=GCN %s
+
+define float @test_canonicalize_amdgcn_tanh_f32(float %a) {
+; GCN-LABEL: test_canonicalize_amdgcn_tanh_f32:
+; GCN:       ; %bb.0:
+; GCN-NEXT:    s_wait_loadcnt_dscnt 0x0
+; GCN-NEXT:    s_wait_kmcnt 0x0
+; GCN-NEXT:    v_tanh_f32_e32 v0, v0
+; GCN-NEXT:    s_set_pc_i64 s[30:31]
+  %tanh = call float @llvm.amdgcn.tanh.f32(float %a)
+  %canonicalized = call float @llvm.canonicalize.f32(float %tanh)
+  ret float %canonicalized
+}
+
+define bfloat @test_canonicalize_amdgcn_tanh_bf16(bfloat %a) {
+; GCN-LABEL: test_canonicalize_amdgcn_tanh_bf16:
+; GCN:       ; %bb.0:
+; GCN-NEXT:    s_wait_loadcnt_dscnt 0x0
+; GCN-NEXT:    s_wait_kmcnt 0x0
+; GCN-NEXT:    v_tanh_bf16_e32 v0, v0
+; GCN-NEXT:    v_nop
+; GCN-NEXT:    s_delay_alu instid0(TRANS32_DEP_1) | instskip(NEXT) | instid1(VALU_DEP_1)
+; GCN-NEXT:    v_lshlrev_b32_e32 v0, 16, v0
+; GCN-NEXT:    v_max_num_f32_e32 v0, v0, v0
+; GCN-NEXT:    s_delay_alu instid0(VALU_DEP_1)
+; GCN-NEXT:    v_cvt_pk_bf16_f32 v0, v0, s0
+; GCN-NEXT:    s_set_pc_i64 s[30:31]
+  %tanh = call bfloat @llvm.amdgcn.tanh.bf16(bfloat %a)
+  %canonicalized = call bfloat @llvm.canonicalize.bf16(bfloat %tanh)
+  ret bfloat %canonicalized
+}
+
+define half @test_canonicalize_amdgcn_tanh_f16(half %a) {
+; GCN-LABEL: test_canonicalize_amdgcn_tanh_f16:
+; GCN:       ; %bb.0:
+; GCN-NEXT:    s_wait_loadcnt_dscnt 0x0
+; GCN-NEXT:    s_wait_kmcnt 0x0
+; GCN-NEXT:    v_tanh_f16_e32 v0, v0
+; GCN-NEXT:    s_set_pc_i64 s[30:31]
+  %tanh = call half @llvm.amdgcn.tanh.f16(half %a)
+  %canonicalized = call half @llvm.canonicalize.f16(half %tanh)
+  ret half %canonicalized
+}

shiltian · 2025-07-22T23:55:45Z

Merge activity

Jul 22, 11:55 PM UTC: A user started a stack merge that includes this pull request via Graphite.
Jul 23, 12:01 AM UTC: Graphite rebased this pull request as part of a merge.
Jul 23, 12:03 AM UTC: @shiltian merged this pull request with Graphite.

Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>

shiltian requested review from changpeng and rampitec July 22, 2025 16:38

llvmbot added the backend:AMDGPU label Jul 22, 2025

shiltian requested a review from arsenm July 22, 2025 16:38

shiltian mentioned this pull request Jul 22, 2025

[AMDGPU] Add support for v_cvt_pk_bf16_f32 on gfx1250 #150053

Merged

rampitec approved these changes Jul 22, 2025

View reviewed changes

shiltian force-pushed the users/shiltian/v_cvt_pk_bf16_f32 branch from cd79e69 to 32c56c8 Compare July 22, 2025 17:31

shiltian force-pushed the users/shiltian/add-fcanonicalize-elimination-bf16 branch from cff95bc to 6d98ee9 Compare July 22, 2025 17:31

shiltian force-pushed the users/shiltian/v_cvt_pk_bf16_f32 branch from 32c56c8 to b37ed4d Compare July 22, 2025 23:57

Base automatically changed from users/shiltian/v_cvt_pk_bf16_f32 to main July 23, 2025 00:00

[AMDGPU] Mark amdgcn_tanh as canonicalized

52d0e94

Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>

shiltian force-pushed the users/shiltian/add-fcanonicalize-elimination-bf16 branch from 6d98ee9 to 52d0e94 Compare July 23, 2025 00:01

shiltian merged commit 7fc6556 into main Jul 23, 2025
7 of 9 checks passed

shiltian deleted the users/shiltian/add-fcanonicalize-elimination-bf16 branch July 23, 2025 00:03

mahesh-attarde pushed a commit to mahesh-attarde/llvm-project that referenced this pull request Jul 28, 2025

[AMDGPU] Mark amdgcn_tanh as canonicalized (llvm#150059)

e804498

Co-authored-by: Mekhanoshin, Stanislav <Stanislav.Mekhanoshin@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU] Mark `amdgcn_tanh` as canonicalized #150059

[AMDGPU] Mark `amdgcn_tanh` as canonicalized #150059

Uh oh!

shiltian commented Jul 22, 2025

Uh oh!

shiltian commented Jul 22, 2025 •

edited

Loading

Uh oh!

llvmbot commented Jul 22, 2025

Uh oh!

shiltian commented Jul 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[AMDGPU] Mark amdgcn_tanh as canonicalized #150059

[AMDGPU] Mark amdgcn_tanh as canonicalized #150059

Uh oh!

Conversation

shiltian commented Jul 22, 2025

Uh oh!

shiltian commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Jul 22, 2025

Uh oh!

shiltian commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[AMDGPU] Mark `amdgcn_tanh` as canonicalized #150059

[AMDGPU] Mark `amdgcn_tanh` as canonicalized #150059

shiltian commented Jul 22, 2025 •

edited

Loading

shiltian commented Jul 22, 2025 •

edited

Loading