Commit 5b1350f

Hzfengsy authored and Laurawly committed
TensorCore Support using Intrinsic (apache#4136)
* add tensor core support
* avoid memory bank conflict
* fix thread sync & better performance
* better performance
* add schedule test for conv2d
* extend into BatchMatMul
* support config fragment shape and layout using intrinsic
* add TensorCore tutorial
* add int support and fix lint
* address comment
* add 32*16*8 TensorCore test
* fix wmma include logic
1 parent 6e0dbee commit 5b1350f

1 file changed: +1 -0 lines changed

python/vta/build_module.py

Lines changed: 1 addition & 0 deletions
@@ -80,6 +80,7 @@ def add_debug(stmt):
     if debug_flag:
         pass_list.append((1, add_debug))
     pass_list.append((2, ir_pass.inject_alu_intrin))
+    pass_list.append((3, tvm.ir_pass.LowerStorageAccessInfo))
     pass_list.append((3, ir_pass.fold_uop_loop))
     pass_list.append((3, ir_pass.cpu_access_rewrite))
     return tvm.build_config(add_lower_pass=pass_list, **kwargs)
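
The hunk adds a `(phase, pass)` pair to `pass_list`, and where it is appended matters if passes sharing a phase run in list order. The following is a minimal sketch of that idea in plain Python, not TVM code. It assumes the lowering driver runs passes phase by phase and keeps list order within a phase. `group_by_phase`, `make_pass`, and `run_order` are hypothetical helpers, and the pass names are stand-ins that only record when they are called.

```python
from collections import defaultdict


def group_by_phase(pass_list):
    """Order (phase, fn) pairs by phase, keeping list order within a phase.

    Assumption: this mirrors how a lowering driver would consume the list.
    """
    phases = defaultdict(list)
    for phase, fn in pass_list:
        phases[phase].append(fn)
    return [fn for phase in sorted(phases) for fn in phases[phase]]


def make_pass(name, log):
    """Build a hypothetical stand-in pass that records its name and returns stmt."""
    def _pass(stmt):
        log.append(name)
        return stmt
    return _pass


def run_order(debug_flag=False):
    """Build the pass list as in the diff and return the order the passes ran in."""
    log = []
    pass_list = []
    if debug_flag:
        pass_list.append((1, make_pass("add_debug", log)))
    pass_list.append((2, make_pass("inject_alu_intrin", log)))
    # The line added by this commit: appended before the other phase-3 passes.
    pass_list.append((3, make_pass("LowerStorageAccessInfo", log)))
    pass_list.append((3, make_pass("fold_uop_loop", log)))
    pass_list.append((3, make_pass("cpu_access_rewrite", log)))

    stmt = "stmt"  # placeholder for the statement being lowered
    for fn in group_by_phase(pass_list):
        stmt = fn(stmt)
    return log


if __name__ == "__main__":
    print(run_order())
    print(run_order(debug_flag=True))
```

Under that assumption, the new pass runs after `inject_alu_intrin` and before `fold_uop_loop` and `cpu_access_rewrite`, which matches its position in the diff.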

0 commit comments