Commit 5b1350f
TensorCore Support using Intrinsic (apache#4136)
* add tensor core support
* avoid memory bank conflict
* fix thread sync & better performance
* better performance
* add schedule test for conv2d
* extend into BatchMatMul
* support config fragment shape and layout using intrinsic
* add TensorCore tutorial
* add int support and fix lint
* address comment
* add 32*16*8 TensorCore test
* fix wmma include logic1 parent 6e0dbee commit 5b1350f
1 file changed
+1
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
80 | 80 | | |
81 | 81 | | |
82 | 82 | | |
| 83 | + | |
83 | 84 | | |
84 | 85 | | |
85 | 86 | | |
| |||
0 commit comments