Commit f0364e8
Fix quantized_matmul with 4D inputs (#4335)
Summary:
Pull Request resolved: #4335
MobileBERT has a matmul with 4D inputs (`[1, 4, 8, 32]` by `[1, 4, 32, 8]`) which is erroring out AoT in the meta kernel.
This diff fixes the meta kernel to handle cases where the leading dimensions are more than one (the kernel itself can already handle it!).
Also move the exported graph dump to `export_program`, where it belongs. This prevents some double printing in some cases.
Note: this diff needs at GH approval!
Reviewed By: dulinriley, zonglinpengmeta
Differential Revision: D60050087
fbshipit-source-id: de09ed2fb9c5cdf729cc020119bf090d0f0c70c41 parent 844a69f commit f0364e8
1 file changed
+9
-6
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
76 | 76 | | |
77 | 77 | | |
78 | 78 | | |
| 79 | + | |
79 | 80 | | |
80 | 81 | | |
81 | 82 | | |
| |||
99 | 100 | | |
100 | 101 | | |
101 | 102 | | |
102 | | - | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
103 | 110 | | |
104 | 111 | | |
105 | 112 | | |
| |||
111 | 118 | | |
112 | 119 | | |
113 | 120 | | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
| 121 | + | |
119 | 122 | | |
120 | 123 | | |
121 | 124 | | |
| |||
0 commit comments