Skip to content

Commit f4fa2c3

Browse files
committed
Update on "[ET-VK] Removed shared memory usage and simplied conv2d dw op shader to improve performance."
This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance. Makes sum a one dimensional array, and moves bias application before storing texel. Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/) [ghstack-poisoned]
2 parents 95e883c + a160857 commit f4fa2c3

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

backends/vulkan/vulkan_preprocess.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -150,14 +150,14 @@ def preprocess( # noqa: C901
150150
program = apply_passes(
151151
program,
152152
[
153+
RemoveRedundantOpsTransform(),
153154
AddmmToLinearTransform(),
154155
FuseQuantizedOpsTransform(program),
155156
SqueezeUnsqueezeInputs(),
156157
FuseViewCopyTransform(),
157158
ViewCopyToSqueezeUnsqueezePass(),
158159
FuseBatchNormWithConvPass(program),
159160
FuseClampPass(),
160-
RemoveRedundantOpsTransform(),
161161
],
162162
)
163163

0 commit comments

Comments
 (0)