Skip to content

Commit 11a46c8

Browse files
committed
reduce test size.
1 parent c2ec489 commit 11a46c8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/warp_specialize/example_warp_specialize_flashmla.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -382,7 +382,7 @@ def ref_program(q, q_pe, kv, k_pe, glse, Output_partial):
382382
return out
383383

384384

385-
def main(batch=1, heads=128, kv_heads=1, kv_ctx=8192, dim=512, pe_dim=64):
385+
def main(batch=1, heads=64, kv_heads=1, kv_ctx=1024, dim=512, pe_dim=64):
386386
qk_flops = 2 * batch * heads * kv_ctx * (dim + pe_dim)
387387
pv_flops = 2 * batch * heads * kv_ctx * dim
388388
total_flops = qk_flops + pv_flops

0 commit comments

Comments
 (0)