Skip to content

Commit 629cada

Browse files
committed
Apply suggestion from @elvischenv
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
1 parent b0fd67c commit 629cada

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

vllm/v1/attention/backends/flashinfer.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1209,8 +1209,8 @@ def fast_plan_decode(
12091209
head_dim,
12101210
False, # causal
12111211
window_left,
1212-
-1,
1213-
False,
1212+
-1, # fixed_split_size
1213+
False, # disable_split_kv
12141214
)
12151215
except Exception as e:
12161216
raise RuntimeError(f"Error in tensor core plan: {e}") from e

0 commit comments

Comments
 (0)