Commit fb84464

Remove FORCE_OLD_CUTLASS_MLA option
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
1 parent 6be6496 commit fb84464

1 file changed: +0 -6 lines changed

vllm/v1/attention/backends/mla/cutlass_mla.py

Lines changed: 0 additions & 6 deletions
@@ -109,12 +109,6 @@ def __init__(
                 "are not implemented for "
                 "CutlassMLAImpl")
 
-        self._use_old_cutlass_mla = False
-        force_old_cutlass = os.environ.get("FORCE_OLD_CUTLASS_MLA", None)
-        if force_old_cutlass:
-            logger.warning_once("Forcing old cutlass mla kernel")
-            self._use_old_cutlass_mla = True
-
         # TODO: Currently, num_kv_splits is limited to 16 to avoid hanging
         # issues. In case the code hangs, use:
         # FORCE_NUM_KV_SPLITS=1
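
For reviewers: the removed lines enabled the old kernel whenever FORCE_OLD_CUTLASS_MLA was set to any non-empty string, because the check relied on plain string truthiness. The sketch below mirrors that check in isolation. The helper name `old_kernel_forced` and its `environ` parameter are hypothetical and exist only for illustration; they are not part of vLLM.

```python
import os


def old_kernel_forced(environ=os.environ):
    """Hypothetical helper mirroring the truthiness check in the removed lines."""
    # Same lookup as the removed code: a missing variable yields None.
    force_old_cutlass = environ.get("FORCE_OLD_CUTLASS_MLA", None)
    # Any non-empty string is truthy, so "0" or "false" also enabled the flag.
    return bool(force_old_cutlass)


if __name__ == "__main__":
    for env in ({}, {"FORCE_OLD_CUTLASS_MLA": ""},
                {"FORCE_OLD_CUTLASS_MLA": "0"},
                {"FORCE_OLD_CUTLASS_MLA": "1"}):
        print(env, "->", old_kernel_forced(env))
```

Under this reading, setting the variable to "0" would still have forced the old kernel; only leaving it unset or empty kept the default path. After this commit the variable is no longer read in this file.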
