Commit 0f7919f

[Misc] Expand SUPPORTED_HIDDEN_SIZES for DeepEP low-latency kernels (vllm-project#21818)
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
1 parent 6144545 commit 0f7919f

1 file changed: +1 -1 lines changed


vllm/model_executor/layers/fused_moe/deepep_ll_prepare_finalize.py

Lines changed: 1 addition & 1 deletion
@@ -40,7 +40,7 @@ class DeepEPLLPrepareAndFinalize(mk.FusedMoEPrepareAndFinalize):
 
     # DeepEP low-latency kernels are compiled only for certain
     # specific hidden sizes.
-    SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 7168]
+    SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 6144, 7168]
 
     def __init__(self,
                  buffer: deep_ep.Buffer,
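
For illustration, here is a minimal sketch of how a list like SUPPORTED_HIDDEN_SIZES could be used to gate a model's hidden size before the low-latency path is selected. The helper names `is_supported_hidden_size` and `check_hidden_size` are hypothetical and are not taken from vLLM; the check that DeepEPLLPrepareAndFinalize actually performs is not shown in this diff.

```python
# Hypothetical sketch, not the vLLM implementation.
# The list mirrors the post-commit value shown in the diff above.
SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 6144, 7168]


def is_supported_hidden_size(hidden_size: int) -> bool:
    """Return True if hidden_size appears in SUPPORTED_HIDDEN_SIZES."""
    return hidden_size in SUPPORTED_HIDDEN_SIZES


def check_hidden_size(hidden_size: int) -> None:
    """Raise ValueError if hidden_size is not in SUPPORTED_HIDDEN_SIZES."""
    if not is_supported_hidden_size(hidden_size):
        raise ValueError(
            f"hidden size {hidden_size} is not supported; "
            f"expected one of {SUPPORTED_HIDDEN_SIZES}")
```

Under this sketch, a model with hidden size 6144 passes the check only after this commit; before it, the same call would have raised.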
