Skip to content

Commit 974078d

Browse files
varun-sundar-rabindranathVarun Sundar Rabindranath
authored andcommitted
[Kernel] Tuned FP8 Kernels for Ada Lovelace (vllm-project#6677)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com> Signed-off-by: Alvant <alvasian@yandex.ru>
1 parent 663123b commit 974078d

File tree

6 files changed

+877
-490
lines changed

6 files changed

+877
-490
lines changed

benchmarks/cutlass_benchmarks/w8a8_benchmarks.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313
from vllm import _custom_ops as ops
1414
from vllm.utils import FlexibleArgumentParser
1515

16-
DEFAULT_MODELS = list(WEIGHT_SHAPES.keys())[1:]
16+
DEFAULT_MODELS = list(WEIGHT_SHAPES.keys())
1717
DEFAULT_BATCH_SIZES = [1, 16, 32, 64, 128, 256, 512]
1818
DEFAULT_TP_SIZES = [1]
1919

0 commit comments

Comments
 (0)