Skip to content

Commit 766435e

Browse files
varun-sundar-rabindranathVarun Sundar Rabindranath
andauthored
[Kernel] Tuned FP8 Kernels for Ada Lovelace (#6677)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
1 parent 7cbd9ec commit 766435e

File tree

6 files changed

+877
-490
lines changed

6 files changed

+877
-490
lines changed

benchmarks/cutlass_benchmarks/w8a8_benchmarks.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313
from vllm import _custom_ops as ops
1414
from vllm.utils import FlexibleArgumentParser
1515

16-
DEFAULT_MODELS = list(WEIGHT_SHAPES.keys())[1:]
16+
DEFAULT_MODELS = list(WEIGHT_SHAPES.keys())
1717
DEFAULT_BATCH_SIZES = [1, 16, 32, 64, 128, 256, 512]
1818
DEFAULT_TP_SIZES = [1]
1919

0 commit comments

Comments
 (0)