Commit

[Bugfix] Fix w8a8 benchmarks for int8 case (vllm-project#5643)
tlrmchlsmth authored and jimpang committed Jul 24, 2024
1 parent eba0837 commit d206e07
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions benchmarks/cutlass_benchmarks/w8a8_benchmarks.py
@@ -120,9 +120,8 @@ def bench_int8(dtype: torch.dtype, m: int, k: int, n: int, label: str,

     # cutlass impl
     timers.append(
-        bench_fn(a, b, scale_a.to(device="cpu"), scale_b.to(device="cpu"),
-                 torch.bfloat16, label, sub_label, cutlass_impl,
-                 "cutlass_i8_i8_bf16_scaled_mm"))
+        bench_fn(a, b, scale_a, scale_b, torch.bfloat16, label, sub_label,
+                 cutlass_impl, "cutlass_i8_i8_bf16_scaled_mm"))

     return timers

