[Benchmark Script] Refactor benchmark script for `bench_datatype_gemm`

>I know this is a bnechmark script but I think this could still be refactored. Perhaps a few functions/objects and a dictionary?

_Originally posted by @ProExpertProg in https://github.com/vllm-project/vllm/pull/19233#discussion_r2133137967_

I think this is a great idea, example config designed:

```py
PROVIDER_CFGS = {
    "int8-tensor-w-token-a": dict(w="tensor", a="token", no_a_quant=False),
    "int8-tensor-w-tensor-a": dict(w="tensor", a="tensor", no_a_quant=False),
    "int8-channel-w-token-a": dict(w="channel", a="token", no_a_quant=False),
    "int8-channel-w-tensor-a": dict(w="channel", a="tensor", no_a_quant=False),
    "int8-tensor-w-token-a-noquant": dict(w="tensor", a="token", no_a_quant=True),
    "int8-tensor-w-tensor-a-noquant": dict(w="tensor", a="tensor", no_a_quant=True),
    "int8-channel-w-token-a-noquant": dict(w="channel", a="token", no_a_quant=True),
    "int8-channel-w-tensor-a-noquant": dict(w="channel", a="tensor", no_a_quant=True),
}
```

After https://github.com/vllm-project/vllm/pull/19233 merged, I can have another PR optimizing this, and update `benchmarks/kernels/bench_fp8_gemm.py` as well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Benchmark Script] Refactor benchmark script for `bench_datatype_gemm` #19364

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Benchmark Script] Refactor benchmark script for bench_datatype_gemm #19364

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[Benchmark Script] Refactor benchmark script for `bench_datatype_gemm` #19364