gguf-split: add --no-tensor-first-split option #1120

bench-server-baseline (phi-2, q8_0)

succeeded May 4, 2024 in 14m 11s