Description
The numbers returned by whisper-bench are misleading
I've got a Mac M4 Mini 256G (the cheap one) and a Lenovo Laptop with a 4070 GPU in it. I usually use the laptop in Hybrid mode (Performance mode gets noisy) - the Mac is, of course, silent all the time anyway
I've been relying on whisper-bench to indicate which device is faster. The thing is, it's wrong (or, more accurately, rather misleading)
I've just got the M4 and my Laptop onto a fairly equal footing when it comes to whisper.cpp facilities: both have OpenVINO available, and the Mac has OpenCL while the Laptop has CUDA
I'd always thought that the Mac was slower than the Laptop. Well, until I drag-raced them against each other via the CLI and a 35-minute public domain recording of Aladdin and the Magic Lamp
The figures come out like this (all tests use the medium.en model)...
whisper-bench (total runtime in seconds)
PC (Perf Mode) = 3.694
PC (Hybrid Mode) = 3.721
Mac M4 Mini 256G = 6.929
So the laptop's performance mode is 187.6% the speed of the M4?
Or is it?
Next I ran whisper-cli over the recording of Aladdin and the Magic Lamp (35m 06s - mp3@64k - 16M filesize - 5379 words)
whisper-cli (total runtime in seconds = words transcribed per minute)
Mac M4 Mini 256G = 172.623 = 1869.623 wpm
PC (Perf Mode) = 186.730 = 1728.378 wpm
PC (Hybrid Mode) = 202.850 = 1591.027 wpm
Now the Mac is 108% the speed of the Laptop in Performance Mode
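For anyone who wants to check the arithmetic, here's a minimal sketch of how the percentages and wpm figures above can be derived. The runtimes and word count are copied straight from my results; the function names (relative_speed, words_per_minute) are just ones I've made up for illustration and have nothing to do with whisper.cpp itself.

```python
# Runtimes in seconds, copied from the results above (nothing is re-measured here).
BENCH_SECONDS = {
    "PC (Perf Mode)": 3.694,
    "PC (Hybrid Mode)": 3.721,
    "Mac M4 Mini 256G": 6.929,
}
CLI_SECONDS = {
    "Mac M4 Mini 256G": 172.623,
    "PC (Perf Mode)": 186.730,
    "PC (Hybrid Mode)": 202.850,
}
WORD_COUNT = 5379  # words in the 35m 06s recording


def relative_speed(slower_seconds: float, faster_seconds: float) -> float:
    """Speed of the faster device as a percentage of the slower one.

    Speed is inversely proportional to runtime, so the ratio is slower / faster.
    """
    return 100.0 * slower_seconds / faster_seconds


def words_per_minute(words: int, seconds: float) -> float:
    """Words processed per minute of wall-clock runtime."""
    return words * 60.0 / seconds


if __name__ == "__main__":
    # whisper-bench: Laptop (Perf Mode) relative to the Mac
    bench = relative_speed(BENCH_SECONDS["Mac M4 Mini 256G"],
                           BENCH_SECONDS["PC (Perf Mode)"])
    print(f"whisper-bench: PC (Perf Mode) is {bench:.1f}% the speed of the Mac")

    # whisper-cli: Mac relative to the Laptop (Perf Mode)
    cli = relative_speed(CLI_SECONDS["PC (Perf Mode)"],
                         CLI_SECONDS["Mac M4 Mini 256G"])
    print(f"whisper-cli:   Mac is {cli:.0f}% the speed of PC (Perf Mode)")

    for device, seconds in CLI_SECONDS.items():
        print(f"{device}: {words_per_minute(WORD_COUNT, seconds):.3f} wpm")
```

Swap in your own runtimes and word count if you run the same test on a different setup.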
In the real world, of course, we'd be using whisper.cpp for things much more like the whisper-cli test than the whisper-bench one
If anyone wants to run the same test against their setup you can find the audio file here (Archive.org)
Mac was 1/3rd the price of the Laptop (but the Laptop plays better games)
Suppose I'd better switch to Linux on the Laptop and try the same test there next...