Closed
Description
On my system, throughput drops from 40 tok/s (with -p 0) to 33 tok/s (default settings), almost 20% slower...
$ ./run /tmp/ramdisk/model110m.bin -s 1 -p 0
Once upon a time <stripped> make a big difference in the world.
achieved tok/s: 40.298507
$ ./run /tmp/ramdisk/model110m.bin -s 1
Once upon a time <stripped> make a big difference in the world.
achieved tok/s: 33.588093
The slowdown is even more dramatic on smaller models.
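For reference, here is a small sketch of how the two reported tok/s figures translate into a relative slowdown. The helper name relative_slowdown is made up for illustration and is not part of the project. The inputs are the two "achieved tok/s" values from the runs above. Depending on whether you measure the throughput drop or the extra time per token, the number comes out a bit differently:

```python
def relative_slowdown(baseline_tps, slower_tps):
    """Compare two throughput measurements (tokens per second).

    Returns a pair:
      - fractional drop in throughput relative to the baseline
      - fractional increase in time spent per token
    """
    throughput_drop = (baseline_tps - slower_tps) / baseline_tps
    extra_time_per_token = baseline_tps / slower_tps - 1.0
    return throughput_drop, extra_time_per_token


# Figures copied from the two runs in this report.
drop, extra = relative_slowdown(40.298507, 33.588093)
print(f"throughput drop: {drop:.1%}")        # 16.7%
print(f"extra time per token: {extra:.1%}")  # 20.0%
```

So the "almost 20%" figure corresponds to the extra time per token, while the throughput itself is about 17% lower.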