-
Couldn't load subscription status.
- Fork 13.4k
Closed
Labels
performanceSpeed related topicsSpeed related topics
Description
Recently, we did a performance benchmark of llama.cpp for Apple Silicon M-series chips: #4167
I am planning to do a similar benchmark for Apple's mobile chips that are used in iPhones and iPads:
https://en.wikipedia.org/wiki/Apple_silicon#A_series
This issue will track the progress of this work. I am also hoping to collect some feedback regarding the implementation and the metrics that would be important to measure. Let me know if you have any thoughts.
Some rough requirements:
- Ease of use
Should be simple for people to build and run the benchmark on their devices - Model size in the range of 1B - 7B
Larger models do not look feasible at the moment, but can reconsider
Ref:
- Starting point would be the Swift examples:
Metadata
Metadata
Assignees
Labels
performanceSpeed related topicsSpeed related topics