The performance (runtime) of any AI model is influenced by its size and precision. AI model developers spend time in optimizing the model size/architecture and precision to achieve better runtime performance. However, there is a limit to reducing model size and precision without losing model quality.
-
Notifications
You must be signed in to change notification settings - Fork 0
mjayw2014/rvm_perf_inference
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Achieve 15-20x performance improvement for vision/perception model inference
