-
Notifications
You must be signed in to change notification settings - Fork 13.4k
Closed
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomersperformanceSpeed related topicsSpeed related topics
Description
I was playing with the 65B model, and it took a minute to read the files. If you wrap the model loader loop with a #pragma omp parallel for
and add -fopenmp
to the compiler flags, you can drop it to 18 seconds.
schneiderfelipeSommerEngineering, Adrian-Samoticha, Coderx7 and schneiderfelipe
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomersperformanceSpeed related topicsSpeed related topics