Closed
Description
This is a task suitable for new contributors.
See how we multi-threaded the ggml_rope() operator.
Do the same for the ggml_cpy() operator and see if there is any benefit.
Use the ggml profiler (GGML_PERF) to measure the benefit of the multi-threaded ggml_cpy() over the single-threaded one.
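
For anyone picking this up, here is a rough sketch of the row-splitting idea. It assumes a plain contiguous float tensor with `nr` rows of `nc` elements. `row_range` and `copy_rows_chunk` are hypothetical helpers written for illustration, not ggml functions, and `ith` / `nth` stand for the index of the current thread and the total thread count. The real ggml_cpy() also has to handle type conversion and non-contiguous layouts, which this sketch ignores.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

// Hypothetical helper (not part of ggml): computes the half-open row range
// [*ir0, *ir1) that thread `ith` out of `nth` threads should process.
static void row_range(int nr, int ith, int nth, int *ir0, int *ir1) {
    assert(nth > 0 && ith >= 0 && ith < nth);

    const int dr = (nr + nth - 1) / nth; // rows per thread, rounded up

    *ir0 = dr * ith;
    if (*ir0 > nr) {
        *ir0 = nr; // more threads than rows: this thread gets an empty range
    }

    *ir1 = *ir0 + dr;
    if (*ir1 > nr) {
        *ir1 = nr; // the last non-empty chunk may be shorter
    }
}

// Hypothetical worker (not part of ggml): copies only the rows that belong
// to thread `ith`. Chunks of different threads never overlap, so the workers
// need no locking among themselves.
static void copy_rows_chunk(float *dst, const float *src,
                            int nr, int nc, int ith, int nth) {
    int ir0, ir1;
    row_range(nr, ith, nth, &ir0, &ir1);

    for (int ir = ir0; ir < ir1; ir++) {
        memcpy(dst + (size_t) ir * nc,
               src + (size_t) ir * nc,
               (size_t) nc * sizeof(float));
    }
}
```

Each worker thread would call `copy_rows_chunk` with its own `ith`. Since a plain copy is mostly limited by memory bandwidth, the speedup may be small, which is why the measurement step matters.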