Hi.
Thank you guys for this awesome library.
In my application, I need to compute a Gram matrix (basically, `x.t().dot(x)`). Using `gemm` for this is wasteful: the result is symmetric, so the lower half is redundant. `syrk` computes only one triangle, which makes it roughly twice as fast as `gemm` in this case.
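To make the saving concrete, here is a minimal plain-Rust sketch of what a `syrk`-style kernel does. It is not this library's API and not a tuned BLAS routine; the function name `gram_upper` and the row-major slice layout are assumptions made for illustration. It accumulates only the entries with `j >= i` and then mirrors them, so it does about half the multiply-adds of a full product.

```rust
/// Gram matrix G = X^T * X for a row-major `x` holding `rows` x `cols` values.
/// Hypothetical helper for illustration only; a real `syrk` is blocked and vectorized.
fn gram_upper(x: &[f64], rows: usize, cols: usize) -> Vec<f64> {
    assert_eq!(x.len(), rows * cols, "slice length must equal rows * cols");
    let mut g = vec![0.0; cols * cols];

    // Accumulate the upper triangle only (j >= i), one input row at a time.
    for r in 0..rows {
        let row = &x[r * cols..(r + 1) * cols];
        for i in 0..cols {
            let xi = row[i];
            for j in i..cols {
                g[i * cols + j] += xi * row[j];
            }
        }
    }

    // Mirror the upper triangle into the lower one; `syrk` itself skips this
    // step and leaves the other triangle untouched.
    for i in 0..cols {
        for j in 0..i {
            g[i * cols + j] = g[j * cols + i];
        }
    }
    g
}
```

For a 3 x 2 input `[[1, 2], [3, 4], [5, 6]]` this yields `[[35, 44], [44, 56]]`, the same as the full product.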
On a general note, it would be really useful if the library also made it easy to call any BLAS / LAPACK function directly, by exposing what those routines need through appropriate idioms (memory layout, strides, etc.).
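As a rough sketch of the kind of idiom meant here: given a 2-D view's shape and its strides in elements, one can decide whether the data can be handed to BLAS as-is and with which leading dimension. The names `BlasLayout` and `blas_layout` are hypothetical and are not part of this library or of any BLAS binding; the sketch only assumes the usual BLAS rule that one stride must be 1 and the leading dimension must be at least the length of the contiguous axis.

```rust
/// How BLAS should interpret the buffer. Hypothetical type for illustration.
#[derive(Debug, PartialEq)]
enum BlasLayout {
    RowMajor,
    ColMajor,
}

/// Map (shape, strides-in-elements) to a BLAS layout and leading dimension.
/// Returns `None` when the view cannot be passed without copying
/// (for example, negative strides or no unit stride).
fn blas_layout(shape: (usize, usize), strides: (isize, isize)) -> Option<(BlasLayout, usize)> {
    let (rows, cols) = shape;
    let (row_stride, col_stride) = strides;

    if col_stride == 1 && row_stride >= cols.max(1) as isize {
        // Elements of a row are adjacent; rows are `row_stride` apart.
        Some((BlasLayout::RowMajor, row_stride as usize))
    } else if row_stride == 1 && col_stride >= rows.max(1) as isize {
        // Elements of a column are adjacent; columns are `col_stride` apart.
        Some((BlasLayout::ColMajor, col_stride as usize))
    } else {
        None
    }
}
```

With this, a contiguous 3 x 2 row-major array (strides `(2, 1)`) maps to `RowMajor` with leading dimension 2, and its transposed view (shape 2 x 3, strides `(1, 2)`) maps to `ColMajor` with the same leading dimension, so the transpose costs nothing before the BLAS call.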