Description
Is your feature request related to a problem? Please describe.
Feature: I would like to do quaternion algebra on cupy arrays rather than on numpy arrays.
Describe the solution you'd like
Support for quat.as_quat_array() and the resulting quaternion algebra on a cupy array on the GPU, rather than on a numpy array on the CPU.
Describe alternatives you've considered
Writing my own quaternion algebra functions for cupy arrays, which gives a less compact syntax and is presumably slower than a proper quaternion dtype implemented for cupy.
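To illustrate what this alternative looks like, here is a minimal sketch of a hand-written Hamilton product on plain arrays of shape (..., 4) with components ordered (w, x, y, z). The names `quat_multiply` and the `xp` parameter are hypothetical and are not part of the quaternion package. The demo runs on numpy; the assumption is that passing `xp=cupy` together with cupy arrays would run the same arithmetic on the GPU, since it only uses indexing, elementwise operators and `stack`.

```python
import numpy as np


def quat_multiply(p, q, xp=np):
    """Hamilton product of two arrays of shape (..., 4), ordered (w, x, y, z).

    `xp` is the array module to use (numpy here; cupy is assumed to work
    the same way when both inputs are cupy arrays).
    """
    pw, px, py, pz = p[..., 0], p[..., 1], p[..., 2], p[..., 3]
    qw, qx, qy, qz = q[..., 0], q[..., 1], q[..., 2], q[..., 3]
    return xp.stack(
        [
            pw * qw - px * qx - py * qy - pz * qz,  # w
            pw * qx + px * qw + py * qz - pz * qy,  # x
            pw * qy - px * qz + py * qw + pz * qx,  # y
            pw * qz + px * qy - py * qx + pz * qw,  # z
        ],
        axis=-1,
    )


# Basis quaternions i, j, k stored as float32 to halve the memory footprint.
i = np.array([0, 1, 0, 0], dtype=np.float32)
j = np.array([0, 0, 1, 0], dtype=np.float32)
k = np.array([0, 0, 0, 1], dtype=np.float32)

ij = quat_multiply(i, j)  # i * j = k
ji = quat_multiply(j, i)  # j * i = -k
```

Compared with a real quaternion dtype, every operation has to be called as a function instead of using `*`, which is the less compact syntax mentioned above.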
Additional context
Same context as in the previous issue on float32 precision: the materials science and 3D imaging community, which works with large datasets.
The memory issue with a 1024^3 quaternion array is even more severe here, since GPU devices typically offer (much) less memory than is available on the CPU side.
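For scale, a quick back-of-the-envelope calculation of the footprint of such an array, assuming four floating-point components per quaternion and no other overhead (the helper name `quaternion_array_gib` is only for illustration):

```python
def quaternion_array_gib(n_quaternions, bytes_per_component):
    """Size in GiB of an array of quaternions with 4 components each."""
    return n_quaternions * 4 * bytes_per_component / 2**30


n = 1024**3  # number of voxels in a 1024^3 volume, one quaternion each

size_float64 = quaternion_array_gib(n, 8)  # 32.0 GiB
size_float32 = quaternion_array_gib(n, 4)  # 16.0 GiB
```

Even with float32 components, a single array of this size fills or exceeds the memory of many GPUs before any intermediate result is allocated.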