Open
Description
In the original QuaRot method, the R2 rotation can be absorbed into the weights, so there is no need to apply it as an online rotation.
llmc/llmc/compression/quantization/quarot.py
Line 139 in 867fb4f
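To illustrate the point, here is a minimal NumPy sketch of folding a per-head orthogonal rotation into the value and output projection weights ahead of time. This is not the llmc code. The names `fold_r2` and `attention_output` are hypothetical, and the sketch assumes weights are applied as `x @ W` (PyTorch's `nn.Linear` stores the transpose) and that the same rotation is shared by all heads.

```python
import numpy as np


def random_orthogonal(n, rng):
    # QR of a Gaussian matrix yields an orthogonal factor q (q @ q.T == I).
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q


def fold_r2(w_v, w_o, q, num_heads):
    """Fold a per-head rotation q into w_v (output side) and w_o (input side).

    w_v: (hidden, num_heads * head_dim), used as x @ w_v
    w_o: (num_heads * head_dim, hidden), used as ctx @ w_o
    q:   (head_dim, head_dim), orthogonal
    """
    head_dim = q.shape[0]
    hidden = w_v.shape[0]
    # Rotate each head's slice of the value projection: W_v[h] -> W_v[h] @ q
    w_v_rot = (w_v.reshape(hidden, num_heads, head_dim) @ q).reshape(hidden, -1)
    # Counter-rotate each head's slice of the output projection: W_o[h] -> q.T @ W_o[h]
    w_o_rot = (q.T @ w_o.reshape(num_heads, head_dim, -1)).reshape(num_heads * head_dim, -1)
    return w_v_rot, w_o_rot


def attention_output(x, attn, w_v, w_o, num_heads):
    # x: (tokens, hidden), attn: (num_heads, tokens, tokens) attention probabilities
    tokens = x.shape[0]
    v = (x @ w_v).reshape(tokens, num_heads, -1).transpose(1, 0, 2)  # (heads, tokens, head_dim)
    ctx = (attn @ v).transpose(1, 0, 2).reshape(tokens, -1)          # (tokens, heads * head_dim)
    return ctx @ w_o


rng = np.random.default_rng(0)
hidden, num_heads, head_dim, tokens = 16, 4, 8, 5

x = rng.standard_normal((tokens, hidden))
w_v = rng.standard_normal((hidden, num_heads * head_dim))
w_o = rng.standard_normal((num_heads * head_dim, hidden))

# Row-wise softmax over random scores stands in for real attention probabilities.
scores = rng.standard_normal((num_heads, tokens, tokens))
attn = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)

q = random_orthogonal(head_dim, rng)
w_v_rot, w_o_rot = fold_r2(w_v, w_o, q, num_heads)

out_ref = attention_output(x, attn, w_v, w_o, num_heads)
out_rot = attention_output(x, attn, w_v_rot, w_o_rot, num_heads)
max_err = np.abs(out_ref - out_rot).max()
```

Attention mixes tokens within a head and never touches the `head_dim` axis, so the rotation on the value side cancels against its transpose on the output side. In this sketch the two outputs agree up to floating-point error with no rotation applied at inference time.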