Support MRL-E Embedding's Quantization #1370

@LHT129

Description

Some embedding models are trained with "MRL-E", which means a sub-dimension of the vector is a good quantizer for the whole-dimension vector. We can use "mrle" as a transformer, like PCA, to preprocess the original vector. In practice, "fp32" works better as the base quantization.
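
A rough sketch of what such a transformer might do, to make the idea concrete. The function name `mrle_transform` and the parameter `sub_dim` are hypothetical, not an existing API. It also assumes the sub-dimension is the leading prefix of the vector and that the prefix is re-normalized to unit length, which is a common practice with Matryoshka-style embeddings but is not stated in this issue.

```python
import math
from typing import List, Sequence


def mrle_transform(vector: Sequence[float], sub_dim: int) -> List[float]:
    """Hypothetical MRL-E style transform: keep a prefix of the vector.

    Assumption: the useful sub-dimension is the first `sub_dim` components,
    and the prefix is L2-normalized so inner products stay comparable.
    """
    if not 0 < sub_dim <= len(vector):
        raise ValueError("sub_dim must be between 1 and len(vector)")
    prefix = [float(x) for x in vector[:sub_dim]]
    norm = math.sqrt(sum(x * x for x in prefix))
    if norm == 0.0:
        # All-zero prefix: nothing to normalize, return it unchanged.
        return prefix
    return [x / norm for x in prefix]


# Example: reduce a 4-dim vector to its first 2 dims, kept as fp32-style floats.
full_vector = [3.0, 4.0, 12.0, 0.0]
reduced = mrle_transform(full_vector, sub_dim=2)
```

The reduced vector would then be handed to the base quantizer (for example "fp32"), in the same place where a PCA-projected vector would be used.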
