Enhance the performance of quantized model in terms of accuracy #10950

hieu-nguyen-ts · 2024-12-23T04:57:35Z

hieu-nguyen-ts
Dec 23, 2024

After applying quantization with q4_0, I noticed that the performance of my generated results has declined. I appreciate the speed and reduced VRAM usage that this quantized model offers, but I am seeking ways to enhance its performance. Could you please suggest any solutions or improvements, such as calibration, LoRA, fine-tuning, or other techniques? Thank you for your assistance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance the performance of quantized model in terms of accuracy #10950

{{title}}

Replies: 0 comments

Select a reply

Enhance the performance of quantized model in terms of accuracy #10950

hieu-nguyen-ts Dec 23, 2024

Replies: 0 comments

hieu-nguyen-ts
Dec 23, 2024