Skip to content

Add support for GPTQ-quantized MoE models using MoE Marlin#2557

Merged
danieldk merged 1 commit intomainfrom feature/moe-marlinSep 30, 2024

Commits

Commits on Sep 30, 2024