Skip to content

Add 4-bit quantized inference to run BLOOM-176B on 2 A100 GPUs#2526

Open
RezaYazdaniAminabadi wants to merge 14 commits intomasterfrom quantize-inference

Commits

Commits on Nov 17, 2022

Commits on Nov 18, 2022

Commits on Nov 19, 2022

Commits on Nov 21, 2022

Commits on Dec 14, 2022