ggml : add Q4_3 quantization (ggerganov#1082)
ggerganov authored Apr 20, 2023
1 parent 6a9661e commit e0305ea
Showing 5 changed files with 293 additions and 37 deletions.
1 change: 1 addition & 0 deletions examples/quantize/quantize.cpp
@@ -15,6 +15,7 @@ int main(int argc, char ** argv) {
         fprintf(stderr, " type = %d - q4_0\n", LLAMA_FTYPE_MOSTLY_Q4_0);
         fprintf(stderr, " type = %d - q4_1\n", LLAMA_FTYPE_MOSTLY_Q4_1);
         fprintf(stderr, " type = %d - q4_2\n", LLAMA_FTYPE_MOSTLY_Q4_2);
+        fprintf(stderr, " type = %d - q4_3\n", LLAMA_FTYPE_MOSTLY_Q4_3);
         return 1;
     }
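
Note: the hunk above extends the usage message by hand, one fprintf per quantization type. As a rough illustration only (this is not part of the commit), the same listing could be generated from a table so that a new type needs a single new row. Everything named below (`sketch_ftype`, `ftype_entry`, `known_ftypes`, `usage_text`) is hypothetical, and the numeric values are assumed for the example rather than taken from llama.h.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-ins for the LLAMA_FTYPE_MOSTLY_* constants.
// The numeric values are assumptions for illustration only.
enum sketch_ftype {
    SKETCH_FTYPE_Q4_0 = 2,
    SKETCH_FTYPE_Q4_1 = 3,
    SKETCH_FTYPE_Q4_2 = 5,
    SKETCH_FTYPE_Q4_3 = 6,
};

// One row per quantization type shown in the usage message.
struct ftype_entry {
    int          id;
    const char * name;
};

// Single place to register a type; adding q4_3 is one extra row.
static const std::vector<ftype_entry> & known_ftypes() {
    static const std::vector<ftype_entry> table = {
        { SKETCH_FTYPE_Q4_0, "q4_0" },
        { SKETCH_FTYPE_Q4_1, "q4_1" },
        { SKETCH_FTYPE_Q4_2, "q4_2" },
        { SKETCH_FTYPE_Q4_3, "q4_3" },
    };
    return table;
}

// Builds the "type = N - name" lines in table order.
static std::string usage_text() {
    std::string out;
    for (const auto & e : known_ftypes()) {
        out += "  type = " + std::to_string(e.id) + " - " + e.name + "\n";
    }
    return out;
}
```

A caller would print the result with something like `fputs(usage_text().c_str(), stderr);` before returning 1. The trade-off is a little indirection in exchange for keeping the type list in one place.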
