convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3 #3023

KerfuffleV2 · 2023-09-05T09:42:47Z

Better error messages for files that cannot be converted
Add file type to GGUF output

Better error messages for files that cannot be converted Add file type to GGUF output

KerfuffleV2 · 2023-09-05T10:04:13Z

I'll hold off merging this for a bit until I get some feedback from the people with older GGML files.

cebtenzzre · 2023-09-05T17:05:41Z

Isn't the name of the script technically inaccurate now? Maybe we should at least add something to --help to indicate to users that certain pre-GGMLv3 files are supported as well.

KerfuffleV2 · 2023-09-05T17:17:34Z

Isn't the name of the script technically inaccurate now?

Not just technically. :) I kind of wanted to just rename it, but I felt like that would be pretty confusing for existing users.

Maybe we should at least add something to --help

Sure, that's not a bad idea. It seems like for the most part, users don't really know the exact type of file they have anyway and will just try to run the script but some additional information certainly can't hurt.

KerfuffleV2 · 2023-09-05T17:47:18Z

You know what, I think I will just rename the file since ggmlv3 is wrong to start with. I don't know what I was thinking.

Include original file type information in description

cebtenzzre · 2023-09-06T05:46:52Z

convert-llama-ggml-to-gguf.py

+        if (self.file_format < GGMLFormat.GGJT or self.format_version < 2) and ftype not in (GGMLFType.ALL_F32, GGMLFType.MOSTLY_F16):
+            raise ValueError(f'Quantizations changed in GGJTv2. Can only convert unquantized GGML files older than GGJTv2. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')
+        if (self.file_format == GGMLFormat.GGJT and self.format_version == 2) and ftype in (GGMLFType.MOSTLY_Q4_0, GGMLFType.MOSTLY_Q4_1, GGMLFType.MOSTLY_Q4_1_SOME_F16, GGMLFType.MOSTLY_Q8_0):
+            raise ValueError(f'Q4 and Q8 quantizations changed in GGJTv3. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')


Lines of code should be at most 120 chars wide, for readability:

Suggested change

if (self.file_format < GGMLFormat.GGJT or self.format_version < 2) and ftype not in (GGMLFType.ALL_F32, GGMLFType.MOSTLY_F16):

raise ValueError(f'Quantizations changed in GGJTv2. Can only convert unquantized GGML files older than GGJTv2. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')

if (self.file_format == GGMLFormat.GGJT and self.format_version == 2) and ftype in (GGMLFType.MOSTLY_Q4_0, GGMLFType.MOSTLY_Q4_1, GGMLFType.MOSTLY_Q4_1_SOME_F16, GGMLFType.MOSTLY_Q8_0):

raise ValueError(f'Q4 and Q8 quantizations changed in GGJTv3. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')

if ($

(self.file_format < GGMLFormat.GGJT or self.format_version < 2)$

and ftype not in (GGMLFType.ALL_F32, GGMLFType.MOSTLY_F16)$

):$

raise ValueError($

'Quantizations changed in GGJTv2. Can only convert unquantized GGML files older than GGJTv2. Sorry, '$

f'your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for '$

'conversion.'$

)$

if ($

self.file_format == GGMLFormat.GGJT and self.format_version == 2$

and ftype in ($

GGMLFType.MOSTLY_Q4_0, GGMLFType.MOSTLY_Q4_1, GGMLFType.MOSTLY_Q4_1_SOME_F16, GGMLFType.MOSTLY_Q8_0,$

)$

):$

raise ValueError($

f'Q4 and Q8 quantizations changed in GGJTv3. Sorry, your {self.file_format.name}v{self.format_version} '$

f'file of type {ftype.name} is not eligible for conversion.'$

)

Thanks for the feedback.

In this particular case, I'm honestly not sure putting the time into improving the formatting is really worth it. I doubt anyone else is going to be messing with this script, and it's probably not even going to be in the repo in a month or so. But I made some formatting cleanups anyway.

Formatting changes to clean up some long lines

convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3

00fe3fd

Better error messages for files that cannot be converted Add file type to GGUF output

KerfuffleV2 added the script Script related label Sep 5, 2023

KerfuffleV2 mentioned this pull request Sep 5, 2023

Converting GGML->GGUF: ValueError: Only GGJTv3 supported #2990

Closed

ggerganov approved these changes Sep 5, 2023

View reviewed changes

cebtenzzre mentioned this pull request Sep 5, 2023

Model magic #3025

Open

Rename to convert-llama-ggml-to-gguf.py

645b6a2

Include original file type information in description

cebtenzzre reviewed Sep 6, 2023

View reviewed changes

Improve some informational output

69de44d

Formatting changes to clean up some long lines

KerfuffleV2 merged commit ea2c85d into ggerganov:master Sep 6, 2023

KerfuffleV2 deleted the feat-ggml-convert-improvements branch November 17, 2023 03:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3 #3023

convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3 #3023

KerfuffleV2 commented Sep 5, 2023 •

edited

Loading

KerfuffleV2 commented Sep 5, 2023

cebtenzzre commented Sep 5, 2023

KerfuffleV2 commented Sep 5, 2023

KerfuffleV2 commented Sep 5, 2023

cebtenzzre Sep 6, 2023

KerfuffleV2 Sep 6, 2023

-        if (self.file_format < GGMLFormat.GGJT or self.format_version < 2) and ftype not in (GGMLFType.ALL_F32, GGMLFType.MOSTLY_F16):
-            raise ValueError(f'Quantizations changed in GGJTv2. Can only convert unquantized GGML files older than GGJTv2. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')
-        if (self.file_format == GGMLFormat.GGJT and self.format_version == 2) and ftype in (GGMLFType.MOSTLY_Q4_0, GGMLFType.MOSTLY_Q4_1, GGMLFType.MOSTLY_Q4_1_SOME_F16, GGMLFType.MOSTLY_Q8_0):
-            raise ValueError(f'Q4 and Q8 quantizations changed in GGJTv3. Sorry, your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for conversion.')
+        if ($
+            (self.file_format < GGMLFormat.GGJT or self.format_version < 2)$
+            and ftype not in (GGMLFType.ALL_F32, GGMLFType.MOSTLY_F16)$
+        ):$
+            raise ValueError($
+                'Quantizations changed in GGJTv2. Can only convert unquantized GGML files older than GGJTv2. Sorry, '$
+                f'your {self.file_format.name}v{self.format_version} file of type {ftype.name} is not eligible for '$
+                'conversion.'$
+            )$
+        if ($
+            self.file_format == GGMLFormat.GGJT and self.format_version == 2$
+            and ftype in ($
+                GGMLFType.MOSTLY_Q4_0, GGMLFType.MOSTLY_Q4_1, GGMLFType.MOSTLY_Q4_1_SOME_F16, GGMLFType.MOSTLY_Q8_0,$
+            )$
+        ):$
+            raise ValueError($
+                f'Q4 and Q8 quantizations changed in GGJTv3. Sorry, your {self.file_format.name}v{self.format_version} '$
+                f'file of type {ftype.name} is not eligible for conversion.'$
+            )

convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3 #3023

convert-llama-ggmlv3-to-gguf: Try to handle files older than GGJTv3 #3023

Conversation

KerfuffleV2 commented Sep 5, 2023 • edited Loading

KerfuffleV2 commented Sep 5, 2023

cebtenzzre commented Sep 5, 2023

KerfuffleV2 commented Sep 5, 2023

KerfuffleV2 commented Sep 5, 2023

cebtenzzre Sep 6, 2023

Choose a reason for hiding this comment

KerfuffleV2 Sep 6, 2023

Choose a reason for hiding this comment

KerfuffleV2 commented Sep 5, 2023 •

edited

Loading