
Update IPFS links to quantized alpaca with new tokenizer format #352

Merged
merged 1 commit into master on Mar 21, 2023

Conversation

antimatter15
Contributor

No description provided.

@Green-Sky
Collaborator

Green-Sky commented Mar 21, 2023

It's actually v1 now, but the magic changed to ggmf :)
edit: not sure how best to proceed.

@gjmulder
Collaborator

@antimatter15 I included sha256 checksums for the alpaca 7B, 13B and 30B models in pull request #338, but I suspect I have the old models.

Do we have a standardized naming convention for the alpaca model subdirs and model names that is consistent with the llama model dir and file names?

@gjmulder
Collaborator

> its actually v1 now. but the magic changed to ggmf :) edit: not sure how best to proceed.

How does one determine the version of a model? I ran `file` on the model data files, but it didn't return any magic numbers.

@Green-Sky
Collaborator

The first couple of bytes are the magic (in little endian, so reversed), plus a version:

```
0x67676d66, # magic: ggml in hex
1, # file version
```

(the comment there is out of date, it's ggmf now)

Before, it was just the ggml magic bytes and no version id.

(Just open any hex editor and check the first 4+1 bytes.)

The `file` command depends on a public data set of known magic ids; ggml, @ggerganov's personal library, is likely not known to it.
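That check can be sketched in Python. This is a minimal sketch, not the project's own tooling; it assumes (per the convert-script snippet above) that both the magic and the version field are written as little-endian 32-bit integers:

```python
import struct

# Known magics as 32-bit values (stored little-endian, so the bytes are reversed on disk)
GGML_MAGIC = 0x67676D6C  # 'ggml' -- old format, no version field
GGMF_MAGIC = 0x67676D66  # 'ggmf' -- versioned format

def read_header(path):
    """Return (magic_name, version) for a ggml/ggmf model file."""
    with open(path, "rb") as f:
        magic = struct.unpack("<I", f.read(4))[0]
        if magic == GGML_MAGIC:
            return "ggml", None  # old files carry no version id
        if magic == GGMF_MAGIC:
            version = struct.unpack("<I", f.read(4))[0]
            return "ggmf", version
        return hex(magic), None  # unknown magic; report it raw
```

A current ggmf file should come back as `("ggmf", 1)`.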

@ggerganov
Owner

Thanks! It would be useful to add links and instructions for the bigger Alpaca models as well.

@ggerganov ggerganov merged commit e0ffc86 into master Mar 21, 2023
@j-f1 j-f1 deleted the antimatter15-patch-1 branch March 21, 2023 15:42
@gjmulder
Collaborator

FYI @antimatter15 I'd like to provide sha256 sums for #238 for the alpaca models, if we're going to support them.

llama.cpp/models$ cat chk_versions.sh 
#!/bin/sh

for B in *B/ggml-model*bin*; do
	xxd "$B" | head -1 | awk -v model="$B" '{printf("Model: %30s, magic: 0x%8s, version: 0x%4s\n", model, $3$2, $4)}'
done
llama.cpp/models$ ./chk_versions.sh | sort -nk 2
Model:  alpaca-7B/ggml-model-q4_0.bin, magic: 0x67676c6d, version: 0x007d
Model: alpaca-13B/ggml-model-q4_0.bin, magic: 0x67676c6d, version: 0x007d
Model: alpaca-30B/ggml-model-q4_0.bin, magic: 0x67676c6d, version: 0x007d
Model:          7B/ggml-model-f16.bin, magic: 0x6767666d, version: 0x0100
Model:         7B/ggml-model-q4_0.bin, magic: 0x6767666d, version: 0x0100
Model:         13B/ggml-model-f16.bin, magic: 0x6767666d, version: 0x0100
Model:        13B/ggml-model-q4_0.bin, magic: 0x6767666d, version: 0x0100
Model:       13B/ggml-model-f16.bin.1, magic: 0x6767666d, version: 0x0100
Model:      13B/ggml-model-q4_0.bin.1, magic: 0x6767666d, version: 0x0100
Model:         30B/ggml-model-f16.bin, magic: 0x6767666d, version: 0x0100
Model:        30B/ggml-model-q4_0.bin, magic: 0x6767666d, version: 0x0100
Model:       30B/ggml-model-f16.bin.1, magic: 0x6767666d, version: 0x0100
Model:       30B/ggml-model-f16.bin.2, magic: 0x6767666d, version: 0x0100
Model:       30B/ggml-model-f16.bin.3, magic: 0x6767666d, version: 0x0100
Model:      30B/ggml-model-q4_0.bin.1, magic: 0x6767666d, version: 0x0100
Model:      30B/ggml-model-q4_0.bin.2, magic: 0x6767666d, version: 0x0100
Model:      30B/ggml-model-q4_0.bin.3, magic: 0x6767666d, version: 0x0100
Model:         65B/ggml-model-f16.bin, magic: 0x6767666d, version: 0x0100
Model:        65B/ggml-model-q4_0.bin, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.1, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.2, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.3, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.4, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.5, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.6, magic: 0x6767666d, version: 0x0100
Model:       65B/ggml-model-f16.bin.7, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.1, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.2, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.3, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.4, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.5, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.6, magic: 0x6767666d, version: 0x0100
Model:      65B/ggml-model-q4_0.bin.7, magic: 0x6767666d, version: 0x0100
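The sha256 sums mentioned above can be generated with something like the following. This is a minimal sketch (`sha256_file` is a hypothetical helper, not from the repo), reading in chunks so the multi-GB model files never need to fit in memory:

```python
import hashlib

def sha256_file(path, chunk_size=1 << 20):
    """Compute the sha256 digest of a file, reading it in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            h.update(chunk)
    return h.hexdigest()
```

Running it over `*B/ggml-model*bin*` (as in the script above) would give one hex digest per model file.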

@ggerganov
Owner

OK, so it looks like these alpaca models do not use the latest tokenizer from #252.
That makes them incompatible with the latest master branch.
They can probably be converted to a compatible format with this script: #324 (comment)

AAbushady pushed a commit to AAbushady/llama.cpp that referenced this pull request Jan 27, 2024
…#352)

* Hide unavailable backends & Add tooltip over backend count

Hides unavailable backends from the user; if the program is launched without any backends built, it shows an error message stating that no backends were found and that they should be built with the 'make' command

Add tooltip when hovering over backend count label

Hovering over the new label that shows the backend count will explain what the numbers are, and show which backends are not available or built

* add some code comments

* hide "missing" if all are built

Move tooltip functions to the helper functions section. Hides the string "Missing: ..." if all backends are available.

* small typo fix

* remove wrongly added leftover device choosing code

* fix labels

* move tooltip to function

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>