Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
gguf : add file format specification (#302)
* docs: gguf spec first pass * docs(gguf): update with review comments * docs(gguf): update with review comments * docs(gguf): quant version optional for unquant * docs(gguf): normalize naming, add whisper * docs(gguf): more review updates * docs(gguf): add norm eps and added_tokens * docs(gguf): move padding * docs(gguf): remove migration tool * docs(gguf): make offset base explicit * docs(gguf): fix replace oops * docs(gguf): alignment metadata+tensor name len max * docs(gguf): clarification, fixes, tensor names * docs(gguf): clarify license * docs(gguf): minor tweaks * docs(gguf): data layout, GQA eq, no ft, LE GGUF * docs(gguf): fix magic order * docs(gguf): match impl * docs(gguf): specify fallback alignment * docs(gguf): remove TensorInfo::n_elements * docs(gguf): filetype, rope base/linear scale * docs(gguf): v2 - uint64 all the things * docs(gguf): tweak extensibility wording * docs(gguf): fix spec discrepancies * docs(gguf): v3 + other fixes * fix(editorconfig): use 2-space tabs for markdown * docs(gguf): clarify big-endian
- Loading branch information