A deterministic, random-access archive format and toolkit for compressing, storing, diffing and patching large language model weights.
rust machine-learning inference llama quantization model-compression model-weights tensor-compression huggingface llm safetensors gguf llm-compression model-distribution weight-compression
-
Updated
May 27, 2026 - Rust