    Repositories list

    • koboldcpp
      Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
      C++
      GNU Affero General Public License v3.0
      11k100 Updated Jan 1, 2025
    • Run flux1/sd3 models on an entry-level (low-cost) GPU or even a CPU
      Python
      Apache License 2.0
      3000 Updated Dec 21, 2024
    • Local AI API Platform
      C++
      Apache License 2.0
      151000 Updated Dec 20, 2024
    • Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
      Go
      MIT License
      8000 Updated Dec 12, 2024
    • hyllama
      llama.cpp GGUF file parser for JavaScript
      JavaScript
      MIT License
      2000 Updated Dec 11, 2024
    • gguf (GPT-Generated Unified Format) connector
      Python
      MIT License
      1000 Updated Dec 9, 2024
    • gguf-core
      a simple way to interact with llama using gguf
      Python
      MIT License
      1000 Updated Dec 3, 2024
    • cgg
      call GGUF model
      Python
      MIT License
      1000 Updated Dec 3, 2024
    • This project demonstrates how to download a model from Hugging Face, convert it to GGUF format, and upload it back to Hugging Face using a Colab notebook.
      Jupyter Notebook
      MIT License
      1000 Updated Nov 16, 2024
    • Deliver LLMs of GGUF format via Dockerfile.
      Go
      MIT License
      2000 Updated Oct 24, 2024
    • C#
      MIT License
      1000 Updated Oct 21, 2024
    • Hugging Face Model downloader and GGUF Converter.
      Python
      MIT License
      2000 Updated Oct 5, 2024
    • maid_llm
      maid_llm is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
      Dart
      MIT License
      16000 Updated Aug 27, 2024
    • LLM quantization techniques: absmax, zero-point, GPTQ and GGUF
      Jupyter Notebook
      1000 Updated Aug 2, 2024
    • GGUF Reader .NET facilitates reading GGUF files from different LLMs in .NET Core 8. It includes features for dynamic DLL loading, GGUF file interpretation, and interactive prompt execution for advanced operations.
      C#
      MIT License
      1000 Updated Jul 22, 2024
    • GGUF implementation in C as a library and a tools CLI program
      C
      MIT License
      17000 Updated Jul 3, 2024
    • ggufer
      Convert & quantize HuggingFace models using llama.cpp on premises
      Jupyter Notebook
      MIT License
      1000 Updated May 25, 2024
    • A small utility library for parsing GGUF file info
      Rust
      MIT License
      3000 Updated May 23, 2024
    • A simple web interface for GGUF-format LLMs run with llama-cpp-python (llama.cpp).
      Python
      MIT License
      2000 Updated May 12, 2024
    • GGUF selector
      Python
      MIT License
      1000 Updated Feb 29, 2024
    • callgg
      GGUF caller
      Python
      MIT License
      1000 Updated Feb 28, 2024
    • print_gguf.py is a simple utility to parse the header & tensor_infos of a GGUF file.
      Python
      MIT License
      1000 Updated Feb 1, 2024
    • Wrapper for simplified use of Llama2 GGUF quantized models.
      Python
      Other
      2000 Updated Jan 14, 2024
    • Some random tools for working with the GGUF file format
      Python
      MIT License
      5000 Updated Nov 24, 2023