Skip to content

Issues: ggml-org/llama.cpp

examples : add configuration presets
#10932 opened Dec 21, 2024 by ggerganov
Open 3
changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 5
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 14
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

update rope_multi: ggml changes relating to the ggml tensor library for machine learning
#12665 opened Mar 31, 2025 by foldl Loading…
opencl : fix memory allocation size ggml changes relating to the ggml tensor library for machine learning
#12649 opened Mar 30, 2025 by sparkleholic Loading…
vulkan: Hybrid waitForFences/getFenceStatus to reduce fence latency ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12630 opened Mar 28, 2025 by jeffbolznv Loading…
vulkan: Implement split_k for coopmat2 flash attention. ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#12627 opened Mar 28, 2025 by jeffbolznv Loading…
opencl: remove a self-referential macro ggml changes relating to the ggml tensor library for machine learning
#12626 opened Mar 28, 2025 by linehill Loading…
sycl: allow ggml-sycl configuration and compilation using Visual Studio project/solution documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12625 opened Mar 28, 2025 by s-Nick Loading…
opencl: Add support for multiple devices ggml changes relating to the ggml tensor library for machine learning
#12622 opened Mar 28, 2025 by linehill Draft
Enable MMA for BF16 data types on Powerpc ggml changes relating to the ggml tensor library for machine learning
#12565 opened Mar 25, 2025 by shalinib-ibm Draft
vulkan: Implement grouped query attention in the coopmat2 FA shader ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12559 opened Mar 25, 2025 by jeffbolznv Loading…
ggml-quants : weighted rounding algorithms with cumulative search generation quality Quality of model output ggml changes relating to the ggml tensor library for machine learning Less than 4 bits Efforts related to viable quantized models using <4 bits research 🔬 Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
#12557 opened Mar 25, 2025 by compilade Draft
Draft: vulkan: Add bfloat16 support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12554 opened Mar 24, 2025 by jeffbolznv Loading…
Vulkan: Remove dedicated aligned matrix matrix multiplication shaders ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#12515 opened Mar 22, 2025 by 0cc4m Draft
cmake: Allow to configure GGML_BUILD_NUMBER with file ggml changes relating to the ggml tensor library for machine learning
#12509 opened Mar 22, 2025 by booxter Draft
Evenly and stably pinning thread pool ggml changes relating to the ggml tensor library for machine learning
#12488 opened Mar 21, 2025 by zts9989 Draft
(draft) tts: Orpheus support ggml changes relating to the ggml tensor library for machine learning python python script changes
#12487 opened Mar 21, 2025 by jamorphy Draft
Metal TQ2_0 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#12485 opened Mar 20, 2025 by dmahurin Loading…
[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588 ggml changes relating to the ggml tensor library for machine learning
#12459 opened Mar 19, 2025 by Corsair-cxs Loading…
ci: add Linux cross-compile build devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12428 opened Mar 17, 2025 by bandoti Loading…
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12383 opened Mar 14, 2025 by BodhiHu Draft
Fixed Eval Bug: 12163 : Fallback to CPU when loading model: vk::PhysicalDevice::createDevice: ErrorExtensionNotPresent. ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12329 opened Mar 11, 2025 by ashwini778 Loading…
PR: Refine ggml-hexagon backend(Qualcomm Hexagon NPU backend) for latest ggml,whisper.cpp,llama.cpp build Compilation issues ggml changes relating to the ggml tensor library for machine learning script Script related testing Everything test related
#12326 opened Mar 11, 2025 by zhouwg Loading…
1 task done
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
vulkan: optimization proposals for coopmat1 mul_mm ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12260 opened Mar 7, 2025 by remyoudompheng Draft
SYCL: Rename oneMKL to oneMath documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192 opened Mar 5, 2025 by Rbiessy Loading…
fix: AVX2 intrinsics, const correctness, and SIMD headers build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#12186 opened Mar 4, 2025 by sandboxyer Loading…
ProTip! Find all open issues with in progress development work with linked:pr.