-
Notifications
You must be signed in to change notification settings - Fork 11.3k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
update changes relating to the ggml tensor library for machine learning
rope_multi
:
ggml
#12665
opened Mar 31, 2025 by
foldl
Loading…
opencl : fix memory allocation size
ggml
changes relating to the ggml tensor library for machine learning
#12649
opened Mar 30, 2025 by
sparkleholic
Loading…
vulkan: Hybrid waitForFences/getFenceStatus to reduce fence latency
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12630
opened Mar 28, 2025 by
jeffbolznv
Loading…
vulkan: Implement split_k for coopmat2 flash attention.
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12627
opened Mar 28, 2025 by
jeffbolznv
Loading…
opencl: remove a self-referential macro
ggml
changes relating to the ggml tensor library for machine learning
#12626
opened Mar 28, 2025 by
linehill
Loading…
sycl: allow ggml-sycl configuration and compilation using Visual Studio project/solution
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12625
opened Mar 28, 2025 by
s-Nick
Loading…
opencl: Add support for multiple devices
ggml
changes relating to the ggml tensor library for machine learning
Enable MMA for BF16 data types on Powerpc
ggml
changes relating to the ggml tensor library for machine learning
#12565
opened Mar 25, 2025 by
shalinib-ibm
•
Draft
vulkan: Implement grouped query attention in the coopmat2 FA shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12559
opened Mar 25, 2025 by
jeffbolznv
Loading…
ggml-quants : weighted rounding algorithms with cumulative search
generation quality
Quality of model output
ggml
changes relating to the ggml tensor library for machine learning
Less than 4 bits
Efforts related to viable quantized models using <4 bits
research 🔬
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Tensor Encoding Scheme
https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
Draft: vulkan: Add bfloat16 support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12554
opened Mar 24, 2025 by
jeffbolznv
Loading…
cmake: Allow to configure GGML_BUILD_NUMBER with file
ggml
changes relating to the ggml tensor library for machine learning
Evenly and stably pinning thread pool
ggml
changes relating to the ggml tensor library for machine learning
Metal TQ2_0
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12485
opened Mar 20, 2025 by
dmahurin
Loading…
[Issue #12458] Temporarily Clamp inf Values in ggml-cpu.c to Prevent Garbled Output(or coredump) on RK3588
ggml
changes relating to the ggml tensor library for machine learning
#12459
opened Mar 19, 2025 by
Corsair-cxs
Loading…
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Fixed Eval Bug: 12163 : Fallback to CPU when loading model: vk::PhysicalDevice::createDevice: ErrorExtensionNotPresent.
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12329
opened Mar 11, 2025 by
ashwini778
Loading…
PR: Refine ggml-hexagon backend(Qualcomm Hexagon NPU backend) for latest ggml,whisper.cpp,llama.cpp
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
testing
Everything test related
#12326
opened Mar 11, 2025 by
zhouwg
Loading…
1 task done
tool-call
: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
Loading…
vulkan: optimization proposals for coopmat1 mul_mm
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12260
opened Mar 7, 2025 by
remyoudompheng
•
Draft
SYCL: Rename oneMKL to oneMath
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12192
opened Mar 5, 2025 by
Rbiessy
Loading…
fix: AVX2 intrinsics, const correctness, and SIMD headers
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#12186
opened Mar 4, 2025 by
sandboxyer
Loading…
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.