-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17147
opened Nov 10, 2025 by
SavicStefan
Loading…
metal : cap threadgroups size of set_rows
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17146
opened Nov 10, 2025 by
ggerganov
Loading…
metal : make the FA extra sizes consistent
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17143
opened Nov 10, 2025 by
ggerganov
Loading…
server/public_simplechat vision (wip), toolcall (done, with 0 setup clientside builtin tools+), reasoing(done)
examples
python
python script changes
server
#17142
opened Nov 10, 2025 by
hanishkvc
Loading…
Add complete Megrez-MoE support: GGUF conversion + inference.
model
Model specific
python
python script changes
#17141
opened Nov 10, 2025 by
tamarPal
Loading…
hexagon: various Op fixes
ggml
changes relating to the ggml tensor library for machine learning
#17135
opened Nov 10, 2025 by
max-krasnyansky
•
Draft
vulkan: disable rms_norm + mul + rope for old gpus
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17134
opened Nov 10, 2025 by
netrunnereve
Loading…
cpu: skip NOPs to avoid barriers
ggml
changes relating to the ggml tensor library for machine learning
#17133
opened Nov 10, 2025 by
max-krasnyansky
Loading…
SYCL: add full support for ABS unary op
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17126
opened Nov 9, 2025 by
shani-f
Loading…
llama: introduce support for model-embedded sampling parameters
python
python script changes
#17120
opened Nov 9, 2025 by
taronaeo
Loading…
rpc : fix alloc size logic
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack
ggml
changes relating to the ggml tensor library for machine learning
#17113
opened Nov 8, 2025 by
NoahOksuz
Loading…
webui : add keyboard shortcut to toggle sidebar
examples
server
#17099
opened Nov 8, 2025 by
danbev
Loading…
Add Metal-4 Tensor API test harness for iOS
examples
#17098
opened Nov 8, 2025 by
ArjunDivecha
Loading…
CUDA: support F32 kernel type for changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
CONV_TRANSPOSE_2D
ggml
#17094
opened Nov 8, 2025 by
AgainstEntropy
Loading…
add version to all shared object files
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
Ascend NPU
issues specific to Ascend NPUs
examples
ggml
changes relating to the ggml tensor library for machine learning
IBM zDNN
issues specific to IBM zDNN Accelerator
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#17091
opened Nov 7, 2025 by
furrysalamander
Loading…
opencl: add fastdiv and use it in set_rows, ported from cuda
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
HIP: RDNA4 tensor core support for MMF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17077
opened Nov 7, 2025 by
zhang-hui-yulo
Loading…
[RFC] ggml: new backend for API Remoting
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#17072
opened Nov 7, 2025 by
kpouget
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.