Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vendor: split httplib to cpp/h files
#17150 opened Nov 10, 2025 by ngxson Loading…
Install rpc-server when GGML_RPC is ON.
#17149 opened Nov 10, 2025 by nbp Loading…
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17147 opened Nov 10, 2025 by SavicStefan Loading…
metal : cap threadgroups size of set_rows Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17146 opened Nov 10, 2025 by ggerganov Loading…
metal : make the FA extra sizes consistent Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17143 opened Nov 10, 2025 by ggerganov Loading…
Add complete Megrez-MoE support: GGUF conversion + inference. model Model specific python python script changes
#17141 opened Nov 10, 2025 by tamarPal Loading…
common : implement parser combinators for chat parsing [WIP] testing Everything test related
#17136 opened Nov 10, 2025 by aldehir Draft
5 of 9 tasks
hexagon: various Op fixes ggml changes relating to the ggml tensor library for machine learning
#17135 opened Nov 10, 2025 by max-krasnyansky Draft
vulkan: disable rms_norm + mul + rope for old gpus ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17134 opened Nov 10, 2025 by netrunnereve Loading…
cpu: skip NOPs to avoid barriers ggml changes relating to the ggml tensor library for machine learning
#17133 opened Nov 10, 2025 by max-krasnyansky Loading…
SYCL: add full support for ABS unary op documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17126 opened Nov 9, 2025 by shani-f Loading…
llama: introduce support for model-embedded sampling parameters python python script changes
#17120 opened Nov 9, 2025 by taronaeo Loading…
rpc : fix alloc size logic Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17116 opened Nov 9, 2025 by ggerganov Draft
2 tasks
Refactor: convert_hf_to_gguf.py python python script changes refactoring Refactoring
#17114 opened Nov 9, 2025 by pwilkin Draft
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack ggml changes relating to the ggml tensor library for machine learning
#17113 opened Nov 8, 2025 by NoahOksuz Loading…
CUDA: support F32 kernel type for CONV_TRANSPOSE_2D ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17094 opened Nov 8, 2025 by AgainstEntropy Loading…
add version to all shared object files Apple Metal https://en.wikipedia.org/wiki/Metal_(API) Ascend NPU issues specific to Ascend NPUs examples ggml changes relating to the ggml tensor library for machine learning IBM zDNN issues specific to IBM zDNN Accelerator Nvidia GPU Issues specific to Nvidia GPUs OpenCL Issues specific to the OpenCL backend SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#17091 opened Nov 7, 2025 by furrysalamander Loading…
opencl: add fastdiv and use it in set_rows, ported from cuda ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#17090 opened Nov 7, 2025 by lhez Draft
convert: (demo) repacking compressed_tensor format of kimi-k2 python python script changes
#17083 opened Nov 7, 2025 by ngxson Draft
HIP: RDNA4 tensor core support for MMF ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17077 opened Nov 7, 2025 by zhang-hui-yulo Loading…
[RFC] ggml: new backend for API Remoting Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#17072 opened Nov 7, 2025 by kpouget Loading…
ProTip! Exclude everything labeled bug with -label:bug.