-
Notifications
You must be signed in to change notification settings - Fork 13.9k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
server: fixing naming conflict res_error in server-models.cpp
examples
server
#17679
opened Dec 2, 2025 by
w169q169
Loading…
vulkan: enable mmvq for q2_k on NVIDIA
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17675
opened Dec 2, 2025 by
jeffbolznv
Loading…
vulkan: perf_logger improvements
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17672
opened Dec 2, 2025 by
jeffbolznv
Loading…
Add a couple of file types to the text section
examples
server
#17670
opened Dec 1, 2025 by
pwilkin
Loading…
vulkan: fix top_k bug when there are ties in the input
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17659
opened Dec 1, 2025 by
jeffbolznv
Loading…
ggml-cpu: Add operator-level execution time profiling
ggml
changes relating to the ggml tensor library for machine learning
#17657
opened Dec 1, 2025 by
kimminsu38oo
Loading…
ggml: use 'exists( const std::filesystem::path&, std::error_code&)' instead of 'exists( const std::filesystem::path&)' to enhance robustness
ggml
changes relating to the ggml tensor library for machine learning
#17653
opened Dec 1, 2025 by
flyinskyin2013
Loading…
ggml: added missing cast sections in memcpy
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17651
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-cpu: remove duplicate conditional check 'iid'
ggml
changes relating to the ggml tensor library for machine learning
#17650
opened Dec 1, 2025 by
GermanAizek
Loading…
gguf: llama: use changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
= default for trivial constructors and destructors
ggml
#17649
opened Dec 1, 2025 by
GermanAizek
Loading…
sgemm: reuse loaded vector in AVX dot product calculation
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17648
opened Dec 1, 2025 by
GermanAizek
Loading…
llama-vocab: replace postfix with prefix increment for iterators
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17646
opened Dec 1, 2025 by
GermanAizek
Loading…
vec: optimize AVX2/FMA sum-of-squares with loop unrolling and FMA
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17642
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-quants: use _mm256_testz_si256 for mask checks in AVX2
ggml
changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
#17641
opened Dec 1, 2025 by
GermanAizek
Loading…
ggml-alloc: optimize free block shifting with changes relating to the ggml tensor library for machine learning
vibe-coded
Created with heavy use of LLM assistants, requires human verification
memmove
ggml
#17640
opened Dec 1, 2025 by
GermanAizek
Loading…
vulkan: Replace deprecated VK_EXT_validation_features
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17637
opened Dec 1, 2025 by
rillomas
Loading…
common : compute average token length from vocabulary
#17632
opened Dec 1, 2025 by
yifant-code
•
Draft
llama-router, the C++ "llama-swap" for llama.cpp
examples
need feedback
Testing and feedback with results are needed
server
testing
Everything test related
#17629
opened Nov 30, 2025 by
ServeurpersoCom
•
Draft
vulkan: set all memory allocations to high priority
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17624
opened Nov 30, 2025 by
jeffbolznv
•
Draft
vulkan: Reduce temporary memory usage for TOP_K
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17623
opened Nov 30, 2025 by
jeffbolznv
Loading…
model : Fix marker placement for LFM2-VL in single turn llama-mtmd-cli
examples
#17616
opened Nov 30, 2025 by
tdakhran
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.