Issues: ggml-org/llama.cpp
Issues list
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
Draft
2 tasks
feat: Hybrid unified/recurrent cache
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#13276
opened May 2, 2025 by
gabe-l-hart
server: streaming of tool calls and thoughts when --jinja is on
documentation
tool-call: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
tests: use adaptive number of threads
testing
Everything test related
#12236
opened Mar 6, 2025 by
JohannesGaessler
Supporting Velvet model
python
python script changes
testing
Everything test related
#11716
opened Feb 6, 2025 by
fbuciuni90
Move gguf fuzzers to the llama.cpp repository
enhancement
New feature or request
roadmap
Part of a roadmap project
testing
Everything test related
#11514
opened Jan 30, 2025 by
slaren
Allow s390x to load little endian models unmodified
examples
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#11234
opened Jan 14, 2025 by
AlekseiNikiforovIBM
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
enhancement
New feature or request
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
performance
Speed related topics
python
python script changes
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
testing
Everything test related
#11183
opened Jan 10, 2025 by
compilade
Bamba architecture
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#10810
opened Dec 12, 2024 by
gabe-l-hart
Draft
3 tasks
Add try/except to test-tokenizer-random.py
python
python script changes
testing
Everything test related
#10276
opened Nov 13, 2024 by
rmusser01
2 of 4 tasks
Test tokenizer-0.py rewrite
python
python script changes
testing
Everything test related
#10275
opened Nov 13, 2024 by
rmusser01
2 of 4 tasks
add FP8 support to gguf/llama:
build
Compilation issues
examples
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
Tensor Encoding Scheme
https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
testing
Everything test related
sampling: add K-Shift sampler
examples
server
testing
Everything test related
#10048
opened Oct 25, 2024 by
MaggotHATE
2 of 4 tasks
[SYCL] pass SYCL CI
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#10041
opened Oct 25, 2024 by
airMeng
2 of 4 tasks
llama : add nvidia nemotron chat template (not-working due to bad tokenizer)
testing
Everything test related
llama : adds llama-grammar memoization stacks (#4218)
examples
testing
Everything test related
#9833
opened Oct 11, 2024 by
clarismiranda
2 of 4 tasks
naming : normalize the name of callback-related identifiers
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#9405
opened Sep 10, 2024 by
ggerganov
llama : initial Mamba-2 support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
testing
Everything test related
#9126
opened Aug 21, 2024 by
compilade
8 of 9 tasks
llama : tokenizer unicode codepoint categories
python
python script changes
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
script
Script related
testing
Everything test related
#8606
opened Jul 20, 2024 by
jaime-m-p
2 of 4 tasks