Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

Memory tests testing Everything test related
#13669 opened May 20, 2025 by gabe-l-hart Loading…
Granite Four Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#13550 opened May 14, 2025 by gabe-l-hart Draft
2 tasks
feat: Hybrid unified/recurrent cache Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#13276 opened May 2, 2025 by gabe-l-hart Loading…
Vulkan: Remove dedicated aligned matrix matrix multiplication shaders ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#12515 opened Mar 22, 2025 by 0cc4m Draft
server: streaming of tool calls and thoughts when --jinja is on documentation Improvements or additions to documentation examples python python script changes script Script related server testing Everything test related tool calling
#12379 opened Mar 14, 2025 by ochafik Draft
5 of 10 tasks
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
tests: use adaptive number of threads testing Everything test related
#12236 opened Mar 6, 2025 by JohannesGaessler Loading…
Supporting Velvet model python python script changes testing Everything test related
#11716 opened Feb 6, 2025 by fbuciuni90 Loading…
tool-call: add support for tool-calls using Model Context Protocol build Compilation issues examples server testing Everything test related
#11556 opened Jan 31, 2025 by bandoti Loading…
8 of 12 tasks
Move gguf fuzzers to the llama.cpp repository enhancement New feature or request roadmap Part of a roadmap project testing Everything test related
#11514 opened Jan 30, 2025 by slaren
Allow s390x to load little endian models unmodified examples ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#11234 opened Jan 14, 2025 by AlekseiNikiforovIBM Loading…
ggml-cuda : add TQ2_0 kernels, for ternary inference on GPU Apple Metal https://en.wikipedia.org/wiki/Metal_(API) enhancement New feature or request ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs performance Speed related topics python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs testing Everything test related
#11183 opened Jan 10, 2025 by compilade Loading…
Bamba architecture Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#10810 opened Dec 12, 2024 by gabe-l-hart Draft
3 tasks
Add try/except to test-tokenizer-random.py python python script changes testing Everything test related
#10276 opened Nov 13, 2024 by rmusser01 Loading…
2 of 4 tasks
Test tokenizer-0.py rewrite python python script changes testing Everything test related
#10275 opened Nov 13, 2024 by rmusser01 Loading…
2 of 4 tasks
main: add test-cli + ensure completion goes to stdout even w/ --log-disable examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#10102 opened Oct 30, 2024 by ochafik Draft
add FP8 support to gguf/llama: build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning script Script related Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes testing Everything test related
#10055 opened Oct 26, 2024 by Djip007 Draft
1 of 3 tasks
sampling: add K-Shift sampler examples server testing Everything test related
#10048 opened Oct 25, 2024 by MaggotHATE Loading…
2 of 4 tasks
[SYCL] pass SYCL CI devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#10041 opened Oct 25, 2024 by airMeng Loading…
2 of 4 tasks
llama : add nvidia nemotron chat template (not-working due to bad tokenizer) testing Everything test related
#9869 opened Oct 12, 2024 by ngxson Draft
2 tasks done
llama : adds llama-grammar memoization stacks (#4218) examples testing Everything test related
#9833 opened Oct 11, 2024 by clarismiranda Loading…
2 of 4 tasks
naming : normalize the name of callback-related identifiers Apple Metal https://en.wikipedia.org/wiki/Metal_(API) breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#9405 opened Sep 10, 2024 by ggerganov Loading…
llama : initial Mamba-2 support Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level testing Everything test related
#9126 opened Aug 21, 2024 by compilade Loading…
8 of 9 tasks
server: add repeat penalty sigmoid examples server testing Everything test related
#9076 opened Aug 18, 2024 by z80maniac Loading…
2 of 4 tasks
llama : tokenizer unicode codepoint categories python python script changes Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level script Script related testing Everything test related
#8606 opened Jul 20, 2024 by jaime-m-p Loading…
2 of 4 tasks
ProTip! Add no:assignee to see everything that’s not assigned.