Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

[CANN]: add the basic supports of Flash Attention kernel Ascend NPU issues specific to Ascend NPUs documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#13627 opened May 19, 2025 by shibizhao Loading…
feat: Hybrid unified/recurrent cache Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#13276 opened May 2, 2025 by gabe-l-hart Loading…
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE Ascend NPU issues specific to Ascend NPUs documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#13104 opened Apr 25, 2025 by bachelor-dou Loading…
server: streaming of tool calls and thoughts when --jinja is on documentation Improvements or additions to documentation examples python python script changes script Script related server testing Everything test related tool calling
#12379 opened Mar 14, 2025 by ochafik Draft
5 of 10 tasks
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
Fix rocWMMA build documentation documentation Improvements or additions to documentation
#12243 opened Mar 7, 2025 by Headcrabed Loading…
Add information for Podman as well as Docker documentation Improvements or additions to documentation
#11660 opened Feb 4, 2025 by rhatdan Loading…
examples : add configuration presets documentation Improvements or additions to documentation enhancement New feature or request examples good first issue Good for newcomers help wanted Extra attention is needed
#10932 opened Dec 21, 2024 by ggerganov
2 of 6 tasks
Cuda build doc documentation Improvements or additions to documentation
#10743 opened Dec 10, 2024 by YannFollet Loading…
Refactor/tinyblas build Compilation issues demo Demonstrate some concept or idea, not intended to be merged documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#10343 opened Nov 16, 2024 by Djip007 Draft
2 of 4 tasks
chore : Fix the error when compiling rocm build on windows using cmake documentation Improvements or additions to documentation
#10310 opened Nov 15, 2024 by cocochick Loading…
2 of 4 tasks
changelog : llama-server REST API documentation Improvements or additions to documentation roadmap Part of a roadmap project
#9291 opened Sep 3, 2024 by ggerganov
changelog : libllama API documentation Improvements or additions to documentation roadmap Part of a roadmap project
#9289 opened Sep 3, 2024 by ggerganov
Revert "ggml : remove OpenCL (#7735) + (#8235)" Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment python python script changes script Script related SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#8986 opened Aug 11, 2024 by okias Draft
2 of 4 tasks
Added perplexity metrics for llama 3.1 with different quantization se… documentation Improvements or additions to documentation examples Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8924 opened Aug 8, 2024 by fedric95 Loading…
1 of 3 tasks
Added dir-assistant to UI projects documentation Improvements or additions to documentation Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#8814 opened Aug 1, 2024 by curvedinf Loading…
2 of 4 tasks
Bug: Grammar readme seems incorrect bug Something isn't working documentation Improvements or additions to documentation low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#7720 opened Jun 3, 2024 by thekevinscott
server: doc: document the --defrag-thold option documentation Improvements or additions to documentation enhancement New feature or request help wanted Extra attention is needed server/webui
#6293 opened Mar 25, 2024 by phymbert
ProTip! no:milestone will show everything without a milestone.