Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

SYCL: Add mrope kernel ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13755 opened May 24, 2025 by qnixsynapse Loading…
SYCL: add gelu_erf kernel ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13749 opened May 24, 2025 by qnixsynapse Loading…
SYCL: Implement few same quantized type copy kernels ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13739 opened May 24, 2025 by qnixsynapse Loading…
remove templates from soft_max_f32_submitter to allow SYCL graph updates ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13724 opened May 23, 2025 by lslusarczyk Loading…
sycl: add find_package call for OpenCL ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13643 opened May 19, 2025 by AD2605 Loading…
sycl: Add more debug prints ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13640 opened May 19, 2025 by Rbiessy Loading…
SYCL: Add non contiguous support in RMS_NORM and NORM kernels ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13611 opened May 18, 2025 by qnixsynapse Loading…
llama: Fix typos in multiple files ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13369 opened May 8, 2025 by co63oc Loading…
feat: Hybrid unified/recurrent cache Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#13276 opened May 2, 2025 by gabe-l-hart Loading…
llama : try loading tensors with pre-computed hashes Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language Vulkan Issues specific to the Vulkan backend
#13106 opened Apr 25, 2025 by rgerganov Loading…
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
ggml: move kvalues_iq4nl definition to ggml-common.h ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11785 opened Feb 10, 2025 by HungMingWu Loading…
Clean up Test Script + Update it to work on Instruct Tuned Models examples SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11610 opened Feb 3, 2025 by Mr-Thack Loading…
[SYCL] pass SYCL CI devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#10041 opened Oct 25, 2024 by airMeng Loading…
2 of 4 tasks
add print cpu info ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#9957 opened Oct 20, 2024 by NeoZhangJianyu Loading…
2 of 4 tasks
[Draft] Tensor Parallel support to llama.cpp ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#9648 opened Sep 26, 2024 by ClarkChin08 Loading…
1 of 3 tasks
Revert "ggml : remove OpenCL (#7735) + (#8235)" Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment python python script changes script Script related SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#8986 opened Aug 11, 2024 by okias Draft
2 of 4 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.