Issues: ggml-org/llama.cpp
Issues list
Misc. bug: Speed degradation in bin-win-cpu-x64 compared to bin-win-avx2-x64 on Intel Core i7-12700H
bug
Something isn't working
#13664
opened May 20, 2025 by
howlger
Misc. bug: gguf-new-metadata and gguf-editor-gui changes all integer arrays to INT32
bug
Something isn't working
#13557
opened May 15, 2025 by
CISC
Eval bug: llama-cli, spurious token added to assistant response
bug
Something isn't working
#13402
opened May 9, 2025 by
matteoserva
server : crash when -b > -ub with embeddings
bug
Something isn't working
embeddings
embedding related topics
good first issue
Good for newcomers
server
#12836
opened Apr 8, 2025 by
ggerganov
Misc. bug: llama-quantize clobbers input file + crashes when output file matches
bug
Something isn't working
#12753
opened Apr 4, 2025 by
m18coppola
Misc. bug: The inference speed of llama-server is one-third of that of llama-cli
bug
Something isn't working
#12171
opened Mar 4, 2025 by
zts9989
Misc. bug: The KV cache is sometimes truncated incorrectly when making v1/chat/completions API calls
bug
Something isn't working
high priority
Very important issue
#11970
opened Feb 20, 2025 by
vnicolici
Misc. bug: ROCm images cannot be found
bug
Something isn't working
#11913
opened Feb 16, 2025 by
ExposedCat
Eval bug: Error running multiple contexts from multiple threads at the same time with Vulkan
bug
Something isn't working
#11371
opened Jan 23, 2025 by
charlesrwest
Eval bug: segfault on Alpine linux docker image
bug
Something isn't working
#11308
opened Jan 20, 2025 by
pepijndevos
Compile bug: Emulated Linux ARM64 CPU build fails
bug
Something isn't working
build
Compilation issues
#10933
opened Dec 21, 2024 by
SamuelTallet
Misc. bug: softmax may get error answer when src0->ne[3]!=1 on cuda
bug
Something isn't working
#10683
opened Dec 6, 2024 by
A3shTnT
Misc. bug: interface for model quantization is not fully C-compatible
bug
Something isn't working
#10614
opened Dec 1, 2024 by
JohannesGaessler
Misc. bug: inconsistent locale for printing GGUF kv data across examples
bug
Something isn't working
#10613
opened Dec 1, 2024 by
JohannesGaessler
Misc. bug: -sm row does not work with --device
bug
Something isn't working
#10533
opened Nov 26, 2024 by
mostlygeek
Misc. bug: Inconsistent Vulkan segfault
bug
Something isn't working
#10528
opened Nov 26, 2024 by
RobbyCBennett
Bug: Severe Performance Degradation on Q4_0 CPU-only with MacOS / Apple Silicon M2, after PR#9921 / Version 4081
bug
Something isn't working
#10435
opened Nov 20, 2024 by
AndreasKunar
Bug: Vulkan vk::DeviceLostError with multithreaded environment
bug
Something isn't working
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10420
opened Nov 20, 2024 by
ddwkim
Bug: llama-gbnf-validator parses grammar but gets a seg fault when validating an input string against the grammar
bug
Something isn't working
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#10321
opened Nov 15, 2024 by
nissenbenyitskhak
Bug: Segmentation fault when running speculative decoding
bug
Something isn't working
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#9949
opened Oct 19, 2024 by
rationalism
Bug: Unexpected output length (Only one token response!) when set configs "-n -2 -c 256" for llama-server
bug
Something isn't working
good first issue
Good for newcomers
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#9933
opened Oct 18, 2024 by
morgen52
Bug: Failed to run qwen2-57b-a14b-instruct-fp16.
bug
Something isn't working
good first issue
Good for newcomers
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#9628
opened Sep 24, 2024 by
tang-t21
Bug: llama-server api first query very slow
bug
Something isn't working
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#9492
opened Sep 15, 2024 by
bosmart
Bug: loading llava models fails
bug
Something isn't working
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#9455
opened Sep 12, 2024 by
mudler
Bug: cannot create std::vector larger than max_size()
bug
Something isn't working
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#9391
opened Sep 9, 2024 by
imhoffman