Issues: ggerganov/llama.cpp
Feature Request: Support Zyphra/Zamba2-2.7B
enhancement
New feature or request
model
Model specific
#8795
opened Jul 31, 2024 by
tomasmcm
Bug: Phi-3 4K output broken after 2000~ tokens (Reproducible)
bug
Something isn't working
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
model
Model specific
#7709
opened Jun 3, 2024 by
Amadeus-AI
llama : support Jamba hybrid Transformer-Mamba models
android
Issues specific to Android
embeddings
embedding related topics
enhancement
New feature or request
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
need feedback
Testing and feedback with results are needed
python
python script changes
refactoring
Refactoring
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
server
Clamp out of range values in K quantizer
bugfix
fixes an issue or bug
model
Model specific
Review Complexity : Medium
Generally require more time to grok but manageable by beginner to medium expertise level
Support for Phi-3 models
good first issue
Good for newcomers
model
Model specific
#6849
opened Apr 23, 2024 by
criminact
Support for RecurrentGemma (Gemma with Griffin Architecture)
enhancement
New feature or request
model
Model specific
stale
#6564
opened Apr 9, 2024 by
TechxGenus
4 tasks done
Adding Support for Custom Qwen2moe Architectures with mergekit-qwen2
model
Model specific
Review Complexity : High
Generally require indepth knowledge of LLMs or GPUs
persimmon crashes with CUDA: assertion failure ggml_is_contiguous(src0)
bug
Something isn't working
model
Model specific
#5823
opened Mar 1, 2024 by
cebtenzzre
Request: Nougat OCR Integration
help wanted
Extra attention is needed
model
Model specific
#3294
opened Sep 21, 2023 by
OhadRubin
Please support the also official Falcon-rw-1b and Falcon-rw-7b model variants
good first issue
Good for newcomers
model
Model specific
#2868
opened Aug 29, 2023 by
maddes8cht
Test replit-code-v1-3b model
help wanted
Extra attention is needed
model
Model specific
#1299
opened May 3, 2023 by
abetlen